Podcasts about glm

  • 98 podcasts
  • 217 episodes
  • 49m average duration
  • 5 new episodes weekly
  • Latest: Feb 27, 2026

POPULARITY (chart by year, 2019 to 2026)


Latest podcast episodes about glm

This Day in AI Podcast
Nano Banana 2 is Here! Gemini-3 Shutdown & The AI Layoff Myth | EP99.36

Feb 27, 2026 · 62:09


Join us on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80
Join Simtheory: https://simtheory.ai
TDIA Discord: https://discord.gg/gTW4RkAJvn
Horse Egg Lifecycle Infographic: https://staging.simtheory.ai/share/file/UZ2KJU
----
So Chris, this week... we're diving into Google's new Nano Banana 2 image model - 50% cheaper and supposedly faster (when the servers aren't melting). We put it through its paces with annotation-based editing, slide generation, and yes, the return of the legendary horse egg experiment.
Plus: Google quietly kills Gemini-3 after just a few months (good riddance?), we discuss why the model was "dead on arrival" for agentic workflows, and break down the real story behind those massive AI layoff announcements from Block and WiseTech. Spoiler: it's probably not actually about AI.
We also get into the current state of the model wars (Opus 4.6 vs Codex 5.3), why smaller models like GLM-5 might be the future for enterprise agentic tasks, and Chris's wife teaching Claude to literally speak to her using Mac's text-to-speech. The models are getting creative.
---
0:00 - Intro
0:36 - Nano Banana 2: Price, Speed & First Impressions
3:19 - The Compositing Problem & Last Mile Design
5:41 - Annotation-Based Editing (This Changes Everything)
9:52 - Slide Editing & Real-World Use Cases
12:34 - The Horse Egg Experiment Returns
14:30 - Image Degradation & Cost Breakdown
17:47 - Text-to-Image Leaderboard Discussion
20:01 - Why Nano Banana Dominates for Work
22:07 - Codex 5.3 vs Opus 4.6
22:54 - Google Kills Gemini-3 (What Went Wrong?)
26:48 - Google's Agentic Problem
30:08 - The Model Loyalty Cycle
34:22 - Why Opus 4.6 is Still the Best
37:05 - Cost Optimization & Smart Model Routing
43:30 - When Models Get Stuck on the Wrong Path
45:36 - Nicole's AI Learns to Talk Back
46:54 - Can Anyone Build Software Now?
52:26 - Anthropic's Legal/Finance Plugins & Market Panic
57:08 - Block Lays Off 4,000: AI or Excuse?
1:00:05 - The AI Job Apocalypse Isn't Real
Thanks for listening, like and sub xoxo

天方烨谈
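From DeepSeek to Seedance: Every Generation Produces New Talent, Yet the Hunger for Compute Remains

Feb 21, 2026 · 4:27


AI's evolution has never been a slow climb; it comes in sudden leaps. Chinese large models erupted in a dense burst, with six vendors releasing within three days: from Seedance 2.0's film-grade generation to the open-source GLM-5 topping the rankings, the parameter race is shifting toward real-world deployment capability, and Chinese AI is reshaping the global competitive landscape with industrial-grade solutions.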

Marketing Over Coffee Marketing Podcast
Year of the Horse Triggers AI Race!

Feb 20, 2026


In this Marketing Over Coffee: Learn about the latest updates, Zwift, the future of movies, and more!
Happy Lunar New Year!
The Release Battle Rages!
Moonshot Kimi K2.5, Z.ai GLM-5, Alibaba Qwen-3.5, All still open weights
ByteDance Seedance 2.0 Video now too real
The future of Movies
Chris writes a trashy […]

The top AI news from the past week, every ThursdAI

Hey, it's Alex, let me catch you up! Since last week, OpenAI convinced OpenClaw founder Peter Steinberger to join them, while keeping OpenClaw... well... open. Anthropic dropped Sonnet 4.6, which nearly outperforms the previous Opus and is much cheaper. Qwen released 3.5 on Chinese New Year's Eve while DeepSeek stayed silent, and Elon and the xAI folks deployed Grok 4.20 without any benchmarks, and it's four 500B models in a trenchcoat? Also, Anthropic's updated rules state that it's breaking ToS to use their plans for anything except Claude Code & Claude SDK (and then they clarified that it's OK? we're not sure). Then Google decided to drop their Gemini 3.1 Pro preview right at the start of our show, and it's very nearly the best LLM folks can use right now (though it didn't pass Nisten's vibe checks). Also, Google released Lyria 3 for music gen (though only 30 seconds?), our own Ryan Carson blew up on X again with over 1M views for his Code Factory article, Wolfram did a deep dive into Terminal Bench, and... we have a brand new website: https://thursdai.news

Let's Talk AI
#235 - Opus 4.6, GPT-5.3-codex, Seedance 2.0, GLM-5

Feb 16, 2026 · 90:33


Our 235th episode with a summary and discussion of last week's big AI news!
Recorded on 01/02/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai
Read our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
* Major model launches include Anthropic's Opus 4.6 with a 1M-token context window and “agent teams,” OpenAI's GPT-5.3 Codex and faster Codex Spark via Cerebras, and Google's Gemini 3 Deep Think posting big jumps on ARC-AGI-2 and other STEM benchmarks amid criticism about missing safety documentation.
* Generative media advances feature ByteDance's Seedance 2.0 text-to-video with high realism and broad prompting inputs, new image models Seedream 5.0 and Alibaba's Qwen Image 2.0, plus xAI's Grok Imagine API for text/image-to-video.
* Open and competitive releases expand with Zhipu's GLM-5, DeepSeek's 1M-token context model, Cursor Composer 1.5, and open-weight Qwen3 Coder Next using hybrid attention aimed at efficient local/agentic coding.
* Business updates include ElevenLabs raising $500M at an $11B valuation, Runway raising $315M at a $5.3B valuation, humanoid robotics firm Apptronik raising $935M at a $5.3B valuation, Waymo announcing readiness for high-volume production of its 6th-gen hardware, plus industry drama around Anthropic's Super Bowl ad and departures from xAI.
Timestamps:
(00:00:10) Intro / Banter
(00:02:03) Sponsor Break
(00:05:33) Response to listener comments
Tools & Apps
(00:07:27) Anthropic releases Opus 4.6 with new 'agent teams' | TechCrunch
(00:11:28) OpenAI's new GPT-5.3-Codex is 25% faster and goes way beyond coding now - what's new | ZDNET
(00:25:30) OpenAI launches new macOS app for agentic coding | TechCrunch
(00:26:38) Google Unveils Gemini 3 Deep Think for Science & Engineering | The Tech Buzz
(00:31:26) ByteDance's Seedance 2.0 Might be the Best AI Video Generator Yet - TechEBlog
(00:35:14) China's ByteDance, Alibaba unveil AI image tools to rival Google's popular Nano Banana | South China Morning Post
(00:36:54) DeepSeek boosts AI model with 10-fold token addition as Zhipu AI unveils GLM-5 | South China Morning Post
(00:43:11) Cursor launches Composer 1.5 with upgrades for complex tasks
(00:44:03) xAI launches Grok Imagine API for text and image to video
Applications & Business
(00:45:47) Nvidia-backed AI voice startup ElevenLabs hits $11 billion valuation
(00:52:04) AI video startup Runway raises $315M at $5.3B valuation, eyes more capable world models | TechCrunch
(00:54:02) Humanoid robot startup Apptronik has now raised $935M at a $5B+ valuation | TechCrunch
(00:57:10) Anthropic says 'Claude will remain ad-free,' unlike an unnamed rival | The Verge
(01:00:18) Okay, now exactly half of xAI's founding team has left the company | TechCrunch
(01:04:03) Waymo's next-gen robotaxi is ready for passengers - and also 'high-volume production' | The Verge
Projects & Open Source
(01:04:59) Qwen3-Coder-Next: Pushing Small Hybrid Models on Agentic Coding
(01:08:38) OpenClaw's AI 'skill' extensions are a security nightmare | The Verge
Research & Advancements
(01:10:40) Learning to Reason in 13 Parameters
(01:16:01) Reinforcement World Model Learning for LLM-based Agents
(01:20:00) Opus 4.6 on Vending-Bench – Not Just a Helpful Assistant
Policy & Safety
(01:22:28) METR GPT-5.2
(01:26:59) The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence and Task Complexity?

DOU Podcast
Anthropic Is Worth $380 Billion | Blackmail via Starlink | DevOps Salaries | DOU News #237

Feb 16, 2026 · 28:59


In the latest DOU News digest, we discuss how Russia is forcing the families of prisoners of war to register Starlink terminals in their own names. In the AI world there is a real investment boom: Anthropic is raising $30 billion, while OpenAI faces yet another scandal over ads in ChatGPT. Also in this episode: the fate of the Tabletki.ua team after the deal with Kyivstar, problems with the new Siri, and news about GTA VI. Watch these and other stories from the Ukrainian and global tech sector.
Timecodes
00:00 Intro
00:21 DevOps salaries: fresh market analytics
01:36 Blackmail with prisoners of war: Russia forces relatives to register Starlink
02:39 Registration for Algorithms in practice from CS Osvita
03:32 The fate of the Tabletki.ua team after the deal with Kyivstar
05:20 Volodymyr Fryz, QC Engineer at SoftServe, was killed in the war
05:58 Anthropic raised an additional $30 billion in its Series G round
08:28 An OpenAI researcher resigned over ads in ChatGPT
12:04 Arsenal of Talents: a Defense Tech job fair from DOU and LobbyX
13:08 "Deploying from a minibus": Spotify's CEO on how AI replaced code
17:03 GLM-5: from vibe coding to agentic engineering
19:53 The Siri update in iOS 26.4: testing problems and delays
21:46 The perfect PR that got rejected: why companies fear AI-written code
24:53 Take-Two reports a record $1.76 billion for the quarter and the status of GTA VI
26:54 What Zhenya recommends: Analyzecore and the article "AI doesn't reduce work, it intensifies it"

AIA Podcast
Moltmania Continues, New GPT-5.3 Codex, Opus 4.6 and GLM 5, AI Safety Report 2026 / ПНВ #403

Feb 16, 2026 · 140:25


Today we talk about the rise of Moltbook, a social network for bots where AIs complain about their owners and found their own religions, and about the large-scale expansion of AI infrastructure into space announced by Elon Musk. Also: Codex 5.3 and Opus 4.6, GLM 5, Qwen Coder Next, the sale of the AI.com domain for 70 million dollars, Waymo's "smart" simulators, and a report on the future of AI in 2026.

Late Tech Show
The podcast version of the #Techy newsletter of 16/2/2026

Feb 16, 2026 · 17:59


The automatic podcast version of the #Techy newsletter of 16/2/2026. The technology landscape is changing at unprecedented speed. If you work in digital, these are the trends and data you cannot ignore if you want to stay relevant in 2025. Here is a concise analysis of what is happening: 1. The Software Crisis (SaaSgeddon)

What's Next|科技早知道
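From "Pony Alpha" to GLM-5: Zhipu's Strategy Across the Large-Model Cycle | S9E46

Feb 14, 2026 · 27:57


Last week, an anonymous model code-named "Pony Alpha" suddenly shot up the OpenRouter rankings. Its impressive performance in coding and agent scenarios set many developers guessing about its true identity. Chinese large-model company Zhipu then confirmed that Pony Alpha is GLM-5, the new model Zhipu had just released, with major gains in reasoning, programming, and tool calling, and performance approaching that of top-tier closed-source models.
Zhipu has always held a distinctive position in China's AI ecosystem. Its founders come from China's top universities; it has produced a large number of AI entrepreneurs and engineers, and many in the industry call it "the Whampoa Military Academy of Chinese AI talent." It is also one of the first large-model companies to go public, standing in the spotlight of the capital markets while constantly seeking a balance between its developer ecosystem and its business. In this episode we invited 李子玄, head of Zhipu Z.ai, to talk about the global strategy behind Pony Alpha's anonymous release and GLM-5's technical upgrades, Zhipu's position and advantages in the global large-model competition, and the innovations and challenges on Zhipu's path to commercializing large models.
In this episode
丁教 Diane, co-founder of 声动活泼 and host of 科技早知道
周玖洲 Aaron, ten years of experience at top investment institutions including CICC and China Asset Management, host of 不止金钱
李子玄, head of Zhipu Z.ai
Main topics
[03:15] From "Pony Alpha" to GLM-5: why release anonymously?
  • Landing on OpenRouter as an anonymous model to earn recognition on merit first
  • Overseas developers' opinions feed back strongly into the domestic market
  • The anonymous model was free to use, bringing huge traffic and exposure
[07:07] What makes GLM-5 strong?
  • An emphasis on all-round capability: reasoning, coding, agents, general Q&A
  • Stable performance in tool calling and real engineering scenarios
  • The goal is to be "strong enough that users overlook the shortcomings"
[09:03] Large-model competition is unpredictable; agility matters more than planning
  • With a new model generation every three months, the window of opportunity for vendors is extremely short
  • The real competitive advantage is a team that stays extremely agile
[12:03] Why is it hard for Chinese large models to make money from consumer (ToC) subscriptions?
  • Big tech's free-of-charge strategy has changed market expectations
  • Model capabilities are similar, so chat alone struggles to build a willingness to pay
[14:58] How does Zhipu practice commercial innovation?
  • Choose the model first, then the shell, changing the traditional tool-subscription logic
  • Subscribe by service rather than bill by token
  • The core goal is to increase user stickiness and reduce the risk of price swings
[18:43] Structural challenges of cost, compute, and model iteration
  • Launching a new model squeezes the compute available for the next generation; pricing and compute supply and demand fluctuate heavily
  • Preventing black- and gray-market abuse is as important as improving engineering efficiency
[22:07] What exactly is PMF? Has Zhipu found its inflection point?
  • Coding may be the direction closest to PMF right now
  • Agents may be the longer-term battleground
[24:11] Coding is holding the ground; agents are the siege
  • Efforts such as AutoGLM were laid out in advance
  • The key is mature tool calling and automation capability
  • The opportunity may arrive suddenly, provided there is sustained accumulation
Behind the scenes
Executive producer: Yaxian. Post-production: 迪卡. Operations: George. Design: 饭团.
Business cooperation
Reach the 声动活泼 commercial team via this link (https://sourl.cn/9h28kj), or email business@shengfm.cn.
Join 声动活泼
声动活泼 is currently hiring business-cooperation interns, community-operations interns, and a BD manager; see the recruitment page for details.
About 声动活泼
"Colliding with the world through sound": 声动活泼 is committed to giving people a steady supply of food for thought. We also make these podcasts: 声动早咖啡, 声东击西, 吃喝玩乐了不起, 反潮流俱乐部, 泡腾 VC, 商业WHY酱, 跳进兔子洞, 不止金钱. You are welcome to interact with us on 即刻, Weibo, and other social media; search for 声动活泼 to find us. We look forward to your emails at ting@sheng.fm. Scan the QR code to add 声小音 and stay in touch outside the show.
Special Guest: 李子玄.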

HTML All The Things - Web Development, Web Design, Small Business
Web News: AI Competition is Out Of Control

Feb 14, 2026 · 26:27


The pace of AI model releases is becoming almost impossible to follow. In just two weeks we saw GPT-5.3-Codex, GPT-5.2 updates, Gemini 3 Deep Think upgrades, Claude Opus 4.6 with a 1M context window in beta, Qwen3-Coder-Next, GLM-5, MiniMax M2.5, Cursor Composer 1.5, and even Kimi 2.5 just outside the window. This isn't a quarterly product cycle anymore - it's a daily arms race. In this episode Matt and Mike break down what this acceleration means for developers, open source, frontier labs, and the broader industry. Are we witnessing healthy innovation, or unsustainable velocity? At what point does this stabilize - if it ever does? If you're trying to build, learn, or compete in AI right now… this conversation is for you.
Show Notes: https://www.htmlallthethings.com/podcast/ai-competition-is-out-of-control

AI For Humans
Seedance 2.0 Is Peak AI Video. We Tested It. Send Help.

Feb 13, 2026 · 62:43


Seedance 2.0 is the best AI video model we've ever seen. Bytedance's new AI tool generates 15 second clips of basically anything. But what happens next? And where do we go from here? Gavin got early access before it got locked down. We tested it with original animation, fake sitcoms, anime, and a McDonald's ad. The results are genuinely shocking - multi-shot editing, cinematic camera work, and real celebrity voices coming straight out of the model. People are making fake Seinfeld episodes, Avengers deleted scenes, and Rocky working at a fast food restaurant with Optimus Prime. Plus two new Chinese LLMs beating American models on benchmarks, Google Deep Think scores 84% on ARC-AGI, OpenAI's Codex Spark model, that viral AI post your mom sent you, and Kevin loses his mind building an Open Claw agent named Mr. Tibs who now requests server upgrades at 3am. HOLLYWOOD LAWYERS ARE GOING TO HAVE A VERY INTERESTING YEAR. IT'S FINE.
Come to our Discord: https://discord.gg/muD2TYgC8f
Join our Patreon: https://www.patreon.com/AIForHumansShow
AI For Humans Newsletter: https://aiforhumans.beehiiv.com/
Follow us for more on X @AIForHumansShow
Join our TikTok @aiforhumansshow
To book us for speaking, please visit our website: https://www.aiforhumans.show/
// Show Links //
Seedance 2.0 - Bytedance's New AI Video Best Since Sora 2: https://seed.bytedance.com/en/seedance2_0 https://www.reuters.com/business/media-telecom/bytedances-new-ai-video-model-goes-viral-china-looks-second-deepseek-moment-2026-02-12/
The Seinfeld Test: https://x.com/apples_jimmy/status/2021351821718225330?s=20
Even better The Seinfeld Fight: https://x.com/itspoidaman/status/2021409465355075655?s=20
Wolverine vs Thanos: https://x.com/AndrewCurran_/status/2021979655130296487?s=20
Avengers Endgame: https://x.com/cfryant/status/2021398605278376201?s=20
Tom Cruise vs Brad Pitt: https://x.com/RuairiRobinson/status/2021394940757209134?s=20
Ethan Hunt vs John Wick: https://x.com/chatcutapp/status/2021902856367092108?s=20
Gavin's Game of Friends Sitcom Test: https://x.com/gavinpurcell/status/2021418263432032635?s=20
Gavin's Rocky Balboa & Optimus Prime Test: https://x.com/gavinpurcell/status/2021429329012650045?s=20
Original Animation style: https://x.com/gavinpurcell/status/2021396803787383254?s=20
Anime test: https://x.com/gavinpurcell/status/2021732810554507352?s=20
Comparing AI McDonald's Ads: https://x.com/AIForHumansShow/status/1802715910488400047?s=20
GLM-5 from Z.AI: https://x.com/Zai_org/status/2021638634739527773 https://z.ai/blog/glm-5
GLM-5 Gameboy Emulation: https://x.com/Zai_org/status/2021754659590033565?s=20
New Minimax 2.5 Model: https://x.com/MiniMax_AI/status/2021980761210134808?s=20
CODEX SPARK - Faster Codex: https://openai.com/index/introducing-gpt-5-3-codex-spark/
New Google Deep Think Model CRUSHES Benchmarks: https://x.com/GoogleDeepMind/status/2021981510400709092?s=20
The 'Something Big Is Happening' Post Heard Round The World: https://x.com/mattshumer_/status/2021256989876109403?s=20
Gavin's Simultaneous Take (in Monday's newsletter too): https://x.com/gavinpurcell/status/2021292314291999182
AI 2027: https://ai-2027.com/
One-Shot Otter Anime From Ethan Mollick: https://x.com/emollick/status/2021412306291392535
The Best Prompt Ever Seedance: https://x.com/Gossip_Goblin/status/2021468902220497061?s=20
MattVidPro's Shrek & Donkey Crash a Honda Accord: https://x.com/MattVidPro/status/2021739211674566808?s=20

This Day in AI Podcast
Am I Even Needed Anymore? GLM-5, Agentic Loops & AI Productivity Psychosis - EP99.34

Feb 13, 2026 · 63:07


Join Simtheory: https://simtheory.ai
Register for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80
GLM-5 just dropped and it's trained entirely on Huawei chips – zero US hardware dependency. Meanwhile, we're having existential crises about whether we're even needed anymore. In this episode, we break down China's new frontier model that's competing with Opus 4.6 and Codex at a fraction of the price, why agentic loops are making 200K context windows the sweet spot (sorry, million-token dreams), and the very real phenomenon of AI productivity psychosis. We dive into why coding-optimized models are secretly winning at everything, the Harvard study confirming AI doesn't reduce work – it intensifies it, and the exodus of safety researchers from XAI, Anthropic, and OpenAI (spoiler: they're not giving back their shares). Plus: Mike's arm is failing from too much mouse usage, we debate whether the chatbot era is actually fading, and yes – there's a safety researcher diss track called "Is This The End?"
CHAPTERS:
0:00 Intro - Is This The End? (Song Preview)
0:11 Still Relevant Tour Update & NASA Listener Callout
1:42 AI Productivity Psychosis: The Pressure of Infinite Capability
4:25 GLM-5 Breakdown: China's New Frontier Model on Huawei Chips
7:24 First Impressions: GLM-5 in Agentic Loops
9:48 Why Cheap Models Matter & The New Model War
14:09 Codex Vibe Shift: Is OpenAI Winning?
16:24 Does Context Window Size Even Matter Anymore?
22:27 The Parallelization Problem & Cognitive Overload
27:27 Mike's Arm Injury & The Voice Input Pivot
31:17 Single-Threaded Work & The 95% Problem
35:06 UX is Unsolved: Rolling Back Agentic Mistakes
38:45 Harvard Study: AI Doesn't Reduce Work, It Intensifies It
44:01 How AI Erodes Company Structure & Why Adoption Takes Years
50:14 My AI vs Your AI: Household Debates
50:43 The Safety Researcher Exodus: XAI, Anthropic, OpenAI
56:49 Final Thoughts: Are We All Still Relevant?
59:04 BONUS: Full "Is This The End?" Diss Track
Thanks for listening. Like & Sub. Links above for the Still Relevant Tour signup and Simtheory. GLM-5 is here, your productivity psychosis is valid, and the safety researchers are becoming poets. xoxo

The top AI news from the past week, every ThursdAI

Hey dear subscriber, Alex here from W&B, let me catch you up! This week started with Anthropic releasing /fast mode for Opus 4.6, continued with ByteDance's reality-shattering video model called SeeDance 2.0, and then the open weights folks pulled up! Z.ai released GLM-5, a 744B top ranking coder beast, and then today MiniMax dropped a heavily RL'd MiniMax M2.5, showing 80.2% on SWE-bench, nearly beating Opus 4.6! I've interviewed Lou from Z.AI and Olive from MiniMax on the show today back to back btw, very interesting conversations, starting after TL;DR!
So while the open-source models were catching up to the frontier, OpenAI and Google both dropped breaking news (again, during the show), with Gemini 3 Deep Think shattering ARC-AGI-2 (84.6%) and Humanity's Last Exam (48% w/o tools)... Just an absolute beast of a model update. OpenAI launched their Cerebras collaboration with GPT 5.3 Codex Spark, supposedly running at over 1000 tokens per second (but not as smart). Also, crazy week for us at W&B as we scrambled to host GLM-5 on the day of release, and we are working on dropping Kimi K2.5 and MiniMax both on our inference service! As always, all show notes are at the end, let's DIVE IN!
ThursdAI - AI is speeding up, don't get left behind! Sub and I'll keep you up to date with a weekly catch up
Open Source LLMs
Z.ai launches GLM-5 - #1 open-weights coder with 744B parameters (X, HF, W&B inference)
The breakaway open-source model of the week is undeniably GLM-5 from Z.ai (formerly known to many of us as Zhipu AI). We were honored to have Lou, the Head of DevRel at Z.ai, join us live on the show at 1:00 AM Shanghai time to break down this monster of a release.
GLM-5 is massive, not something you run at home (hey, that's what W&B inference is for!) but it's absolutely a model that's worth thinking about if your company has on-prem requirements and can't share code with OpenAI or Anthropic. They jumped from 355B in GLM-4.5 and expanded their pre-training data to a whopping 28.5T tokens to get these results. But Lou explained that it's not only about data: they adopted DeepSeek's sparse attention (DSA) to help preserve deep reasoning over long contexts (this one has 200K).
Lou summed up the generational leap from version 4.5 to 5 perfectly in four words: “Bigger, faster, better, and cheaper.” I dunno about faster, this may be one of those models that you hand off more difficult tasks to, but definitely cheaper, with $1 input/$3.20 output per 1M tokens on W&B! While the evaluations are ongoing, the one interesting tidbit from Artificial Analysis was that this model scores the lowest on their hallucination rate bench! Think about this for a second: this model is neck and neck with Opus 4.5, and if Anthropic hadn't released Opus 4.6 just last week, this would be an open weights model that rivals Opus! One of the best models the western foundational labs, with all their investments, have out there. Absolutely insane times.
MiniMax drops M2.5 - 80.2% on SWE-bench verified with just 10B active parameters (X, Blog)
Just as we wrapped up our conversation with Lou, MiniMax dropped their release (though not weights yet, we're waiting ⏰) and then Olive Song, a senior RL researcher on the team, joined the pod, and she was an absolute wealth of knowledge! Olive shared that they achieved an unbelievable 80.2% on SWE-Bench Verified. Digest this for a second: a 10B active parameter open-source model is directly trading blows with Claude Opus 4.6 (80.8%) on one of the hardest real-world software engineering benchmarks we currently have. While being (Alex checks notes) ... 20X cheaper and much faster to run? Apparently their fast version gets up to 100 tokens/s.
Olive shared the “not so secret” sauce behind this punch-above-its-weight performance. The massive leap in intelligence comes entirely from their highly decoupled Reinforcement Learning framework called “Forge.” They heavily optimized not just for correct answers, but for the end-to-end time of task performing. In the era of bloated reasoning models that spit out ten thousand “thinking” tokens before writing a line of code, MiniMax trained their model across thousands of diverse environments to use fewer tools, think more efficiently, and execute plans faster. As Olive noted, less time waiting and fewer tools called means less money spent by the user (as confirmed by @swyx at the Windsurf leaderboard, developers often prefer fast but good enough models). I really enjoyed the interview with Olive, and really recommend you listen to the whole conversation starting at 00:26:15. Kudos MiniMax on the release (and I'll keep you updated when we add this model to our inference service).
Big Labs and breaking news
There's a reason the show is called ThursdAI, and today this reason is more clear than ever: AI's biggest updates happen on a Thursday, often live during the show. This happened 2 times last week and 3 times today, first with MiniMax and then with both Google and OpenAI!
Google previews Gemini 3 Deep Think, top reasoning intelligence SOTA Arc AGI 2 at 84% & SOTA HLE 48.4% (X, Blog)
I literally went

Techmeme Ride Home
Pour Moi, C'est Le Déluge

Feb 12, 2026 · 21:11


Aaannddd…. Right on time here come the Chinese AI models. Elon Musk kicks off a major reorg of xAI. Google is warning of AI distillation attacks. New Waymo cars hit the road. And another interesting AI essay to read to you.
Chinese AI startup Zhipu releases new flagship model GLM-5 (Reuters)
Musk announces xAI re-org following co-founder departures, SpaceX merger (CNBC)
Elon Musk Wants to Build an A.I. Satellite Factory on the Moon (NYTimes)
Google says attackers used 100,000+ prompts to try to clone AI chatbot Gemini (NBCNews)
Waymo begins deploying next-gen Ojai robotaxis to extend its U.S. lead (CNBC)
The AI Vampire (Steve Yegge)

Hacker News Recap
February 11th, 2026 | Claude Code is being dumbed down?

Feb 12, 2026 · 15:21


This is a recap of the top 10 posts on Hacker News on February 11, 2026. This podcast was generated by wondercraft.ai
(00:30): Claude Code is being dumbed down?
Original post: https://news.ycombinator.com/item?id=46978710&utm_source=wondercraft_ai
(01:57): Windows Notepad App Remote Code Execution Vulnerability
Original post: https://news.ycombinator.com/item?id=46971516&utm_source=wondercraft_ai
(03:25): Discord/Twitch/Snapchat age verification bypass
Original post: https://news.ycombinator.com/item?id=46982421&utm_source=wondercraft_ai
(04:52): Amazon Ring's lost dog ad sparks backlash amid fears of mass surveillance
Original post: https://news.ycombinator.com/item?id=46978966&utm_source=wondercraft_ai
(06:20): Chrome extensions spying on users' browsing data
Original post: https://news.ycombinator.com/item?id=46973083&utm_source=wondercraft_ai
(07:48): Fluorite – A console-grade game engine fully integrated with Flutter
Original post: https://news.ycombinator.com/item?id=46976911&utm_source=wondercraft_ai
(09:15): GLM-5: From Vibe Coding to Agentic Engineering
Original post: https://news.ycombinator.com/item?id=46977210&utm_source=wondercraft_ai
(10:43): Why vampires live forever
Original post: https://news.ycombinator.com/item?id=46976443&utm_source=wondercraft_ai
(12:11): Officials Claim Drone Incursion Led to Shutdown of El Paso Airport
Original post: https://news.ycombinator.com/item?id=46972610&utm_source=wondercraft_ai
(13:38): FAA closes airspace around El Paso, Texas, for 10 days, grounding all flights
Original post: https://news.ycombinator.com/item?id=46973647&utm_source=wondercraft_ai
This is a third-party project, independent from HN and YC. Text and audio generated using AI, by wondercraft.ai. Create your own studio quality podcast with text as the only input in seconds at app.wondercraft.ai. Issues or feedback? We'd love to hear from you: team@wondercraft.ai

AI Unraveled: Latest AI News & Trends, Master GPT, Gemini, Generative AI, LLMs, Prompting, GPT Store
AI Business and Development Daily News Rundown February 12 2026: Musk's Moon Factory, China's New Open-Source King, & Claude's "Sabotage Risk"

Feb 12, 2026 · 32:51


AI Unraveled: Latest AI News & Trends, Master GPT, Gemini, Generative AI, LLMs, Prompting, GPT Store
Teaser for AI Daily News Rundown February 12 2026: Musk's Moon Factory, China's New Open-Source King, & The "Sabotage Risk" (Ep. Brought to you by AIRIA)

Feb 12, 2026 · 1:55


Netcetera by Myosin.xyz
Octant & Giveth Are Proving Blockchain is Doing Real Good

Feb 10, 2026 · 59:17


In Episode 49 of Chain Reactions, we sit down with Mashal Waqar, Head of Marketing at Octant, and get a surprise drop-in from Griff Green, Founder of Giveth, to dig into how public goods funding actually works on Ethereum and why it matters more now than ever.
We cover:
– How Octant's model works: lock GLM, earn ETH, and choose to fund public goods or keep the yield
– The surprisingly heated debate over what counts as a "public good" (yes, Pizza DAO came up)
– Why blockchain unlocks speed, transparency, and community-driven capital allocation that traditional grants can't match
– Griff's wild story of The DAO hack, how edge case funds turned into $200M+, and the launch of the new DAO Security Fund
– The case for an Ethereum security coalition and why L2s need to fund shared infrastructure
Mashal shares her journey from running a media company with tens of millions of readers to burning out, discovering crypto through NFTs and Gitcoin, and co-authoring the first State of Web3 Grants report.
We also get into real-world impact stories, from funding water filters in Gaza to helping doctors in Syria get paid through crypto, and why sustainable funding through DeFi yield beats depleting treasuries. Plus, a great riff on AI in public goods, the Zakat use case for crypto, and why execution beats everything.
Timestamps
00:00 – Intro and what's on everyone's timeline right now
02:08 – Welcome to Chain Reactions and introducing Mashal from Octant
03:54 – Mashal's journey from media founder to crypto marketer
06:28 – How NFTs and Crypto Covens pulled her back into Web3
08:53 – Co-authoring the first State of Web3 Grants report and discovering Octant
10:25 – What Octant is and how the GLM staking model works
13:15 – What actually counts as a public good (and the Pizza DAO debate)
16:49 – The $1M Ethereum creator round and lessons from vetting 1,000+ applications
18:30 – DeFi vaults, sustainable funding, and the new StreamVote experiment
23:30 – Why blockchain unlocks faster, more transparent funding than traditional grants
26:34 – Remittances, financial access, and the personal case for crypto in emerging markets
33:24 – Griff joins: founding Giveth, The DAO hack, and rescuing $200M+ in edge case funds
39:21 – The multiplier effect and why matching makes it hard not to donate
44:18 – Launching the DAO Security Fund inspired by Octant's model
48:45 – AI experiments at Octant, building with AI, and the case for AI in public goods
56:29 – Vitalik's L2 tax tweet, Ethereum sustainability, and the need for a security coalition
1:00:00 – Rapid fire: execution beats everything and don't count your chickens
Show Notes & Mentions

Opanuj.AI Podcast
Doctor AI Is Coming - ChatGPT Health vs Google MedGemma, Anthropic's Constitution, and GLM-4.7 & KIMI K2.5 from China

Feb 1, 2026 · 83:04


ChatGPT Health vs Google MedGemma 1.5 - the giants of generative AI want to conquer the world of medicine. Will this soon be a real alternative to traditional healthcare? Another of the giants, Anthropic, is trying to give the technology a moral backbone by publishing a new constitution for Claude that defines a strict hierarchy of the model's values. Meanwhile in China, Moonshot AI boasts of having mastered "Agent Swarm": by orchestrating a "swarm" of agents, the company drastically speeds up complex programming tasks in KIMI K2.5. GLM-4.7 is also appearing on the horizon, striking at the Western giants with premium-class performance at many times lower cost. We ask whether these changes are a real democratization of knowledge or rather a risky game played over our most sensitive data. Comment, follow, and give us a 5/5 rating - thanks!

AIA Podcast
OpenClaw (ClawdBot) BLOWS UP THE INTERNET, the First Constitution for AI, and a 1 GW Data Center / ПНВ #402

Feb 1, 2026 · 175:24


Today we break down OpenAI's investment in ultrasound neural interfaces and Anthropic's new "Constitution" for Claude, look at Elon Musk's Colossus 2 mega-build with the power draw of two San Franciscos, and dig into the crisis in higher education through the example of 23-year-old self-taught hires on the Sora team. We explore the scientific editor Prism, the spying potential of Open Claw, and Pebble's disposable voice-recorder rings. At the end we discuss a Waymo driverless car hitting a child, the integration of Grok into Teslas, and the ethics of talking to AI with swear words. We also say goodbye to Viktor and welcome Viktoria!

The top AI news from the past week, every ThursdAI

Hey! Alex here, with another weekly AI update! It seems like ThursdAI is taking a new direction, as this is our 3rd show this year and our 3rd deep dive into a topic (previously Ralph and Agent Skills). Please let me know in the comments if you like this format. This week's deep dive is into Clawdbot, a personal AI assistant that you install on your computer but can control through your phone. It has access to your files, can write code and help organize your life, and, most importantly, it can self-improve. Seeing Wolfred (my Clawdbot) learn to transcribe incoming voice messages blew my mind, and I wanted to share this one with you at length! We had Dan Peguine on the show for the deep dive, and both Wolfram and Yam are avid users! This one is not to be missed. If ThursdAI is usually too technical for you, use Claude, and install Clawdbot after you read/listen to the deep dive!

Also this week, we read Claude's Constitution that Anthropic released, heard a bunch of new TTS models (some are open source and very impressive), and talked about the new lightspeed coding model GLM 4.7 Flash. First the news, then the deep dive, let's go

Hacker News Recap
January 19th, 2026 | American importers and consumers bear the cost of 2025 tariffs: analysis

Hacker News Recap

Play Episode Listen Later Jan 20, 2026 15:36


This is a recap of the top 10 posts on Hacker News on January 19, 2026. This podcast was generated by wondercraft.ai

(00:30): American importers and consumers bear the cost of 2025 tariffs: analysis
Original post: https://news.ycombinator.com/item?id=46680212&utm_source=wondercraft_ai
(01:59): A decentralized peer-to-peer messaging application that operates over Bluetooth
Original post: https://news.ycombinator.com/item?id=46675853&utm_source=wondercraft_ai
(03:28): Radboud University selects Fairphone as standard smartphone for employees
Original post: https://news.ycombinator.com/item?id=46676276&utm_source=wondercraft_ai
(04:57): Amazon is ending all inventory commingling as of March 31, 2026
Original post: https://news.ycombinator.com/item?id=46678205&utm_source=wondercraft_ai
(06:26): Letter from a Birmingham Jail (1963)
Original post: https://news.ycombinator.com/item?id=46683205&utm_source=wondercraft_ai
(07:55): GLM-4.7-Flash
Original post: https://news.ycombinator.com/item?id=46679872&utm_source=wondercraft_ai
(09:24): Level S4 solar radiation event
Original post: https://news.ycombinator.com/item?id=46684056&utm_source=wondercraft_ai
(10:53): What came first: the CNAME or the A record?
Original post: https://news.ycombinator.com/item?id=46681611&utm_source=wondercraft_ai
(12:22): Show HN: I quit coding years ago. AI brought me back
Original post: https://news.ycombinator.com/item?id=46673809&utm_source=wondercraft_ai
(13:52): Apple testing new App Store design that blurs the line between ads and results
Original post: https://news.ycombinator.com/item?id=46680974&utm_source=wondercraft_ai

This is a third-party project, independent from HN and YC. Text and audio generated using AI, by wondercraft.ai. Create your own studio quality podcast with text as the only input in seconds at app.wondercraft.ai. Issues or feedback? We'd love to hear from you: team@wondercraft.ai

The top AI news from the past week, every ThursdAI
ThursdAI - Jan 8 - Vera Rubin's 5x Jump, Ralph Wiggum Goes Viral, GPT Health Launches & XAI Raises $20B Mid-Controversy

The top AI news from the past week, every ThursdAI

Play Episode Listen Later Jan 8, 2026 106:57


Hey folks, Alex here from Weights & Biases, with your weekly AI update (and the first live show of this year!) For the first time, a co-host of the show was also a guest on the show: Ryan Carson (from Amp) went supernova viral this week with an X article (1.5M views) about Ralph Wiggum (yeah, from The Simpsons), and he broke down that agentic coding technique at the end of the show. LDJ and Nisten helped cover NVIDIA's incredible announcements during CES with their upcoming Vera Rubin platform (4-5X improvements), and we all got excited about AI medicine with ChatGPT going into Health officially! Plus, a bunch of Open Source news, let's get into this:

ThursdAI - Recaps of the most high signal AI weekly spaces is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.

Open Source: The “Small” Models Are Winning

We often talk about the massive frontier models, but this week, Open Source news came largely from unexpected places and focused on efficiency, agents, and specific domains.

Solar Open 100B: A Data Masterclass

Upstage released Solar Open 100B, and it's a beast. It's a 102B parameter Mixture-of-Experts (MoE) model, but thanks to MoE magic, it only uses about 12B active parameters during inference. This means it punches well above its weight while running fast.

What I really appreciated here wasn't just the weights, but the transparency. They released a technical report detailing their “Data Factory” approach. They trained on nearly 20 trillion tokens, with a huge chunk being synthetic. They also used a dynamic curriculum that adjusted the difficulty and the ratio of synthetic data as training progressed. This transparency is what pushes the whole open source community forward.

Technically, it hits 88.2 on MMLU and competes with top-tier models, especially in Korean language tasks.
You can grab it on Hugging Face.

MiroThinker 1.5: The DeepSeek Moment for Agents?

We also saw MiroThinker 1.5, a 30B parameter model that is challenging the notion that you need massive scale to be smart. It uses something they call “Interactive Scaling.”

Wolfram broke this down for us: this agent forms hypotheses, searches for evidence, and then iteratively revises its answers in a time-sensitive sandbox. It effectively “thinks” before answering. The result? It beats trillion-parameter models on search benchmarks like BrowseComp. It's significantly cheaper to run, too. This feels like the year where smaller models + clever harnesses (the software wrapping the model) will outperform raw scale.

Liquid AI LFM 2.5: Running on Toasters (Almost)

We love Liquid AI and they are great friends of the show. They announced LFM 2.5 at CES with AMD, and these are tiny ~1B parameter models designed to run on-device. We're talking about running capable AI on your laptop, your phone, or edge devices (or the Reachy Mini bot that I showed off during the show! I gotta try and run LFM on him!)

Probably the coolest part is the audio model. Usually, talking to an AI involves a pipeline: Speech-to-Text (ASR) -> LLM -> Text-to-Speech (TTS). Liquid's model is end-to-end. It hears audio and speaks audio directly. We watched a demo from Maxime Labonne where the model was doing real-time interaction, interleaving text and audio. It's incredibly fast and efficient. While it might not write a symphony for you, for on-device tasks like summarization or quick interactions, this is the future.

NousCoder-14B and Zhipu AI IPO

A quick shoutout to our friends at Nous Research who released NousCoder-14B, an open-source competitive programming model that achieved a 7% jump on LiveCodeBench accuracy in just four days of RL training on 48 NVIDIA B200 GPUs.
The model was trained on 24,000 verifiable problems, and the lead researcher Joe Li noted it achieved in 4 days what took him 2 years as a teenager competing in programming contests. The full RL stack is open-sourced on GitHub and Nous published a great WandB results page as well!

And in historic news, Zhipu AI (Z.ai), the folks behind the GLM series, became the world's first major LLM company to IPO, raising $558 million on the Hong Kong Stock Exchange. Their GLM-4.7 currently ranks #1 among open-source and domestic models on both Artificial Analysis and LM Arena. Congrats to them!

Big Companies & APIs

NVIDIA CES: Vera Rubin Changes Everything

LDJ brought the heat on this one, covering Jensen's CES keynote that unveiled the Vera Rubin platform, and the numbers are almost hard to believe. We're talking about a complete redesign of six chips: the Rubin GPU delivering 50 petaFLOPS of AI inference (5x Blackwell), the Vera CPU with 88 custom Olympus ARM cores, NVLink 6, ConnectX-9 SuperNIC, BlueField-4 DPU, and Spectrum-6 Ethernet.

Let me put this in perspective using LDJ's breakdown: if you look at FP8 performance, the jump from Hopper to Blackwell was about 5x. The jump from Blackwell to Vera Rubin is over 3x again, and here's the kicker: it adds only about 200 watts of power draw. That's an insane efficiency improvement.

The real-world implications Jensen shared: training a 10 trillion parameter mixture-of-experts model now requires 75% fewer GPUs compared to Blackwell. Inference token costs drop roughly 10x: a 1MW cluster goes from 1 million to 10 million tokens per second at the same power. HBM4 memory delivers 22 TB/s bandwidth with 288GB capacity, exceeding NVIDIA's own 2024 projections by nearly 70%.

As Ryan noted, when people say there's an AI bubble, this is why it's hilarious. Jensen keeps saying the need for inference is unbelievable and only going up exponentially. We all see this. I can't get enough inference. I want to spin up 10 Ralphs running concurrently!
The NVL72 rack-scale system achieves 3.6 exaFLOPS inference with 20.7TB total HBM, and it's already shipping. Runway 4.5 is already running on the new platform, having ported their model from Hopper to Vera Rubin NVL72 in a single day.

NVIDIA also recently acqui-hired Groq (with a Q) in a ~$20 billion deal, bringing in-house the inference chip expertise of the guy who created Google's TPUs.

Nemotron Speech ASR & The Speed of Voice (X, HF, Blog)

NVIDIA also dropped Nemotron Speech ASR. This is a 600M parameter model that offers streaming transcription with 24ms latency.

We showed a demo from our friend Kwindla Kramer at Daily. He was talking to an AI, and the response was virtually instant. The pipeline is: Nemotron (hearing) -> Llama/Nemotron Nano (thinking) -> Magpie TTS (speaking). The total latency is under 500ms. It feels like magic. Instant voice agents are going to be everywhere this year.

XAI Raises $20B While Grok Causes Problems (Again)

So here's the thing about covering anything Elon-related: it's impossible to separate signal from noise because there's an army of fans who hype everything and an army of critics who hate everything. But let me try to be objective here.

XAI raised another massive round, a $20 billion Series E at a $230 billion valuation, with NVIDIA and Cisco as strategic investors. The speed of their infrastructure buildout is genuinely incredible. Grok's voice mode is impressive. I use Grok for research and it's really good, notable for its unprecedented access to X!

But. This raise happened in the middle of a controversy where Grok's image model was being used to “put bikinis” on anyone in reply threads, including (and this is where I draw a hard line) minors. As Nisten pointed out on the show, it's not even hard to implement guardrails. You just put a 2B VL model in front and ask “is there a minor in this picture?” But people tested it, asked Grok not to use the feature, and it did it anyway.
And yeah, putting a bikini on Claude is funny, but basic moderation is lacking! The response of “we'll prosecute illegal users” is stupid when there's no moderation built into the product. There's an enormous difference between Photoshop technically being able to do something after hours of work, and a feature that generates edited images in one second as the first comment to a celebrity, then gets amplified by the platform's algorithm to millions of people. One is a tool. The other is a product with amplification mechanics. Products need guardrails. I don't often link to CNN (in fact this is the first time) but they have a great writeup about the whole incident here, which apparently includes a few trust and safety folks quitting and Elon's pushback on guardrails. Crazy.

That said, Grok 5 is in training and XAI continues to ship impressive technology. I just wish they'd put the same engineering effort into safety as they do into capabilities!

OpenAI Launches GPT Health

This one's exciting. OpenAI's CEO of Applications, Fidji Simo, announced ChatGPT Health, a privacy-first space for personalized health conversations that can connect to electronic health records, Apple Health, Function Health, Peloton, and MyFitnessPal.

Here's why this matters: health already represents about 5% of all ChatGPT messages globally and touches 25% of weekly active users, often outside clinic hours or in underserved areas. People are already using these models for health advice constantly.

Nisten, who has worked on AI doctors since the GPT-3 days and even published papers on on-device medical AI, gave us some perspective: the models have been fantastic for health stuff for two years now. The key insight is that medical data seems like a lot, but there are really only about 2,000 prescription drugs and 2,000 diseases (10,000 if you count rare ones). That's nothing for an LLM.
The models excel at pattern recognition across this relatively contained dataset.

The integration with Function Health is particularly interesting to me. Function does 160+ lab tests, but many doctors won't interpret them because they didn't order them. ChatGPT could help bridge that gap, telling you “hey, this biomarker looks off, you should discuss this with your doctor.” The bad news is that this is just a waitlist for now. You can add yourself to the waitlist here, and we'll keep monitoring the situation and let you know when it opens up.

Doctronic: AI Prescribing Without Physician Oversight

Speaking of healthcare, Doctronic launched a pilot in Utah where AI can autonomously renew prescriptions for chronic conditions without any physician in the loop. The system covers about 190 routine medications (excluding controlled substances) at just $4 per renewal. Trial data showed 99.2% concordance with physician treatment plans, and they've secured pioneering malpractice insurance that treats the AI like a clinician.

Nisten made the case that it's ethically wrong to delay this kind of automation when ER wait times keep increasing and doctors are overworked. The open source models are already excellent at medical tasks. Governments should be buying GPUs rather than creating administrative roadblocks. Strong, strong agree here!

Google Brings Gmail into the Gemini Era (X)

Breaking news from the day of our show: Google announced Gmail's biggest AI transformation since its 2004 launch, powered by Gemini 3. This brings AI Overviews that summarize email threads, natural language queries (“Who gave me a plumber quote last year?”), Help Me Write, contextual Suggested Replies matching your writing style, and the upcoming AI Inbox that filters noise to surface VIPs and urgent items.

For 3 billion Gmail users, this is huge.
I'm very excited to test it, though not live on the show because I don't want you reading my emails.

This week's buzz - covering Weights & Biases updates

Not covered on the show, but a great update from WandB: Chris Van Pelt (@vanpelt), one of the 3 co-founders, released a great project called Catnip that I wanted to tell you about! For coders, this is an app that allows you to run multiple Claude Codes on free GitHub sandboxes, so you can code (or Ralph) and control everything away from home! GitHub gives personal users 120 free Codespaces hours/month, and Catnip automatically shuts down inactive instances, so you can code for quite a while with Catnip! It's fully open source on GitHub and you can download the app here.

Interview: Ryan Carson - What the hell is Ralph Wiggum?

Okay, let's talk about the character everyone is seeing on their timeline: Ralph Wiggum. My co-host Ryan Carson went viral this week with an article about this technique, and I had to have him break it down.

Ralph isn't a new model; it's a technique for running agents in a loop to perform autonomous coding. The core idea is deceptively simple: Ralph is a bash script that runs an AI coding agent in a loop until a certain condition is met. But why is it blowing up? Normally when you use a coding agent like Cursor, Claude Code, or Amp, you need to be in the loop. You approve changes, look at code, and fix things when the agent hits walls or runs out of context. Ralph solves this by letting the agent run autonomously while you sleep.

Here's how it works: First, you write a Product Requirements Doc (PRD) by talking to your agent for a few minutes about what you want to build. Then you convert that PRD into a JSON file containing atomic user stories with clear acceptance criteria.
Each user story is small enough for the agent to complete in one focused thread.

The Ralph script then loops: it picks the first incomplete user story, the agent writes code to implement it, tests against the acceptance criteria, commits the changes, marks the story as complete, writes what it learned to a shared “agents.md” file, and loops to the next story. That compound learning step is crucial: without it, the agent would keep making the same mistakes.

What makes this work is the pre-work. As Ryan put it, “no real work is done one-shot.” This is how software engineering has always worked: you break big problems into smaller problems, then into user stories, and solve them incrementally. The innovation is letting AI agents work through that queue autonomously while you sleep! Ryan's excellent (and viral) X article is here!

Vision & Video

LTX-2 Goes Fully Open Source (HF, Paper)

Lightricks finally open-sourced LTX-2, marking a major milestone as the first fully open audio-video generation model. This isn't just “we released the weights” open: it's complete model weights (13B and 2B variants), distilled versions, controllable LoRAs, a full multimodal trainer, benchmarks, and evaluation scripts. For a video model that aims to be the open source Sora, it also supports audio and lip sync.

The model generates synchronized audio and video in a single DiT-based architecture, so motion, dialogue, ambience, and music flow simultaneously. Native 4K at up to 50 FPS with audio up to 10 seconds. And there's also a distilled version (Thanks Pruna AI!) hosted on Replicate.

ComfyUI provided day-0 native support, and community testing shows an A6000 generating 1280x720 at 120 frames in 50 seconds. This is near Sora-level quality that you can fine-tune on your own data for custom styles and voices in about an hour.

What a way to start 2026. From chips that are 5x faster to AI doctors prescribing meds in Utah, the pace is only accelerating.
If anyone tells you we're in an AI bubble, just show them what we covered today. Even if the models stopped improving tomorrow, the techniques like “Ralph” prove we have years of work ahead of us just figuring out how to use the intelligence we already have.

Thank you for being a ThursdAI subscriber. See you next week!

As always, here's the show notes and TL;DR links:

* Hosts & Guests
  * Alex Volkov - AI Evangelist & Weights & Biases (@altryne)
  * Co-Hosts - @WolframRvnwlf, @nisten, @ldjconfirmed
  * Special Guest - Ryan Carson (@ryancarson) breaking down the Ralph Wiggum technique.
* Open Source LLMs
  * Solar Open 100B - Upstage's 102B MoE model. Trained on 19.7T tokens with a heavy focus on “data factory” synthetic data and high-performance Korean reasoning (X, HF, Tech Report).
  * MiroThinker 1.5 - A 30B parameter search agent that uses “Interactive Scaling” to beat trillion-parameter models on search benchmarks like BrowseComp (X, HF, GitHub).
  * Liquid AI LFM 2.5 - A family of 1B models designed for edge devices. Features a revolutionary end-to-end audio model that skips the ASR-LLM-TTS pipeline (X, HF).
  * NousCoder-14B - competitive coding model from Nous Research that saw a 7% LiveCodeBench accuracy jump in just 4 days of RL (X, WandB Dashboard).
  * Zhipu AI IPO - The makers of GLM became the first major LLM firm to go public on the HKEX, raising $558M (Announcement).
* Big Co LLMs & APIs
  * NVIDIA Vera Rubin - Jensen Huang's CES reveal of the next-gen platform. Delivers 5x Blackwell inference performance and 75% fewer GPUs needed for MoE training (Blog).
  * OpenAI ChatGPT Health - A privacy-first vertical for EHR and fitness data integration (Waitlist).
  * Google Gmail Era - Gemini 3 integration into Gmail for 3 billion users, featuring AI Overviews and natural language inbox search (Blog).
  * XAI $20B Raise - Elon's XAI raises Series E at a $230B valuation, even as Grok faces heat over bikini-gate and safety guardrails (CNN Report).
  * Doctronic - The first US pilot in Utah for autonomous AI prescription renewals without a physician in the loop (Web).
  * Alexa+ Web - Amazon brings the “Smart Alexa” experience to browser-based chat (Announcement).
* Autonomous Coding & Tools
  * Ralph Wiggum - The agentic loop technique for autonomous coding using small, atomic user stories. Ryan Carson's breakdown of why this is the death of “vibe coding” (Viral X Article).
  * Catnip by W&B - Chris Van Pelt's open-source iOS app to run Claude Code anywhere via GitHub Codespaces (App Store, GitHub).
* Vision & Video
  * LTX-2 - Lightricks open-sources the first truly open audio-video generation model with synchronized output and full training code (GitHub, Replicate Demo).
  * Avatar Forcing - KAIST's framework for real-time interactive talking heads with ~500ms latency (Arxiv).
  * Qwen Edit 2512 - Optimized by PrunaAI to generate high-res realistic images in under 7 seconds (Replicate).
* Voice & Audio
  * Nemotron Speech ASR - NVIDIA's 600M parameter streaming model with sub-100ms stable latency for massive-scale voice agents (HF).

This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit sub.thursdai.news/subscribe
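
The Ralph loop described in the interview notes above can be sketched in a few lines. This is only an illustration, not Ryan Carson's actual script (which the notes describe as a bash script): the `ralph_loop` function, the `passes` field on each story, the `run_agent` callback, and the `max_iterations` cap are all assumptions made for the example, and a plain string stands in for the shared “agents.md” file.

```python
from typing import Callable

# Hypothetical agent callback: takes a story and the notes so far, and
# returns (acceptance criteria met?, what the agent learned this round).
Agent = Callable[[dict, str], tuple[bool, str]]


def ralph_loop(stories: list[dict], run_agent: Agent,
               max_iterations: int = 20) -> tuple[list[dict], str]:
    """Work through user stories one at a time until all pass or the budget runs out."""
    notes = ""  # stands in for the shared agents.md file
    for _ in range(max_iterations):
        pending = [s for s in stories if not s["passes"]]
        if not pending:
            break  # stop condition: every story is complete
        story = pending[0]  # pick the first incomplete story
        ok, learned = run_agent(story, notes)
        notes += learned + "\n"  # compound learning: later runs see earlier notes
        if ok:
            story["passes"] = True  # mark complete (a real script would also commit)
    return stories, notes


if __name__ == "__main__":
    # Stub agent for demonstration; a real one would invoke a coding agent CLI.
    def always_succeeds(story: dict, notes: str) -> tuple[bool, str]:
        return True, f"finished {story['id']}"

    backlog = [{"id": "US-1", "passes": False}, {"id": "US-2", "passes": False}]
    done, log = ralph_loop(backlog, always_succeeds)
    print([s["passes"] for s in done])
```

A failed story is simply retried on the next pass with the accumulated notes, which is the part the notes call crucial; the iteration cap is an added safeguard so an unattended run cannot loop forever.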

Let's Talk AI
#230 - 2025 Retrospective, Nvidia buys Groq, GLM 4.7, METR

Let's Talk AI

Play Episode Listen Later Jan 7, 2026 98:08


Our 230th episode with a summary and discussion of last week's big AI news!
Recorded on 01/02/2026

Hosted by Andrey Kurenkov and Jeremie Harris

Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

Read our text newsletter and comment on the podcast at https://lastweekin.ai/

In this episode:
Nvidia's acquisition of AI chip startup Groq for $20 billion highlights a strategic move for enhanced inference technology in GPUs.
New York's RAISE Act legislation aims to regulate AI safety, marking the second major AI safety bill in the US.
The launch of GLM 4.7 by Zhipu AI marks a significant advancement in open-source AI models for coding.
Evaluation of long-horizon AI agents raises concerns about the rising costs and efficiency of AI in performing extended tasks.

Timestamps:
(00:00:10) Intro / Banter
(00:01:58) 2025 Retrospective

Tools & Apps
(00:24:39) OpenAI bets big on audio as Silicon Valley declares war on screens | TechCrunch

Applications & Business
(00:26:39) Nvidia buying AI chip startup Groq for about $20 billion, biggest deal
(00:34:28) Exclusive | Meta Buys AI Startup Manus, Adding Millions of Paying Users - WSJ
(00:38:05) Cursor continues acquisition spree with Graphite deal | TechCrunch
(00:39:15) Micron Hikes CapEx to $20B with 2026 HBM Supply Fully Booked; HBM4 Ramps 2Q26
(00:42:06) Chinese fabs are reportedly upgrading older ASML DUV lithography chipmaking machines: secondary channels and independent engineers used to soup up Twinscan NXT series

Projects & Open Source
(00:47:52) Z.AI launches GLM-4.7, new SOTA open-source model for coding
(00:50:11) Evaluating AI's ability to perform scientific research tasks

Research & Advancements
(00:54:32) Large Causal Models from Large Language Models
(00:57:33) Universally Converging Representations of Matter Across Scientific Foundation Models
(01:02:11) META-RL INDUCES EXPLORATION IN LANGUAGE AGENTS
(01:07:16) Are the Costs of AI Agents Also Rising Exponentially?
(01:11:17) METR eval for Opus 4.5
(01:16:19) How to game the METR plot

Policy & Safety
(01:17:24) New York governor Kathy Hochul signs RAISE Act to regulate AI safety | TechCrunch
(01:20:40) Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers
(01:26:46) Monitoring Monitorability
(01:32:07) Sam Altman is hiring someone to worry about the dangers of AI | The Verge
(01:33:38) X users asking Grok to put this girl in bikini, Grok is happy obliging - India Today

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The Cloud Pod
336: We Were Right (Mostly), 2026: The New Prophecies

The Cloud Pod

Play Episode Listen Later Jan 6, 2026 68:15


Welcome to episode 335 of The Cloud Pod, where the forecast is always cloudy! Welcome to the first show of 2026, and it's a full house, too! Justin, Jonathan, Ryan, and Matt are all here to reflect on 2025, plus bring you their predictions for 2026. Let's get started!

Titles we almost went with this week
SQL Me Maybe: AlloyDB Gets Chatty With Your Database
**OpenAI SELECT * FROM natural_language WHERE accuracy LIKE ‘100%’ **Anthropic
etcd You Were Worried About Database Limits: CloudWatch Has Your Back
CSV You Later: Looker Adds Drag-and-Drop Data Uploads
AWS Spots an Opportunity to Manage Your Container Costs
EKS Network Policies: No More IP Address Whack-a-Mole
AWS Security Hub Splits: It’s Not You, It’s CSPM
Spot On: ECS Finally Manages Your Cheapest Compute
TOON Squad: DigitalOcean’s New Format Makes JSON Look Bloated
The Price is Wrong: AWS Breaks Two Decades of Downward Pricing Tradition
Show Your Work: Why AI-Generated Code Without Tests is Just Expensive Spam
No More Agent Orange: Google Simplifies VM Extension Deployment
AWS Discovers Prices Can Go Both Ways, Raises GPU Costs 15 Percent
Sovereignty Washing: When Your European Cloud Still Answers to Uncle Sam
Agent Builder Gets a Memory Upgrade: Google’s AI Finally Remembers Where It Put Its Keys
Ctrl+F for the Future: A year-end Scorecard & Next-Gen Bets
AI Agents, GPU Prices, and The best of the Cloud Pod 2025
Beyond the Hype: The Cloud Pods Definitive 2025 Year in Review
Apocalypse Now… What? Our 2026 Forecast

Follow Up

01:27 RYAN’S PREDICTIONS

Prediction: Quick LLM models for individuals
Status: ACCURATE
Notes: Meta-Llama-3.1-8B-Instruct, GLM-4-9B-0414, and Qwen2.5-VL-7B-Instruct, each chosen for an outstanding balance of performance and computational efficiency, making them ideal for edge AI deployment. A new AI inference application called Inferencer allows even modest Apple Mac computers to run the largest open-source LLMs.

Prediction: AI at the edge natively (Lambda-esque)
Status: ACCURATE
Notes: Akamai launched a new Inference Cloud product for edge AI using Nvidia’s Blackwell 6000 GPUs in 17 cities. AWS IoT Greengrass with Lambda functions for edge logic. “Edge AI allows for instant decision-making where it matters most, close to the data source.”

Prediction: Cloud native security mesh multi-cloud
Status: UNCLEAR
Notes: Service mesh technologies continue to evolve (Istio, Linkerd), but I didn’t find a breakthrough “app-to-app at the edge” security mesh product announcement in 2025. This one needs more specific evidence.

Ryan Score: 2/3

02:25 MATTHEW’S PREDICTIONS

Prediction: FOCUS adopted by Snowflake or Databricks
Status: ACCURATE
Notes: FOCUS version 1.2 was ratified on May 29, 2025. Three new providers announced support: Alibaba Cloud, Databricks, and Grafana. Databricks officially adopted FOCUS!

Prediction: AI security/ethical standard (SOC or ISO)
Status: ACCURATE
Notes: ISO 42001 is the first international standard outlining requirements for AI governance. Major companies achieving certification in 2025: Automation Anywhere is among the first 100 companies worldwide to earn ISO/IEC 42001:2023 certification. Anthropic also achieved ISO 42001 certification.

Prediction: Amazon deprecates 5+ services (WorkMail bonus)
Status: ACCURATE (no bonus)
Notes: 19 services are mothballed, four are being sunset, and one is end of its supported life. Deprecated services include CodeCommit, Cloud9, S3 Select, CloudSearch, SimpleDB, Forecast, Data Pipeline, QLDB, Snowball Edge, and more. WorkMail NOT deprecated – WorkDocs was (April 2025), but WorkMail remains active.

Matthew Score: 3/3

03:22 JONATHAN’S PREDICTIONS

Prediction: Company claims AGI achieved
Status: ACC

AI Unraveled: Latest AI News & Trends, Master GPT, Gemini, Generative AI, LLMs, Prompting, GPT Store

Welcome to AI Unraveled (December 30th, 2025): Your strategic briefing on the business, technology, and policy reshaping artificial intelligence.

Hardware & Industry Consolidation
Nvidia's $20B Dominance Play: In a massive move to secure its inference future, Nvidia has agreed to acquire key assets and employees from AI chip startup Groq for $20 billion. The deal is structured as an asset purchase and non-exclusive licensing agreement (likely to navigate antitrust scrutiny), allowing Nvidia to integrate Groq's ultra-fast LPU (Language Processing Unit) technology into its "AI Factory" roadmap.
Cursor Acquires Graphite:

Model Breakthroughs & Benchmarks
China's Z.ai Takes the Crown: Z.ai's new GLM-4.7 model has topped open-source benchmarks, reportedly outperforming GPT-5.1 High in coding tasks and introducing "Preserved Thinking" to prevent context decay in long agentic workflows.
Claude Opus 4.5's Stamina: A new analysis by evaluation firm METR reveals that Anthropic's Claude Opus 4.5 can successfully execute tasks that require nearly 5 hours of human work,
Poetiq Crushes Reasoning Benchmarks:

Policy, Risk & Geopolitics
China's "Ideological Test": New regulations in China require AI chatbots to pass a rigorous 2,000-question ideological exam,
Pentagon Partners with xAI: The Department of Defense will embed Grok-based AI systems directly into its GenAI.mil platform by early 2026,
Italy vs. Meta:

Society & The Workforce
The "Slop" Epidemic: A new study finds that over 20% of videos recommended to new YouTube users are now "AI slop": low-quality, generative content designed solely to farm views.
OpenAI's "Head of Preparedness": Sam Altman is hiring a lead to secure "systems that can self-improve,"
Sal Khan's 1% Solution: Khan Academy founder Sal Khan is proposing that companies donate 1% of profits to retrain workers displaced by the looming AI job apocalypse.

Keywords: Nvidia, Groq, GLM-4.7, Z.ai, Claude Opus 4.5, AI Slop, GenAI.mil, Pentagon, xAI, Grok, ARC-AGI-2, Graphite, Sal Khan, AI Regulation, Antitrust.

Host Connection & Engagement:
Etienne on Linkedin: https://www.linkedin.com/in/enoumen

AIA Podcast

Today we break down a maximally hyped week in AI: Amazon comes to OpenAI with billions, GPT-5.2, Pro, and Codex are released, ChatGPT suddenly gets Photoshop and PDF editing, and Disney voluntarily hands its characters over to neural networks. Google makes Gemini 3 Flash the default for millions, Cursor starts buying companies, Grok beats everyone in speech-to-speech, "drugs for AI" appear, along with a Tesla robotaxi for $4.20, Waymo freezes at intersections, and the Pentagon officially starts preparing for AGI. The finale: the word of the year, "slop," and AI architects as "Person of the Year." Cozy, unsettling, and very telling.

Hacker News Recap
December 22nd, 2025 | US blocks all offshore wind construction, says reason is classified

Hacker News Recap

Play Episode Listen Later Dec 23, 2025 14:31


This is a recap of the top 10 posts on Hacker News on December 22, 2025. This podcast was generated by wondercraft.ai

(00:30): US blocks all offshore wind construction, says reason is classified
Original post: https://news.ycombinator.com/item?id=46357881&utm_source=wondercraft_ai
(01:52): Flock Exposed Its AI-Powered Cameras to the Internet. We Tracked Ourselves
Original post: https://news.ycombinator.com/item?id=46355548&utm_source=wondercraft_ai
(03:15): Cecot – 60 Minutes
Original post: https://news.ycombinator.com/item?id=46361024&utm_source=wondercraft_ai
(04:38): If you don't design your career, someone else will (2014)
Original post: https://news.ycombinator.com/item?id=46352930&utm_source=wondercraft_ai
(06:00): Claude Code gets native LSP support
Original post: https://news.ycombinator.com/item?id=46355165&utm_source=wondercraft_ai
(07:23): Jimmy Lai Is a Martyr for Freedom
Original post: https://news.ycombinator.com/item?id=46355888&utm_source=wondercraft_ai
(08:46): The Illustrated Transformer
Original post: https://news.ycombinator.com/item?id=46357675&utm_source=wondercraft_ai
(10:08): GLM-4.7: Advancing the Coding Capability
Original post: https://news.ycombinator.com/item?id=46357287&utm_source=wondercraft_ai
(11:31): Lotusbail npm package found to be harvesting WhatsApp messages and contacts
Original post: https://news.ycombinator.com/item?id=46359996&utm_source=wondercraft_ai
(12:54): The biggest CRT ever made: Sony's PVM-4300
Original post: https://news.ycombinator.com/item?id=46353777&utm_source=wondercraft_ai

This is a third-party project, independent from HN and YC. Text and audio generated using AI, by wondercraft.ai. Create your own studio quality podcast with text as the only input in seconds at app.wondercraft.ai. Issues or feedback? We'd love to hear from you: team@wondercraft.ai

Crazy Wisdom
Episode #516: China's AI Moment, Functional Code, and a Post-Centralized World

Crazy Wisdom

Play Episode Listen Later Dec 22, 2025 64:59


In this episode, Stewart Alsop sits down with Joe Wilkinson of Artisan Growth Strategies to talk through how vibe coding is changing who gets to build software, why functional programming and immutability may be better suited for AI-written code, and how tools like LLMs are reshaping learning, work, and curiosity itself. The conversation ranges from Joe's experience living in China and his perspective on Chinese AI labs like DeepSeek, Kimi, Minimax, and GLM, to mesh networks, Raspberry Pi–powered infrastructure, decentralization, and what sovereignty might mean in a world where intelligence is increasingly distributed. They also explore hallucinations, AlphaGo's Move 37, and why creative “wrongness” may be essential for real breakthroughs, along with the tension between centralized power and open access to advanced technology. You can find more about Joe's work at https://artisangrowthstrategies.com and follow him on X at https://x.com/artisangrowth. Check out this GPT we trained on the conversation

Timestamps
00:00 – Vibe coding as a new learning unlock, China experience, information overload, and AI-powered ingestion systems
05:00 – Learning to code late, Exercism, syntax friction, AI as a real-time coding partner
10:00 – Functional programming, Elixir, immutability, and why AI struggles with mutable state
15:00 – Coding metaphors, “spooky action at a distance,” and making software AI-readable
20:00 – Raspberry Pi, personal servers, mesh networks, and peer-to-peer infrastructure
25:00 – Curiosity as activation energy, tech literacy gaps, and AI-enabled problem solving
30:00 – Knowledge work superpowers, decentralization, and small groups reshaping systems
35:00 – Open source vs open weights, Chinese AI labs, data ingestion, and competitive dynamics
40:00 – Power, safety, and why broad access to AI beats centralized control
45:00 – Hallucinations, AlphaGo's Move 37, creativity, and logical consistency in AI
50:00 – Provenance, epistemology, ontologies, and risks of closed-loop science
55:00 – Centralization vs decentralization, sovereign countries, and post-global-order shifts
01:00:00 – U.S.–China dynamics, war skepticism, pragmatism, and cautious optimism about the future

Key Insights
Vibe coding fundamentally lowers the barrier to entry for technical creation by shifting the focus from syntax mastery to intent, structure, and iteration. Instead of learning code the traditional way and hitting constant friction, AI lets people learn by doing, correcting mistakes in real time, and gradually building mental models of how systems work, which changes who gets to participate in software creation.
Functional programming and immutability may be better aligned with AI-written code than object-oriented paradigms because they reduce hidden state and unintended side effects. By making data flows explicit and preventing “spooky action at a distance,” immutable systems are easier for both humans and AI to reason about, debug, and extend, especially as code becomes increasingly machine-authored.
AI is compressing the entire learning stack, from software to physical reality, enabling people to move fluidly between abstract knowledge and hands-on problem solving. Whether fixing hardware, setting up servers, or understanding networks, the combination of curiosity and AI assistance turns complex systems into navigable terrain rather than expert-only domains.
Decentralized infrastructure like mesh networks and personal servers becomes viable when cognitive overhead drops. What once required extreme dedication or specialist knowledge can now be done by small groups, meaning that relatively few motivated individuals can meaningfully change communication, resilience, and local autonomy without waiting for institutions to act.
Chinese AI labs are likely underestimated because they operate with different constraints, incentives, and cultural inputs. Their openness to alternative training methods, massive data ingestion, and open-weight strategies creates competitive pressure that limits monopolistic control by Western labs and gives users real leverage through choice.
Hallucinations and “mistakes” are not purely failures but potential sources of creative breakthroughs, similar to AlphaGo's Move 37. If AI systems are overly constrained to consensus truth or authority-approved outputs, they risk losing the capacity for novel insight, suggesting that future progress depends on balancing correctness with exploratory freedom.
The next phase of decentralization may begin with sovereign countries before sovereign individuals, as AI enables smaller nations to reason from first principles in areas like medicine, regulation, and science. Rather than a collapse into chaos, this points toward a more pluralistic world where power, knowledge, and decision-making are distributed across many competing systems instead of centralized authorities.

China's AI Upstarts: How Z.ai Builds, Benchmarks & Ships in Hours, from ChinaTalk

Play Episode Listen Later Dec 3, 2025 83:08


This special ChinaTalk cross-post features Zixuan Li of Z.ai (Zhipu AI), exploring the culture, incentives, and constraints shaping Chinese AI development. PSA for AI builders: Interested in alignment, governance, or AI safety? Learn more about the MATS Summer 2026 Fellowship and submit your name to be notified when applications open: https://matsprogram.org/s26-tcr. The discussion covers Z.ai's powerful GLM 4.6 model, their open weights strategy as a marketing tactic, and unique Chinese AI use cases like "role-play." Gain insights into the rapid pace of innovation, the talent market, and how Chinese companies view their position relative to global AI leaders. Sponsors: Google AI Studio: Google AI Studio features a revamped coding experience to turn your ideas into reality faster than ever. Describe your app and Gemini will automatically wire up the right models and APIs for you at https://ai.studio/build Agents of Scale: Agents of Scale is a podcast from Zapier CEO Wade Foster, featuring conversations with C-suite leaders who are leading AI transformation. Subscribe to the show wherever you get your podcasts Framer: Framer is the all-in-one platform that unifies design, content management, and publishing on a single canvas, now enhanced with powerful AI features. Start creating for free and get a free month of Framer Pro with code COGNITIVE at https://framer.com/design Tasklet: Tasklet is an AI agent that automates your work 24/7; just describe what you want in plain English and it gets the job done. Try it for free and use code COGREV for 50% off your first month at https://tasklet.ai Shopify: Shopify powers millions of businesses worldwide, handling 10% of U.S. e-commerce. With hundreds of templates, AI tools for product descriptions, and seamless marketing campaign creation, it's like having a design studio and marketing team in one. 
Start your $1/month trial today at https://shopify.com/cognitive PRODUCED BY: https://aipodcast.ing CHAPTERS: (00:00) Sponsor: Google AI Studio (00:31) About the Episode (03:44) Introducing Z.AI (07:07) Zhipu AI's Backstory (09:38) Achieving Global Recognition (Part 1) (12:53) Sponsors: Agents of Scale | Framer (15:15) Achieving Global Recognition (Part 2) (15:15) Z.AI's Internal Culture (19:17) China's AI Talent Market (24:39) Open vs. Closed Source (Part 1) (24:46) Sponsors: Tasklet | Shopify (27:54) Open vs. Closed Source (Part 2) (35:16) Enterprise Sales in China (40:38) AI for Role-Playing (45:56) Optimism vs. Fear of AI (51:36) Translating Internet Culture (57:11) Navigating Compute Constraints (01:03:59) Future Model Directions (01:15:02) Release Velocity & Work Culture (01:25:04) Outro

ChinaTalk
The Z.ai Playbook

ChinaTalk

Play Episode Listen Later Nov 21, 2025 76:09


Zixuan Li is Director of Product and genAI Strategy at Z.ai (also known as Zhipu 智谱 AI). The release of their benchmark-topping flagship model, GLM 4.5, was akin to “another DeepSeek moment,” in the words of Nathan Lambert. Our conversation today covers… What sets Z.ai apart from other Chinese models, including coding, role-playing capabilities, and translations of cryptic Chinese internet content, Why Chinese AI companies chase recognition from Silicon Valley thought leaders, The role of open source in the Chinese AI ecosystem, Fears of job loss and the prevalence of AI pessimism in China, How Z.ai trains its models, and what capabilities the company is targeting next. Co-hosting today are Irene Zhang, long-time ChinaTalk analyst, as well as Nathan Lambert of the Interconnects Substack. Follow Z.ai on X: https://x.com/Zai_org Learn more about your ad choices. Visit megaphone.fm/adchoices

ChinaEconTalk
The Z.ai Playbook

ChinaEconTalk

Play Episode Listen Later Nov 21, 2025 76:09


Zixuan Li is Director of Product and genAI Strategy at Z.ai (also known as Zhipu 智谱 AI). The release of their benchmark-topping flagship model, GLM 4.5, was akin to “another DeepSeek moment,” in the words of Nathan Lambert. Our conversation today covers… What sets Z.ai apart from other Chinese models, including coding, role-playing capabilities, and translations of cryptic Chinese internet content, Why Chinese AI companies chase recognition from Silicon Valley thought leaders, The role of open source in the Chinese AI ecosystem, Fears of job loss and the prevalence of AI pessimism in China, How Z.ai trains its models, and what capabilities the company is targeting next. Co-hosting today are Irene Zhang, long-time ChinaTalk analyst, as well as Nathan Lambert of the Interconnects Substack. Follow Z.ai on X: https://x.com/Zai_org Learn more about your ad choices. Visit megaphone.fm/adchoices

Let's Talk AI
#220 - Gemini 2.5 Flash Image, Claude for Chrome, DeepConf

Let's Talk AI

Play Episode Listen Later Sep 1, 2025 52:43 Transcription Available


Our 220th episode with a summary and discussion of last week's big AI news! Recorded on 08/30/2025 Check out Andrey's work over at Astrocade, sign up to be an ambassador here Hosted by Andrey Kurenkov and co-hosted by Daniel Bashir Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

In this episode:
Google's newly released Gemini 2.5 image editing model showcases remarkable advancements, enabling highly accurate modifications of subjects while retaining their original features.
Anthropic expands Claude with an AI browser agent for Chrome and adds features to remember past conversations, enhancing the user experience and personalization.
NVIDIA and AMD to share revenue from AI chip sales to China with US government, marking a notable shift in export control policies and trade practices.
AI companion apps are experiencing substantial growth, with projected revenues expected to reach $120 million by 2025, raising questions about social implications and user engagement.

Timestamps + Links:
Tools & Apps
(00:02:12) Google Gemini's AI image model gets a 'bananas' upgrade | TechCrunch
(00:05:32) Anthropic launches a Claude AI agent that lives in Chrome | TechCrunch
(00:08:30) Anthropic's Claude chatbot can now remember your past conversations | The Verge
(00:11:46) Google Launches AI 'Guided Learning' Tool to Teach Users
(00:14:55) Apple Intelligence's ChatGPT integration will use GPT-5 starting with iOS 26 | The Verge
(00:15:39) OpenAI Adds New Features to Codex, Like IDE Extension and GitHub Code Reviews
Applications & Business
(00:16:49) Lovable projects $1B in ARR within next 12 months | TechCrunch
(00:18:56) Decart hits $3.1 billion valuation on $100 million raise to power real-time interacti | Ctech
(00:20:19) Cohere raises $500M to beat back generative AI rivals | TechCrunch
(00:21:25) Pony AI, Nearing Full-Year Robotaxi Goal, Eyes European Markets - Bloomberg
(00:22:41) Co-founder of Elon Musk's xAI departs the company | TechCrunch
Projects & Open Source
(00:24:39) Meta AI Just Released DINOv3: A State-of-the-Art Computer Vision Model Trained with Self-Supervised Learning, Generating High-Resolution Image Features - MarkTechPost
(00:27:02) GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
(00:29:49) China's DeepSeek Releases V3.1, Boosting AI Model's Capabilities - Bloomberg
(00:30:36) Open weight LLMs exhibit inconsistent performance across providers
(00:32:02) Microsoft Released VibeVoice-1.5B: An Open-Source Text-to-Speech Model that can Synthesize up to 90 Minutes of Speech with Four Distinct Speakers - MarkTechPost
Research & Advancements
(00:33:43) Deep Think with Confidence
(00:36:30) Generative AI reshapes U.S. job market, Stanford study shows
Policy & Safety
(00:41:42) Inside the US Government's Unpublished Report on AI Safety | WIRED
(00:44:10) U.S. Government to Take Cut of Nvidia and AMD A.I. Chip Sales to China - The New York Times
(00:45:13) Anthropic Settles High-Profile AI Copyright Lawsuit Brought by Book Authors
(00:46:56) AI companion apps on track to pull in $120M in 2025 | TechCrunch

Gull Lake Ministries
GLM #647 - Daniel Wallace : 3 Reasons Teens Aren't Leaving the Church

Gull Lake Ministries

Play Episode Listen Later Aug 12, 2025 33:06


Daniel Wallace is the Executive Director of Gull Lake Ministries, a Christian family ministry and retreat center in Hickory Corners, Michigan. Prior to serving 20 years at GLM, he was the Senior Director of Camps at a camp and conference center in Texas, overseeing six separate facilities which ministered to families, senior high, junior high and grade school students. Daniel, better known as Ambush, has 40 summers of Christian camping experience in Michigan, Texas, Missouri, and Kansas.

Gull Lake Ministries
GLM #646 - Daniel Wallace : Luke 15 Recap & A Parenting Recalibration

Gull Lake Ministries

Play Episode Listen Later Aug 11, 2025 48:23


Daniel Wallace is the Executive Director of Gull Lake Ministries, a Christian family ministry and retreat center in Hickory Corners, Michigan. Prior to serving 20 years at GLM, he was the Senior Director of Camps at a camp and conference center in Texas, overseeing six separate facilities which ministered to families, senior high, junior high and grade school students. Daniel, better known as Ambush, has 40 summers of Christian camping experience in Michigan, Texas, Missouri, and Kansas.

Gull Lake Ministries
GLM #645 - Daniel Wallace : Walk Through Luke 15 & the Surprise Ending

Gull Lake Ministries

Play Episode Listen Later Aug 10, 2025 43:21


Daniel Wallace is the Executive Director of Gull Lake Ministries, a Christian family ministry and retreat center in Hickory Corners, Michigan. Prior to serving 20 years at GLM, he was the Senior Director of Camps at a camp and conference center in Texas, overseeing six separate facilities which ministered to families, senior high, junior high and grade school students. Daniel, better known as Ambush, has 40 summers of Christian camping experience in Michigan, Texas, Missouri, and Kansas.

Marketing Over Coffee Marketing Podcast
Is SEO Worth Doing? Rode CallMe, and Google Opal for AI Orchestration

Marketing Over Coffee Marketing Podcast

Play Episode Listen Later Aug 1, 2025


In this Marketing Over Coffee: Learn about evaluating the ROI of SEO, Field Recording, Using Opal as an agent, and more! Direct Link to File Moonshot Kimi K2, Alibaba Qwen 3 Coder, GLM-4.5 Is SEO worthwhile? Why it’s more difficult than ever to measure NetSuite is the number one cloud financial system, bringing accounting, financial […] The post Is SEO Worth Doing? Rode CallMe, and Google Opal for AI Orchestration appeared first on Marketing Over Coffee Marketing Podcast.

This Week in Google (MP3)
IM 830: I Pay A Gentleman on Etsy - Personal Superintelligence?

This Week in Google (MP3)

Play Episode Listen Later Jul 31, 2025 172:55 Transcription Available


Interview with Ian Krietzberg Leo shows off his new AI toys Paris unveils her new desk setup Personal Superintelligence You might want to delve into this paper. I want to underscore, that's a joke you'll comprehend only with meticulous reading of it. Source: Yann LeCun will continue to work at Meta as chief scientist of the AI research group FAIR and will report to Alexandr Wang Last Week on My Mac:

All TWiT.tv Shows (MP3)
Intelligent Machines 830: I Pay A Gentleman on Etsy

All TWiT.tv Shows (MP3)

Play Episode Listen Later Jul 31, 2025 172:55 Transcription Available


Interview with Ian Krietzberg Leo shows off his new AI toys Paris unveils her new desk setup Personal Superintelligence You might want to delve into this paper. I want to underscore, that's a joke you'll comprehend only with meticulous reading of it. Source: Yann LeCun will continue to work at Meta as chief scientist of the AI research group FAIR and will report to Alexandr Wang Last Week on My Mac:

Radio Leo (Audio)
Intelligent Machines 830: I Pay A Gentleman on Etsy

Radio Leo (Audio)

Play Episode Listen Later Jul 31, 2025 172:55 Transcription Available


Interview with Ian Krietzberg Leo shows off his new AI toys Paris unveils her new desk setup Personal Superintelligence You might want to delve into this paper. I want to underscore, that's a joke you'll comprehend only with meticulous reading of it. Source: Yann LeCun will continue to work at Meta as chief scientist of the AI research group FAIR and will report to Alexandr Wang Last Week on My Mac:

This Week in Google (Video HI)
IM 830: I Pay A Gentleman on Etsy - Personal Superintelligence?

This Week in Google (Video HI)

Play Episode Listen Later Jul 31, 2025 172:55 Transcription Available


Interview with Ian Krietzberg Leo shows off his new AI toys Paris unveils her new desk setup Personal Superintelligence You might want to delve into this paper. I want to underscore, that's a joke you'll comprehend only with meticulous reading of it. Source: Yann LeCun will continue to work at Meta as chief scientist of the AI research group FAIR and will report to Alexandr Wang Last Week on My Mac:

All TWiT.tv Shows (Video LO)
Intelligent Machines 830: I Pay A Gentleman on Etsy

All TWiT.tv Shows (Video LO)

Play Episode Listen Later Jul 31, 2025 172:55 Transcription Available


Interview with Ian Krietzberg Leo shows off his new AI toys Paris unveils her new desk setup Personal Superintelligence You might want to delve into this paper. I want to underscore, that's a joke you'll comprehend only with meticulous reading of it. Source: Yann LeCun will continue to work at Meta as chief scientist of the AI research group FAIR and will report to Alexandr Wang Last Week on My Mac:

Let's Talk AI
#215 - Runway games, Meta Superintelligence, ERNIE 4.5, Adaptive Tree Search

Let's Talk AI

Play Episode Listen Later Jul 8, 2025 116:21 Transcription Available


Our 215th episode with a summary and discussion of last week's big AI news! Recorded on 07/04/2025 Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/.

In this episode:
Cloudflare's new AI data scraper blocking feature, its potential implications, and technical challenges
Meta's aggressive recruitment for its Super Intelligence Labs division is covered, highlighting key hires from OpenAI and other leaders in the field
Anthropic loses significant talent to Cursor, with details on their new economic futures program focusing on AI's impact on the labor market
Notable open-source AI model releases from Baidu and Tencent are also discussed, including their performance metrics and potential applications.

Timestamps + Links:
(00:00:11) Intro / Banter
(00:01:43) News Preview
Tools & Apps
(00:02:55) Cloudflare Introduces Default Blocking of A.I. Data Scrapers
(00:05:44) Runway is going to let people generate video games with AI
(00:11:24) Google embraces AI in the classroom with new Gemini tools for educators, chatbots for students, and more
(00:16:23) No one likes meetings. They're sending their AI note takers instead.
(00:18:08) Google launches Doppl, a new app that lets you visualize how an outfit might look on you
(00:19:14) Google's Imagen 4 text-to-image model promises 'significantly improved' boring images
Applications & Business
(00:22:18) Mark Zuckerberg announces his AI 'superintelligence' super-group
(00:29:35) Anthropic Revenue Hits $4 Billion Annual Pace as Competition With Cursor Intensifies
(00:35:10) As job losses loom, Anthropic launches program to track AI's economic fallout
(00:38:04) OpenAI says it has no plan to use Google's in-house chip
(00:41:08) Nvidia stakes new startup that flips script on data center power
(00:44:11) TSMC Arizona Chips Are Reportedly Being Flown Back to Taiwan For Packaging; U.S. Semiconductor Supply Chain Still Remains Dependent on Taiwan
Projects & Open Source
(00:46:57) Baidu releases open source model family ERNIE 4.5
(00:51:55) Tencent Open Sources Hunyuan-A13B: A 13B Active Parameter MoE Model with Dual-Mode Reasoning and 256K Context
(00:57:09) Together AI Releases DeepSWE: A Fully Open-Source RL-Trained Coding Agent Based on Qwen3-32B and Achieves 59% on SWEBench
(01:00:11) GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
(01:04:10) DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
Research & Advancements
(01:06:21) Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search
(01:13:07) The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
(01:18:04) Claude 4 Opus and Sonnet reach 50%-time-horizon point estimates of about 80 and 65 minutes, respectively
(01:21:37) Performance Prediction for Large Systems via Text-to-Text Regression
(01:25:38) Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
(01:26:33) Correlated Errors in Large Language Models
Policy & Safety
(01:29:04) Forecasting Biosecurity Risks from LLMs
(01:36:06) AI Task Length Horizons in Offensive Cybersecurity
(01:42:30) Inside Tech's Risky Gamble to Kill State AI Regulations for a Decade
(01:52:56) Denmark to tackle deepfakes by giving people copyright to their own features

Cultish
Part 2: Hermetic Mormonism with Skyler Hamilton

Cultish

Play Episode Listen Later Jun 4, 2025 72:44


In this episode of Cultish, Andrew Soncrant and Bradley Campbell (of @GLM) continue their conversation with Skyler Hamilton from Distinctive Christianity to explore the hidden world of Hermetic Mormonism. What happens when esoteric traditions, alchemy, and mysticism creep into a religion that already claims divine revelation? Skyler unpacks his story of coming to faith in Christ by unpacking the surprising roots and modern expressions of this strange blend of Hermeticism and Latter-day Saint theology. How deep does the rabbit hole go? And why should Christians be concerned about this growing trend in fringe Mormon circles? Tune in for a fascinating and eye-opening conversation you won't want to miss. Cultish is a 100% crowdfunded ministry. Partner with us & be part of the mission to change lives: https://donorbox.org/cultish Skyler's Podcast Distinctive Christianity: https://redcircle.com/shows/distincti... Bradley Campbell @GLM

Cultish
Part 1: Hermetic Mormonism with Skyler Hamilton

Cultish

Play Episode Listen Later May 28, 2025 56:00


In this episode of Cultish, Andrew Soncrant and Bradley Campbell (of @GLM) sit down with Skyler Hamilton from Distinctive Christianity to explore the hidden world of Hermetic Mormonism. What happens when esoteric traditions, alchemy, and mysticism creep into a religion that already claims divine revelation? Skyler unpacks his story of coming to faith in Christ by unpacking the surprising roots and modern expressions of this strange blend of Hermeticism and Latter-day Saint theology. How deep does the rabbit hole go? And why should Christians be concerned about this growing trend in fringe Mormon circles? Tune in for a fascinating and eye-opening conversation you won't want to miss. Cultish is a 100% crowdfunded ministry. Partner with us & be part of the mission to change lives: https://donorbox.org/cultish Skyler's Podcast Distinctive Christianity: https://redcircle.com/shows/distincti... Bradley Campbell @GLM