Podcasts about Minimax

Decision rule used for minimizing the possible loss for a worst case scenario

  • 155PODCASTS
  • 256EPISODES
  • 41mAVG DURATION
  • 5WEEKLY NEW EPISODES
  • Jun 17, 2026LATEST

POPULARITY

20192020202120222023202420252026


Best podcasts about Minimax

Latest podcast episodes about Minimax

China Daily Podcast
英语新闻丨世界杯预测成为AI新竞技场

China Daily Podcast

Play Episode Listen Later Jun 17, 2026 4:57


As the 2026 FIFA World Cup brings together more nations, more matches and more excitement, a competition has intensified off the pitch among China's leading artificial intelligence models, which are being put to the test in predicting the results of one of the world's biggest sporting events.随着2026年FIFA世界杯汇聚更多国家、更多比赛和更多激情,场外一场竞赛也在中国领先的人工智能模型之间愈演愈烈——它们正接受着预测全球最大体育赛事结果的考验。The 23rd edition of the World Cup, featuring 48 teams, is being hosted by the United States, Canada and Mexico.第23届世界杯由美国、加拿大和墨西哥联合主办,共有48支球队参赛。It opened on Thursday and runs through July 19.赛事于6月11日开幕,将持续至7月19日。Several Chinese large language models, including Qwen, DeepSeek, Kimi and MiniMax, have rolled out prediction features, turning the tournament into a new testing ground for AI-powered reasoning and data analysis.通义千问、DeepSeek、Kimi和MiniMax等多家中国大语言模型已推出预测功能,将本届赛事变成AI推理与数据分析的新试验场。"As one of the most-watched sporting events across the globe, the World Cup offers AI companies a rare opportunity to showcase the computing power and analytical skills of their LLMs to a wider audience," said Guo Tao, a member of the Chinese Association for Artificial Intelligence and a senior expert in AI.中国人工智能学会会员、资深AI专家郭涛表示:“作为全球关注度最高的体育赛事之一,世界杯为AI企业提供了一个难得的机会,向更广泛的受众展示其大语言模型的计算能力和分析技巧。”Several AI platforms have come up with interactive campaigns.多家AI平台推出了互动活动。For instance, Moonshot AI's Kimi has launched a 1 trillion-token reward pool, allowing users to share prizes by correctly predicting match winners and the final champion.例如,月之暗面的Kimi推出了1万亿token奖池,用户可通过正确预测比赛胜负和最终冠军来分享奖励。A token refers to the smallest unit of data processed by AI models.Token指的是AI模型处理的最小数据单元。Alibaba Group's Qwen has introduced a dedicated match prediction assistant, while also offering human-versus-AI prediction challenges.阿里巴巴集团的通义千问推出了专门的赛事预测助手,同时提供人机预测挑战。However, the World Cup has also exposed the limitations of current AI models when it comes to analyzing and predicting the results of sporting events.然而,世界杯也暴露了当前AI模型在分析和预测体育赛事结果方面的局限性。For example, before the Group C opener between Brazil and Morocco on Sunday, major LLMs made predictions in favor of Brazil based on both historical data and statistical indicators.例如,在6月14日巴西与摩洛哥的C组揭幕战前,各大模型根据历史数据和统计指标均预测巴西获胜。The match ended in a 1-1 draw.然而,比赛最终以1比1平局收场。Guo said that while AI can analyze historical data and statistical models, it still struggles to accurately predict real-world results, especially in sports.郭涛表示,虽然AI可以分析历史数据和统计模型,但在准确预测现实世界结果方面仍然力不从心,尤其是在体育领域。He pointed out that soccer matches are influenced by a wide range of factors in the physical world, and such variables are highly uncertain and difficult to quantify using fixed AI models, making precise predictions inherently challenging.他指出,足球比赛受现实世界中多种因素影响,这些变量高度不确定,难以用固定的AI模型量化,因此精确预测本身就极具挑战性。The limitations of current AI models were also highlighted by Wang Zhongyuan, president of the Beijing Academy of Artificial Intelligence, at this year's BAAI Conference held last week.北京智源人工智能研究院院长王仲远在上周举行的2026年智源大会上同样强调了当前AI模型的局限性。Wang said that while LLMs have become increasingly capable of solving problems in the digital world, many challenges in the physical world remain beyond their reach.王仲远表示,虽然大语言模型在解决数字世界的问题上能力日益增强,但物理世界中的许多挑战仍超出其能力范围。As a result, the next stage of AI development will gradually shift from "predicting the next token" to "predicting the next physical state", he added.因此,AI发展的下一阶段将逐渐从“预测下一个token”转向“预测下一个物理状态”,他补充道。Asked why tech companies are rolling out AI prediction features for sports when the accuracy rate is relatively low, Guo, the expert from the Chinese Association for Artificial Intelligence, said the trend partly reflects the growing pressure of competition across the industry.当被问及为何科技公司在准确率相对较低的情况下仍推出体育赛事AI预测功能时,中国人工智能学会专家郭涛表示,这一趋势在一定程度上反映了行业竞争压力日益增大。"As competition in the LLM market intensifies, technological differentiation is becoming increasingly difficult. Companies are eagerly seeking new channels to distinguish themselves from their rivals," he said.“随着大语言模型市场竞争加剧,技术差异化变得越来越困难。各家企业都在急切地寻找新的渠道来凸显自身与竞争对手的不同,”他说。As the AI technology matures, simply competing on the size parameter is not enough, Guo said.郭涛表示,随着AI技术日趋成熟,仅仅在参数规模上竞争已远远不够。"The market is paying less attention to how large a model is and more attention to whether it can deliver valuable services in real-world scenarios and solve practical problems for users," he added.“市场越来越不关注模型有多大,而是更关注它能否在现实场景中提供有价值的服务、解决用户的实际问题,”他补充道。Hu Yanping, a professor at Shanghai University of Finance and Economics, said that LLMs and AI agents are already evolving from conversation-oriented systems into task-oriented systems, while moving beyond pretraining toward continuous learning and broader real-world perception.上海财经大学教授胡燕平表示,大语言模型和AI智能体已开始从对话式系统向任务式系统演进,同时正从预训练阶段向持续学习和更广泛的现实感知能力迈进。"Exploratory projects, such as World Cup match predictions, can help accelerate this evolution," Hu said.“世界杯赛事预测等探索性项目可以加速这一演进进程,”胡燕平说。"A capability framework built around perception, interaction, decision-making and collaboration is what future task-oriented AI agents need."“围绕感知、交互、决策和协作构建的能力框架,正是未来任务式AI智能体所需要的。”large language models (LLMs) /lɑːdʒ ˈlæŋɡwɪdʒ ˈmɒdəlz/大语言模型rolled out /rəʊld aʊt/推出testing ground /ˈtestɪŋ ɡraʊnd/试验场token /ˈtəʊkən/词元(AI模型处理的最小数据单元)statistical indicators /stəˈtɪstɪkəl ˈɪndɪkeɪtəz/统计指标

AIA Podcast

Сегодня рассказываем, сможет ли ChatGPT, Grok или Gemini посадить самолёт, разбираем новые лимиты GPT Pro, подписку Codex и Claude Opus 4.8, обсуждаем коллаб Папы Римского с ИИ-разработчиками, открытые модели Microsoft MAI, Gemini Omni, Grok Build, Mistral Le Chat как агента VIPE, Nvidia Nemotron-3 Ultra и Perplexity Search the Code. Плюс: танцующие роботы Unitree, подводные дата-центры Китая, ИИ-слежка по камерам, Qwen VLA, MiniMax M3, DeepSeek на Huawei, новые ИИ-законы в Европе и США, бешеные бонусы Samsung, запрет на выезд китайских ИИ-спецов — и первый ИИ-фильм в Каннах.

The Water Tower Hour
Maison Solutions Inc. (MSS): Grocery Gets Smart: Maison's AI-and-Blockchain Playbook

The Water Tower Hour

Play Episode Listen Later Jun 16, 2026 25:15


Send us Fan MailIn this episode of the WTR Small-Cap Spotlight podcast, Chris Zhang, Vice President of Corporate Development and Strategy of Maison Solutions Inc. (Nasdaq: MSS) joins host Tim Gerdeman, Vice Chair, Co-Founder and Chief Marketing Officer of Water Tower Research, and WTR Analyst James Kisner. Maison is a specialty grocery retailer serving Asian-American and other ethnic communities, operating HK Good Fortune stores in Southern California and Lee Lee International Supermarkets in Arizona. Zhang lays out the company's tech-driven transformation across four pillars: store inventory, sales and order operations, customer privacy, and customer loyalty, with AI-powered forecasting and replenishment furthest along and perishables the first problem it tackles. He details the newly announced collaboration framework with SupplyAi and MiniMax aimed at embedding AI in everyday food-retail workflows, the direct-sourcing strategy across Asia including the Guizhou Moutai distribution agreement, and the company's Worldcoin (WLD) treasury position and early proof-of-human exploration. The conversation closes with the operating KPIs and milestones that would signal the AI and solutions strategy is working over the next 12 months.

Vidas en red Spreaker
Usando la #IA para reclamar al Corte Inglés

Vidas en red Spreaker

Play Episode Listen Later Jun 10, 2026 27:56 Transcription Available


El pasado mes de Febrero acudí a informarme de ciertos seguros en el Corte Inglés.Me esperaba una pesadilla. Sin mi consentimiento activaron los seguros y procedieron a cobrarme por seguros que no firmé, no contraté, y con datos que no les autoricé.Desamparado, sin ayuda, invoqué a la IA, en concreto a Gemini y a #Codex. Y de momento ha funcionado, automaticé tareas que de otra manera me hubiera llevado mucho, mucho más tiempo. 

【工程師聊什麼】
第 305 集 - 現在聲音也用 ai 了。老高把自己活成了傳奇。人腦貴在便宜。Grok 真的很兇。

【工程師聊什麼】

Play Episode Listen Later Jun 7, 2026 34:57


工程師都宅宅的不太會講話? 其實工程師的幹話多到你聽不下去! ------ 加入粉絲團留言互動! https://www.facebook.com/%E5%B7%A5%E7%A8%8B%E5%B8%AB%E8%81%8A%E4%BB%80%E9%BA%BC-109229084578194 ------ softwaretalkthreesmall@gmail.com -- Hosting provided by SoundOn

Let's Talk AI
#247 - Opus 4.8, MAI, Anthropic IPO, Minimax-M3

Let's Talk AI

Play Episode Listen Later Jun 6, 2026 105:02


Our 247th episode with a summary and discussion of last week's big AI news!Recorded on 06/03/2026Hosted by Andrey Kurenkov and Jeremie HarrisFeel free to email us your questions and feedback at andreyvkurenkov@gmail.com and/or hello@gladstone.aiRead out our text newsletter and comment on the podcast at https://lastweekin.ai/In this episode:Anthropic released Claude Opus 4.8 with improved benchmark scores, discussed eval-awareness findings and welfare/corrigibility themes from its system card, and introduced Dynamic Workflows for long-running multi-agent tasks.Microsoft unveiled the always-on Microsoft Scout assistant built on OpenClaw plus new in-house MAI models (including MAI Thinking 1) and “frontier tuning,” emphasizing enterprise security architecture and model-from-scratch capability.Major business moves included Anthropic's $65B Series H at a $965B valuation alongside an IPO filing, a JPMorgan analysis arguing OpenAI needs major revenue growth to justify infrastructure spend, and Cognition raising $1B at a $25B valuation.Policy and security highlights covered Trump's voluntary pre-release government testing framework for powerful AI, Meta AI support being exploited to hijack Instagram accounts, tightened US Nvidia export controls and China's travel approvals for AI experts, plus expanded Glasswing/Mythos-style cyber and biodefense initiatives.Timestamps:(00:00:10) Intro / Banter(00:04:10) Sponsors(00:07:10) News PreviewTools & Apps(00:07:54) Anthropic releases Opus 4.8 with new 'dynamic workflow' tool | TechCrunch(00:22:37) Microsoft Scout is a new AI personal assistant built on OpenClaw | The Verge(00:26:55) Microsoft launches new MAI family of AI models at Microsoft Build | Mashable(00:37:43) Robinhood now lets your AI agents trade stocks | TechCrunch(00:40:49) OpenAI launches new Codex tools for white-collar work | TechCrunch(00:43:40) ElevenLabs' new music-generation model can switch genres mid-track | TechCrunchApplications & Business(00:44:35) Anthropic Hits $965 Billion Valuation, Surpassing OpenAI - WSJ(00:45:32) Anthropic Files to Go Public, Setting Stage for Huge I.P.O. - The New York Times(00:51:15) China's ByteDance Developing New AI Chips Like Those from Nvidia Partner Groq(00:55:00) Anthropic expands Mythos to 150 additional organizations(00:55:35) OpenAI needs a 26x revenue increase to justify its buildout(00:58:46) AI coding startup Cognition raises $1B at $25B pre-money valuation | TechCrunchProjects & Open Source(01:00:50) MiniMax-M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark performance for just 5-10% of the cost | VentureBeatPolicy & Safety(01:06:08) Trump Signs Executive Order Seeking Oversight of A.I. Models - The New York Times(01:11:45) Hackers Simply Asked Meta AI to Give Them Access to High-Profile Instagram Accounts. It Worked(01:13:058) Chinese AI experts in private firms now required to secure approval before international travel — Beijing enforces policy to secure top-tier talent, expands measures beyond government(01:17:53) U.S. Tightens Controls on Nvidia AI Chip Exports | Let's Data Science(01:21:47) OpenAI launches Rosalind Biodefense, offers federal agencies early access to its life-sciences model(01:24:00) Using LLMs to secure source code(01:26:19) Project Glasswing: An initial update(01:29:30) White House Approves $9 Billion for Spy Agencies to Catch Up on A.I.(01:32:11) US Law Enforcement Warns of ‘Anti-Tech Extremism' as AI Hatred GrowsSynthetic Media & Art(01:35:38) YouTube will now automatically label AI videos | TechCrunchResearch & Advancements(01:36:22) Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention(01:41:26) From Simulation to Enaction: Post-trained language models recognize and react to their own generationsSee Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The top AI news from the past week, every ThursdAI

Hey folks, Alex here, let me catch you up! I've had a feeling that this week is going to be crazy, as it started on the weekend MiniMax M3, then with Jensen announcing new RTX Spark, NVIDIA's first PC chip packing 1 petaflop of local AI power into thin laptops.A few days later at Microsoft BUILD, Satya & Mustafa from MAI dropped 7 AI models, completely pre-trained from scratch, including a new MAI-thinking-1, MAI-code and MAI-image 2.5 that started topping the image gen charts. Then other image models started racing to the top of the Arena benchmarks, IdeoGram 4 hitting becoming SOTA open weights image-gen model, and Reve 2 beating Nano Banana just a few hours after that. And then today, NVIDIA dropped Nemotron 3 Ultra, their latest 550B open weights model, data and training and Arena published a new agentic eval leaderboard and we got a new Gemma 4 12B. I've had the great pleasure to host Chris (@llm_wizard) from Nvidia, Peter Gostev from Arena and Karan from Nous Research (who were featured prominently by Jensen!) all on the show. Def don't miss this one! Let's get into the details. ThursdAI - Join the flock of folks who know what is happening in AI before everyone else.Open Source LLMs

Fine Time
UFO 50: Was I Wrong? | Postgame Show

Fine Time

Play Episode Listen Later Jun 4, 2026 38:01


Andre looks back at his previous UFO 50 episode and examines how his feeling have evolved on games he didn't initially like very much. He dares to ask the bold question: Was I Wrong? Andre: @pizzadinosaur.fineti.me Fine Time: @fineti.me [00:00] Intro and Premise [02:46] Ninpek [07:07] Rail Heist [10:22] Velgress [13:27] Mini & Max [16:36] Caramel Caramel [21:48] Cyber Owls [25:35] Campanella 2 [30:10] Star Waspir [35:48] Thanks For Listening!

AI Inside
The $4 Trillion AI IPO Wave Is About to Break

AI Inside

Play Episode Listen Later Jun 4, 2026 71:41


Jason Howell and Jeff Jarvis open on the biggest week in AI yet: Anthropic closed a $65 billion round at a $965 billion valuation, passing OpenAI, right as OpenAI crossed 1 billion monthly users and SpaceX, Anthropic, and OpenAI all line up to go public. They get into what a $4 trillion IPO wave means for the market, plus Claude Opus 4.8 and Anthropic's Mythos expansion. Also in this episode: Google lets publishers opt out of AI search, Microsoft floods Build 2026 with seven new models and an always-on agent, Nvidia's RTX Spark aims to reinvent the PC, companies start rationing AI as costs explode, ElevenLabs ships emotion-preserving dubbing, plus math, robots, a Meta chatbot hack, MiniMax M3, and Trump's scaled-back AI order. Find every episode at aiinside.show. Note: Time codes subject to change depending on dynamic ad insertion by the distributor. 0:00 - Start 0:03:48 - Anthropic raises $65B Series H at a $965B valuation, overtaking OpenAI 0:15:50 - ChatGPT app hits 1 billion monthly active users in record time, data shows 0:17:09 - Anthropic launches Claude Opus 4.8, its most honest model yet 0:21:46 - Anthropic expands Mythos to 150 additional organizations in more than 15 countries 0:29:48 - Microsoft Build 2026 keynote: seven AI models, MAI-Thinking-1, Project Solara, and a Copilot super app 0:31:17 - Inside Microsoft's Project Solara: A new platform for devices that run AI agents instead of apps 0:40:54 - Nvidia announces the RTX Spark Arm chip at Computex 2026 0:47:05 - Amazon kills internal AI leaderboard after employees gamed it with pointless tasks 0:48:18 - Uber caps monthly employee AI spending at $1,500 per tool amid soaring costs 0:50:32 - ElevenLabs launches Dubbing v2, preserving emotion across 90+ languages 0:56:27 - AI startup Shift offers free NYC home cleaning to collect robot training data 0:59:31 - As A.I. Makes Strides in Mathematics, Mathematicians Urge Caution 1:01:00 - Hackers used Meta's AI support chatbot to hijack high-profile Instagram accounts 1:03:23 - China's MiniMax launches M3, rivaling Claude Opus 4.7 at $0.12 per million tokens 1:04:19 - Trump signs a scaled-back AI executive order Hosts: Jason Howell and Jeff Jarvis  Download and subscribe to AI Inside in audio and video: https://aiinside.show/  Support the podcast on Patreon for special perks: https://www.patreon.com/aiinsideshow. You'll get ad-free episodes, members-only Discord, T-shirts and stickers you love, and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Learn more about your ad choices. Visit megaphone.fm/adchoices

Sidecar Sync
The 4 Modes of Working with AI, The Transformation Paradox, & Building a Learning Organization | 137

Sidecar Sync

Play Episode Listen Later Jun 4, 2026 62:36


Send us Fan MailIn this jam-packed “mini” episode, Amith Nagarajan and Mallory Mejias break down a whirlwind of recent AI model releases—from Anthropic, Alibaba, Microsoft, and beyond—and what they signal about the rapidly evolving AI landscape. Then, they dive into Microsoft's 2026 Work Trend Index Report, unpacking the “agency equation” and what it really means for organizations navigating AI adoption. From the rise of agents and the four modes of working with AI to the growing gap between employee readiness and organizational culture, this episode explores why AI transformation is less about tools and more about leadership, systems, and mindset. Plus, they introduce the concept of “owned intelligence” and what it takes to become a true learning organization in the age of AI. 

Vidas en red Spreaker
MiniMax la tarifa plana OpenClaw

Vidas en red Spreaker

Play Episode Listen Later Jun 3, 2026 26:04 Transcription Available


Agencia recaudatoria de la Isla (Proyecto MEGA ISLA):Paypal: juliommd@hotmail.comBizum: https://revolut.me/julioqdf

Techmeme Ride Home
Interviewing For A Job At Anthropic? DON'T Use AI.

Techmeme Ride Home

Play Episode Listen Later Jun 1, 2026 21:46


Nvidia unveiled the RTX Spark, an Arm-based consumer chip family built with MediaTek on TSMC 3, plus a DGX Station desktop that runs 1T-parameter models. Intel detailed its Crescent Island GPUs, MiniMax launched a coding model rivaling Opus 4.7 at 1/40th the price, and Anthropic bans AI in interviews. Nvidia announces the RTX Spark, an Arm-based consumer chip family it calls "the most efficient PC chip ever built", made on TSMC 3 in partnership with MediaTek (The Verge) Intel details its Crescent Island data center GPUs, built on its Xe3P architecture and using LPDDR5X memory instead of HBM, calling them "built for agentic AI" (Tom's Hardware) Nvidia unveils DGX Station for Windows, a desktop PC powered by a GB300 Grace Blackwell chip with up to 748 GB of memory, capable of running 1T-parameter models (SiliconAngle) Chinese AI developer MiniMax debuts M3, a new coding model that it says rivals Claude Opus 4.7, costing $0.12 per 1M input tokens, compared with $5 for Opus 4.7 (The Information) A look at Anthropic's hiring process, which prohibits AI use in interviews and features a culture interview that candidates describe as highly intense (Bloomberg) Learn more about your ad choices. Visit megaphone.fm/adchoices

Courtside Financial Podcast
NIO's Swap Network Delivers 16% Of All EV Energy In China | The Bull Case Nobody Is Talking About

Courtside Financial Podcast

Play Episode Listen Later Jun 1, 2026 9:51


Four stories to close out May — starting with the mostunderrated bull case in the NIO thesis right now.NIO's battery swap network delivered 16% of all EV chargingenergy in China over a five-day period. Not 16% of NIOvehicle energy. 16% of ALL electric vehicle charging energyacross the entire Chinese market. From one company'sinfrastructure. In five days. The market is pricing NIOas a car company. The swap network is becoming an energyinfrastructure business — a recurring revenue platformwith a flywheel that gets stronger with every vehicle sold.Gen 5 swap stations arrive mid to late June, unifiedacross all three NIO brands for the first time. The numberthat is already 16% goes higher.Software stocks closed May as their best month since 2001.The SaaSpocalypse — the fear that AI would destroy SaaS —didn't happen. Dell surged 33%. Snowflake 36.5%.AI is making software more valuable not less — selectingfor the companies that use AI as a feature rather thanrunning from it as a threat.SpaceX trimmed its self-assessed valuation to $1.8 trillionand is still on track for the largest IPO in world history.MiniMax — the Chinese AI company recently compared toDeepSeek — is pursuing a China listing. SoftBank announcedplans to invest up to €75 billion in French AI data centers.The AI infrastructure arms race is now a global story.The US and Iran are "mostly agreed" on a 60-day memorandumof understanding — ceasefire extended, Hormuz reopens,nuclear talks begin. The deal still needs Trump's signature.Oil closed May at $92.56 — down nearly 19% for the month —worst monthly performance since COVID. Watch Monday morning.If the deal gets signed the market opens at a new recordand everything that's been held back by the macro overhangsince February starts to move.

网事头条|听见新鲜事
官方预告MiniMax M3系列AI模型即将登场

网事头条|听见新鲜事

Play Episode Listen Later May 27, 2026 0:32


Choses à Savoir TECH
La Chine, un maître de l'IA open source qui séduit les pays du Sud ?

Choses à Savoir TECH

Play Episode Listen Later May 5, 2026 2:20


C'est un signal fort dans la bataille mondiale de l'intelligence artificielle. Selon une étude conjointe du MIT et de Hugging Face, relayée par le MIT Technology Review, les modèles open source chinois représentent désormais 17,1 % des téléchargements mondiaux sur la plateforme. Les modèles américains, eux, tombent à 15,86 %. Une première.Ce basculement remonte à janvier 2025, avec la publication du modèle R1 par DeepSeek. Sa particularité : une licence MIT, très permissive, qui autorise librement l'utilisation, la modification et la redistribution. En clair, n'importe quel développeur peut s'en emparer sans contrainte commerciale. Et surtout, ses performances rivalisent avec celles de modèles fermés américains, pour un coût d'utilisation bien plus faible. Dans la foulée, d'autres acteurs chinois ont suivi : Alibaba avec la famille Qwen, Moonshot AI ou encore MiniMax. Résultat : fin 2025, Qwen dépasse même Llama, le modèle de Meta, en nombre de téléchargements cumulés.La différence de stratégie est nette. Côté américain, les modèles sont souvent accessibles via des API payantes — c'est-à-dire des interfaces permettant d'utiliser l'IA à distance, moyennant abonnement. Côté chinois, ils sont proposés en accès libre, téléchargeables et exploitables localement. Un avantage décisif dans de nombreuses régions du monde.En Afrique, en Asie du Sud-Est ou en Amérique latine, ces modèles comblent un vide. Ils fonctionnent sur des machines modestes, ne nécessitent pas de carte bancaire et évitent les contraintes liées à l'hébergement des données à l'étranger. En Europe, la réponse s'organise autour d'acteurs comme Mistral AI, qui mise sur la souveraineté et la conformité réglementaire, notamment au RGPD. Mais l'approche reste différente : là où les modèles chinois privilégient le volume et l'adoption massive, les Européens ciblent avant tout les entreprises. Au fond, deux visions s'opposent. L'une ouverte, rapide, centrée sur l'écosystème. L'autre plus encadrée, tournée vers la régulation. Et dans cette course, le terrain est désormais mondial. Hébergé par Acast. Visitez acast.com/privacy pour plus d'informations.

网事头条|听见新鲜事
MiniMax宣布云端沙箱Hermes上线

网事头条|听见新鲜事

Play Episode Listen Later Apr 16, 2026 0:29


Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0
Notion's Token Town: 5 Rebuilds, 100+ Tools, MCP vs CLIs and the Software Factory Future — Simon Last & Sarah Sachs of Notion

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Apr 15, 2026 77:17


For all those who missed out on London, see you in Miami next week!Notion, the knowledge work decacorn, has been building AI tooling since before ChatGPT, with many hits from Q&A in 2023 and unified AI in 2024 and Meeting Notes in 2025. At the end of their last Make user conference, Ryan Nystrom teased Notion 3.0's Custom Agents - and they are finally embracing the Agent Lab playbook!Sarah Sachs and Simon Last of Notion join us for a deep dive into how Notion built Custom Agents, why it took years and multiple rebuilds to get right, and what it means to turn a productivity tool into an agent-native system of record for enterprise work.We go inside the product, engineering, evals, pricing, and org design decisions behind one of the most ambitious AI product efforts in software today — from early failed tool-calling experiments in 2022 to agent harnesses, progressive tool disclosure, meeting notes as data capture, and the long-term vision for software factories and agentic work.We discuss:* Sarah and Simon's path to launching Notion Custom Agents, and why the feature was rebuilt four or five times before it was ready for production* Why early agent attempts failed: no tool-calling standard, short context windows, unreliable models, and too much complexity exposed to the model* The “Agent Lab” thesis: not just wrapping a model, but understanding how people collaborate and building the right product system around frontier capabilities* How Notion thinks about roadmap timing: not swimming upstream against model limitations, but also building early enough that the product is ready when the models are* Why coding agents feel like the kernel of AGI, and how Notion is thinking about “software factories” made up of agents that spec, code, test, debug, review, and maintain codebases together* How Sarah runs AI engineering at Notion (“notes from Token Town”): objective-setting over idea ownership, low-ego teams comfortable deleting their own work, and a culture designed to swarm around fast-changing opportunities* The “Simon Vortex,” company hackathons, and why security gets pulled in early rather than late* How Notion organizes AI: core AI capabilities and infrastructure, product packaging teams, and a broader company mandate that every product surface must increasingly work for both humans and agents* Why prototypes have become much easier to build internally, and how “demos over memos” changes product development inside a tool the whole company already uses every day* Notion's eval philosophy: regression tests, launch-quality evals, and “frontier/headroom” evals that intentionally only pass ~30% of the time so the company can see where model capabilities are going* What a “Model Behavior Engineer” is, and why Notion treats eval writing, failure analysis, and model understanding as a distinct function rather than just software engineering* The changing role of software engineers in the age of coding agents, and why the new job looks less like typing code and more like supervising a rigorous outer system of agents, PRs, and verification loops* How the “software factory” should work: specs, self-verification, bug flows, subagents, and minimizing human intervention while preserving the invariants that matter* A live walkthrough of a Notion Custom Agent handling coworking space tenant applications by triaging email, enriching applicants with web search, and writing structured data into a Notion database* How agents compose inside Notion: shared databases as primitives, agents invoking other agents, “manager agents” supervising dozens of specialized agents, and memory implemented simply as pages and databases* Notion's take on MCP vs CLI: why Simon is bullish on CLI's self-debugging nature, where MCP still makes sense, and how Sarah thinks about capability, determinism, permissioning, and pricing alignment* The evolution of Notion's internal agent harness: from early JavaScript coding agents, to custom XML, to Markdown and SQL-like abstractions, to tool definitions, progressive disclosure, and a much shorter system prompt* Why Notion cares about teaching “the top of the class,” building for sophisticated operators rather than abstracting away too much capability for everyone* How agent setup works today: agents that can configure themselves, inspect their own failures, and edit their own instructions — with guardrails around permissions* How Notion prices Custom Agents: credits as an abstraction over tokens, model type, serving tier, web search, and future sandbox costs; why usage-based pricing was necessary; and how “auto” tries to match the right model to the right task* Why Notion is not eager to train a foundation model, where they do fine-tune and optimize today, and why retrieval/ranking is one of the most important investment areas as more searches come from agents rather than humans* Why Meeting Notes became one of Notion's strongest growth loops: not just as transcription, but as high-signal data capture that powers search, custom agents, follow-up workflows, and the broader system of record for company collaboration* Why Notion is more interested in being the place where collaboration data lives than in building hardware themselves — and how wearables or other capture devices may eventually feed into that systemSarah SachsLinkedIn: https://www.linkedin.com/in/sarahmsachsX: https://x.com/sarahmsachsSimon LastLinkedIn: https://www.linkedin.com/in/simon-last-41404140X: https://x.com/simonlastFull Video EpisodeTimestamps* 00:00:00 Introduction and launching Notion Custom Agents* 00:01:17 Why Notion rebuilt agents four or five times* 00:03:35 Building for where models are going, not just where they are* 00:05:32 The Agent Lab thesis, wrappers, and product intuition* 00:08:07 User journeys, leadership, and low-ego AI teams* 00:13:16 The Simon Vortex, hackathons, and bringing security in early* 00:16:39 Team structure, demos over memos, and building for agents* 00:20:25 Evals, Notion's Last Exam, and the Model Behavior Engineer role* 00:27:37 Evals as an agent harness and the changing role of software engineers* 00:30:42 The software factory: specs, verification, and agent workflows* 00:32:18 Live demo: a custom agent for coworking space applications* 00:35:08 Composing agents, manager agents, and memory as pages* 00:38:15 Notion Mail, Gmail, native integrations, and tools* 00:39:43 MCP vs CLI and the cost of capability* 00:44:13 When Notion uses MCP vs building its own integrations* 00:47:43 The history of Notion's agent harness rebuilds* 00:55:35 Power users, public tools, and the setup agent* 00:58:01 Self-fixing agents, permissions, and “flippy”* 01:01:13 Pricing, credits, and choosing the right model automatically* 01:09:01 Why Notion isn't training its own frontier model* 01:14:07 Retrieval, ranking, and search built for agents* 01:17:27 Meeting Notes as data capture and workflow automation* 01:21:18 Wearables, hardware, and Notion as the system of record* 01:23:45 OutroTranscript[00:00:00] Alessio: Hey everyone. Welcome to the Latent Space podcast. This is Alessio founder of Kernel Labs and I'm joined by swyx, editor of the Latent Space.[00:00:11] swyx: Hello. Hello. We're back in the beautiful studio that, uh, Alessio has set up for us with Simon and Sarah from Notion. Welcome.[00:00:18] Sarah Sachs: Thanks for having us.[00:00:19] Alessio: Thanks for having us. Yeah.[00:00:20] swyx: Congrats on the launch recently the custom agents, finally it's here. How's it feel?[00:00:26] Sarah Sachs: We ship things slowly. So it had been in Alpha for a little bit and at the point at which is it's an alpha, um, there's a group of people that are making sure it's ready for prod, and then there's a group of people working on the next thing.So sometimes some of these launches are a bit delayed satisfaction, so it's quite nice to remind yourself all the work you did because we do have a habit of like. Being two or three milestones ahead. Uh, just ‘cause you have to be, you know, you can't get complacent. Um, but it's been great that people understood how this is helpful.And I think that's just easier in general building AI tools today than it was two, three years ago. People kind of get it and so that user education, um, there's just, it was our most successful launch in terms of free trials and converting people and things like that. It was really successful, so yeah.But there's a lot to build.[00:01:12] swyx: Making it free for three months helps.[00:01:16] Sarah Sachs: Yep.[00:01:17] Simon Last: It was definitely super exciting for me because it's probably the fourth or fifth time that we rebuilt that.[00:01:22] swyx: Yes.[00:01:23] Simon Last: And I mean,[00:01:24] swyx: you've been building this since like 20, 22.[00:01:26] Simon Last: Yeah, I mean, like, it was even right when we got access to like GPT four in late 20 22, 1 of the first ideas we had is like, oh, okay, let's make an agent that I, we used the word assistant at the time, there wasn't really the word, the word agent yet, but, oh, we'll give an access to all the tools the notion can do, and then it, we run in the background like, like do work for us.And then we just tried that many times and it just. Was too early. Um,[00:01:48] swyx: I need to force you to like double click on that. What is too early? What didn't work?[00:01:52] Sarah Sachs: We were fine to, like, before function calling came out. We were trying to fine tune with the Frontier Labs and with fireworks, like a function calling model on notion functions.This is right when I joined. I joined because, um, we needed a manager as Simon was needed to be able to go on vacation. So, uh, that's, that's around when I joined, so you can speak much more to it.[00:02:11] Simon Last: Yeah, we did partnerships with both philanthropic and open AI at different times, uh, to try to, at the time the, I mean, when we first tried, there wasn't even a constant of like tools yet.We, we sort of designed our own like, like tool calling framework and then we tried to fine tune the models to, uh, to use it over multiple turns. Um, and because it, it didn't work well out the box, I think. Yeah. The models are just too dumb and the context thing was also way too short.[00:02:37] Alsesio: Yeah.[00:02:37] Simon Last: Um, and yeah, we just kind of banged our head against it for a long time.Uh, unfortunately it was always like, there was always like sort of. Glimmers that it was working, but um, it never felt quite robust enough to be like a useful, delightful thing. Um, until I would say, uh, the big unlock was probably like Sonic 3.6 or seven, uh, early last year. And that's when we started working on our agent, which we shipped last year.Um, and then, and then uh, uh, custom agents, kinda a similar capability and that, that one just took longer because we, we just wanted to get the reliability up a lot higher. ‘cause it's actually running in the background.[00:03:14] Sarah Sachs: And the product interface of like permissions and understanding, you know, this custom agent is shared in a Slack channel with X group of people and has access to documents that are surfaced to Y group of people.And the intersect experts, Y might not be whole. And so how do you build the product around making sure administrators understand that permissioning took multiple swings.[00:03:35] Alsesio: Everything is hard back at the end of the day. Yeah. I'm curious, like when the models are not working, how do you inform the product roadmap of like, okay, we should probably build, expecting the models to be better at some reasonable pace, but at the same time we need to, you know, you had a lot of customers in 2022.It's not like you were a new company or like no user base.[00:03:54] Simon Last: Yeah, I mean I think there's always the balance of, you know, like you want to be a GI pilled and thinking ahead and building for where things are going. Uh, but also you wanna be like shipping useful things. And so we always try to like, like keep a balance there.You know, we. We try to take clear, like a portfolio approach. You know, we're always working on multiple projects and, and we're always trying to work on, you know, maintaining things where that have already shipped, like, like shipping new things that are like eminently working well and make them really good.And, and then we wanna always have a few projects that are a little bit crazy. Um,[00:04:23] Alsesio: and what are the a GI peel projects that you have today? I'm curious about, uh, you don't have to share exactly what you're working on, but I'm curious what are things today that maybe in 18 months people will be like, oh, obviously this was gonna work[00:04:35] Sarah Sachs: 18 months.[00:04:37] Alsesio: Yeah, 18 months is, you know,[00:04:37] Sarah Sachs: it's a long time and Yeah. Yeah.[00:04:39] Simon Last: I mean, there's a number of things happening. I think one thing that's becoming more clear is I think like, like, uh, coding agents are the kernel of EGI, sort of, everything is a coding agent. Mm-hmm. I think that's, that's sort of one, one direction.Um, and then, yeah, the exciting thing about that is sort of your agent can sort of bootstrap its own software and capabilities and actually debug and maintain them. And so yeah, we're, we're, we're thinking a lot about that. And then, yeah, like, like another category of things that I'm, I'm really excited about is like, uh, we call the software factory also.People are using this, uh, this, this sort of word. Um, basically it just means can you create sort of like a, as automated as possible, a workflow for developing debugging. Mm-hmm. Merging, reviewing, and maintaining a code base and a service where there's a bunch of agents working together inside, and like, like how does that work?[00:05:28] Sarah Sachs: If you think back to your initial question, like, why did this take so long? I think something,[00:05:32] swyx: I didn't say that, but Yes. Okay. Go ahead.[00:05:34] Sarah Sachs: Why, what, what changed over the three and half years of trying[00:05:37] swyx: it? Exactly. Right. Because most people always say like, it didn't work yet. Then reasoning models came, then it worked.I was like, okay, let's go a little[00:05:43] Sarah Sachs: bit. That's, I mean, that's part of it, but I think the other part of it that I actually think is really what will set notion apart for every new capability is we have like. Two skills that are crucial when it comes to frontier capabilities. One is not letting yourself swim upstream.So like quickly realizing if you're just pressing against model capabilities versus not exposing the model to the right information, not having the right infrastructure set up. That and of itself is the skill of intuition. And the second is to see, okay, you're not swimming upstream. Which direction is the river flowing and what is like, how do we think ahead about the product and start building it even if it's not great yet, so that when it is there, we're ready for it.Right? And like those can sometimes feel like counterintuitive things. Like we can be trying to fine tune a tool calling model when they don't exist yet. And that the trick is to not do that for too long, but realize that there was something there. And we've had a lot of things which like, um, we're just like not swimming in the right direction with the streams.I think we had multiple versions of transcription before we got meeting notes, right? Oh, I gotta talk[00:06:39] swyx: about that. Yeah.[00:06:40] Sarah Sachs: Yeah. Um, and so. I, I, I think that like we, we really closely partner with the Frontier Labs on capabilities and we also have to have strong conviction on, as those capabilities move.Notion is about being the best place for you to collaborate and do your work. And how does that narrative change if the way that we work changes?Yeah.[00:06:58] swyx: Yeah. You told me you were a fan of the Agent Lab thesis, and this is, this is kind of it, right?[00:07:02] Sarah Sachs: Right. I show that thesis to so many candidates. Like I have it as like micro chrome autofill.Um, at this point, like it's one of my most visitations[00:07:10] swyx: because like, is this the, here's why you should work in notion and not open, open eye. I, it's like,[00:07:14] Sarah Sachs: here's, here's what's different about it.[00:07:16] swyx: Yeah.[00:07:16] Sarah Sachs: And here's why. It's not just a rapper. I actually think more and more people understand it's not just a wrapper.[00:07:21] swyx: Yeah.[00:07:22] Sarah Sachs: Um, and by the way, like in the beginning, parts of what we build are wrappers on functionality. That works well, of course, but that's not really the most, um. I would say that's not the product that, that drives revenue. And that's not necessarily always what users need.[00:07:35] swyx: I mean, you know, notion is the AWS wrapper, but like the, the wrapper is very beautiful and like very, very well polished.So[00:07:40] Sarah Sachs: like the analogy,[00:07:41] swyx: like[00:07:42] Sarah Sachs: the analogy that I've been coming back to his Datadog in AWS[00:07:45] swyx: Yeah.[00:07:46] Sarah Sachs: So, uh, Datadog could not exist with, without cloud storage. Right. That it's kind of fundamental that that works. Um, and AWS has like a CloudWatch product, but Datadog is an expert on understanding how people want observability on the products they launch.And we're experts in understanding how people wanna collaborate, and that's really where our expertise lies.[00:08:04] swyx: Totally.[00:08:04] Sarah Sachs: Um, regardless of the tools that we use,[00:08:07] Alsesio: I'm kind of curious how you think about implicit versus explicit expertise. I feel like Datadog is half and half implicit and explicit. It's like they understand across markets and industries what engineering teams usually look for.With notion, it's almost like more of the expertise is at the edge because you as a platform, you're like so horizontal that the end user is not really the same. Mm-hmm. Like with Datadog, the end user is always like, yeah, an engineering lead, a kinda like SRE related person with notion. It can be anything.So I'm curious how you put that expertise into a product versus, you know, obviously it, WS cannot build notion. It's, that doesn't quite work in this case, but[00:08:44] Simon Last: it's, it's a little bit differently shaped. I think, you know, a classic vertical SaaS, like the data is kind of like that. They understand their individual customer very deeply.It's kinda a narrow slice, um, notion has always been super horizontal. And our, our task has always been to sort of balance these two somewhat opposing forces of like, we're listening to our customers and what they want us to build. It's a broad slice. And then also we're thinking about like, okay, how do we decompose what they want into, uh, nice primitives that are, that are really nice to use and we'll, we'll get us like as much bang for the buck as possible.And then, you know. Maintain the whole system, make it all like, like super clean and nice to use.[00:09:22] Sarah Sachs: We still have user journeys. I mean, we still focus on like core. I actually think the failure of our team is when we focus too much on what are cools that are, what are tools that are[00:09:31] Simon Last: mm-hmm.[00:09:31] Sarah Sachs: Cool tools. I actually think that's when we make have the least velocity because you still need some sort of focus on a user journey.So like for instance, we'll all sit down every Friday and look at the P 99 of like the most token exhaustive custom agent transcript and just look at why it didn't do well and cut a bunch of tasks. Like we still focus on like, this has, like this should work. Email triaging should work. Mm-hmm. Right. And similarly, like when we're talking about before building, um, chatting, um, before we started filming about, okay, how can I do PDF export?Well that's functionality that then merits. Maybe we should build a tool that has access to a computer sandbox in a file system and the ability to write code. Right? Right. Um, but it's because we're thinking about the fact that our users to do their, to do their daily work, need to export PDFs, not because we're like, Hmm, I think a computer tool could be cool.Like, let's just see what happens. Mm-hmm. Like we, we have to focus on some user journeys, otherwise we just don't have like, enough strategy to, to prioritize.[00:10:29] swyx: I think there's a lot of like really strong opinions that you've had. Do you have like sort of like a towel of Sarah Sachs? Like, you know, like what, how do you run your team?Like I feel like you just have accumulated all these strong opinions. Obviously part, part of this is your, your token town thing.[00:10:43] Sarah Sachs: I think the TAs working with Service X is, um, you'd have to, it depends who you ask. Um, I think it depends if you're on my team or a partner Right. Or a vendor.[00:10:54] swyx: Yeah. There other people want to run their teams the way that you're Yeah.You're like bringing these things. And then also similarly, uh, Simon, when you did the custom agents demo, you had like, well, we've been using custom agents and here's the super long list of everything that we do. No humans ever read it. Right? That's what you said. I was like,[00:11:07] Sarah Sachs: yeah. So I think for, for me, um, something that I learned very quickly and became very comfortable with was that my job was not to be the ideas per person or the technical expert.My job was to make it so that everybody understood the objective, had a resource to help prioritize what they should work on, and had an avenue to prioritize what they thought was important. And I think that's true with all, all leadership, but I think especially on the AI team. Almost all of our best ideas come from prototypes, from people that have a cool idea because they saw a user problem, and it's a huge disservice if all of those ideas have to pass, like the sniff test of what me and a product partner or Simon and Ivan decided were the direction, right?Because a lot of what we're doing is leaning into capabilities, so. I think that's the first thing is like, I don't really view like the role of engineering leadership as like, uh, hierarchical, nor has it ever been, but especially now, like very willing to change direction based on, um, like proof is in the pudding.Yeah. And like, and I think we have rebuilt our harness three or four times. And when you do that, then the second rule of engineering leadership is like you need to build a team that's comfortable deleting their own code and is very low ego and is driven by what's best for the company. And, um, doesn't write design docs because they think it's their promotion packet.Right. And that's a culture that notion had long before I joined, but like our willingness to just swarm on different problems and um, redo things that we've built before because something has changed. Like, there's a lot of friction that can happen at companies when you do that. And it doesn't happen at Notion.And because it doesn't happen when new people join. Like they don't wanna be the ones that are saying, we shouldn't do this. I wrote that code. So then it's, you know, you, you create a culture that everyone thoughts and that culture comes directly, I think from Simon and Ivan though, um, because they're very open-minded.[00:12:50] swyx: Anything that you,[00:12:50] Simon Last: you'd add? I'm not a manager, like, like, like Sarah is. Um, a lot of my role is really to try to think a little bit ahead, make sure that we're, we're building on the right capabilities and then like the prototyping stuff. And yeah, it's really, really critical to always just be starting again.It's like, okay, this is new thing. What does this mean? What if we just rethought everything or wrote everything? And so I, I'm, I'm basically just doing that in a loop every six months.[00:13:16] swyx: Yeah. Do you believe in internal hackathons for this stuff?[00:13:19] Sarah Sachs: I think there's like two different versions. So one is like, we just have a, a, a solid bench of senior engineers that come and go on what we call the Simon Vortex and Productionizing what we built, right?Because when you're in the Simon Vortex, the velocity is super high. The direction changes daily, and it's meant to be like the equivalent of a SC Works lab. We don't need to do hackathons for that. We need to have senior engineers that we trust to come in and out of those projects. For instance, like management boundaries are really loose.Like you report to him, but you work for her right now. Yeah. That's something that when we hire managers, it's important they don't care about because we tend to form more structures. Yeah. Don't be too[00:13:54] swyx: territorial.[00:13:55] Sarah Sachs: We form more. It's after we ship things, not not before, just historically. Um, the second thing is we do have companywide hackathons.Actually we just had our demos day for the hackathon we had last week this morning. That's more for people that aren't directly working on the project, feeling like they have the time to pause and learn how to make themselves more productive or how they would use notion custom agents to build something.Or part of the hackathon was actually encouraging everyone across the company to build their own agentic tool loop, calling from scratch. Follow like an every blog post on how to do what I think because we want[00:14:26] swyx: just with the compound engineering one. Yeah.[00:14:28] Sarah Sachs: We want everyone to use cloud code in the company or whatever the coding agent they please and understand that fundamental.So we set aside a day and a half. We're all leadership, encourage everyone on their teams across the company to do it. So we have hackathons like that. I would say like kind of facetiously, like everything we build is a little bit like a hackathon until it graduates and puts on big boy pants and as a product ops rollout leader and has a assigned data scientists and stuff like that,[00:14:54] swyx: security review enterprise stuff,[00:14:56] Sarah Sachs: actually security reviews one of the things that we bring in first because it just slows us down way more and, um, causes a lot of tension and they build better product if they're involved early.So, um, that is probably the first person to get involved in something that's the[00:15:09] swyx: right PR approved answer.[00:15:10] Sarah Sachs: No, but it's not just PR approved. It like, um, um, it's[00:15:13] swyx: actually real. It's actually real. It's like, um, I'm just saying scar[00:15:15] Sarah Sachs: tissue.[00:15:15] swyx: Yeah,[00:15:16] Sarah Sachs: because like, you know, my background's also, I worked at Robinhood for a number of years.Yes. So like, uh, compliance and things like that, um, are a little bit more, you learn the hard way when it doesn't come naturally.[00:15:26] Simon Last: Yeah. I think the. The hackathon is really important for uplifting the general population, but like, if that's the only way you can build new things, you're kind of toast. I mean, it, it has to be like the daily processes, like, you know, building these new things.Um, and it has to be about, I think like, I think in the AI era a lot more leverage accumulates to the most curious and excited people. And so it's like we're all about just like activating that energy. You know, like if someone's protesting something on the weekend that they're excited about and it's important, that should be the main thing that we're doing.Yeah. Um, it's not a hackathon that we schedule once a quarter, it's just like, yeah. Daily process. Part of the culture.[00:16:02] Sarah Sachs: I mean, that's how we shift image generation and notion now. It was always this thing that would be kind of nice to have, but it wasn't really clear where that was necessarily aligned in product priorities.It'd be a lot of work. And we had someone on the database collections team, Jimmy, who was like. I really wanna do image generation for cover photos and inside notion. And we're like, if you wanna build it, like it's, do it please. Like we encourage you. We gave ‘em all the resources of working directly with Gemini and being able to like track the token usage and it working through endpoints.We gave them eval, support, everything, and then became a, a full project.[00:16:34] Alsesio: Yeah.[00:16:35] Sarah Sachs: That's why you can't have like ego as a, a leader. Like that's, that's how we work.[00:16:39] Alsesio: What's the size of the team today, both engineering and overall?[00:16:43] Sarah Sachs: I manage, uh, the team. That's what we'll call it. Core AI capabilities and infrastructure.That's about 50 people. But then we have per i partner teams that do packaging. So how it shows up in the corner chat versus custom agents versus meeting notes, that's another 30, 40 people. And, and then every team that has a product service at Notion that a user can interface with owns the tool that the agent interfaces with the editor team.The team that did CRDT for offline mode is the same team that handles how two agents, um, edit competing blocks. Mm-hmm. Right? It's the same problem. The team that built the underlying SQL engine is the same team that owns how the agent asks it to run a SQL query, and it does it performantly. And so from that regard, anyone working on product engineering is tasked with making them work for customers that are humans and agents because over time the majority of our traffic will be coming from agencies using in our interface, not humans.And so. Our objective is to make it so that the whole product org is building for agents.[00:17:40] Alsesio: Yeah. How has it changed internally? The activation bar is kind of lowered a lot. Like anybody can kind of create a prototype very, somewhat easily, especially if you're like an existing code base. Have you raised the bar on like what type of prototype people need to bring forward to gonna be taken?Not like seriously, but like, you know what I[00:17:58] Simon Last: mean? Yeah. I think the bar is lowered in many ways. Be like, one thing our, uh, our team built that is really cool is our, uh, our, our design team made a whole separate GitHub repo, uh, called the, the design Playground. And it's basically just to create a bunch of like, like helper components and you, uh, for, for quickly a throwing together UIs.And it's become like actually quite sophisticated. Like it has like an agent in there and like, uh, that's pretty fun. So like, we pretty much, like, they don't do mocks, they just make like, like full, full prototypes.[00:18:27] swyx: Here it is. It works.[00:18:28] Simon Last: They give you like a u rl. They're like, okay, all right. So we have to make the, like the real production version of that.Um, and then for engineers. A prototype looks like just making it a feature flag that actually works. Like that's sort of the bar.[00:18:39] Sarah Sachs: Something to understand that's really unique about notion. One of the reasons I joined we're super lucky is no one uses Notion in their job as much as people that work at Notion.[00:18:46] Simon Last: Of course.[00:18:47] Sarah Sachs: So I think there's very few companies, maybe if you worked on Chrome I guess, but like everything that we ship, we ship internally first and get a lot of really quick feedback. And also sometimes our dev instance is totally borked and you have to change a bunch of flags to get things done. And that's kind of like, but everyone, so people that do it ticketing, people that do supply chain procurement, recruiting, everyone is using the same instance of notion with like a lot of flags on for these prototypes people build.Um, and so we have this, Brian Levin, one of the designers on our team, I think evangelize this concept of demos over memos.[00:19:18] swyx: Ooh, too[00:19:20] Sarah Sachs: good. Um, which has been, uh, very good for building demos, and I think it's put a big pressure point on us to have really strong product conviction, because if anything can be demoed, you really need a strong filter of making sure that if you know, you're doing X amount of work, you're making the, you're, you're focusing on one tower, you're not just building a really flat hill.Right. That's actually where I think there has to be more conviction from our PMs, um, and our designers and, and well, the company really to have conviction of what journey we're going on.[00:19:52] Simon Last: But overall, I feel like it works pretty well. Like people, almost all the engineers have good enough taste to realize that like, this prototype doesn't actually make sense in the product, or, or it does.So it's not that common that I would see a prototype. It's like, oh, this makes no sense. Mm-hmm. It's like, you know, people are doing reasonable things and, and, and then it's just a matter of. Which things we build first and then often just, just figuring out how to turn it on and off. There's our, in the, in our like experimental chat ui, there's this, there's probably like, like a hundred check boxes in there.[00:20:22] Sarah Sachs: Kills me[00:20:23] Simon Last: the things you could turn on and off.[00:20:25] Sarah Sachs: Uh, but I think that, okay, so that is kind of true, Simon, but like being the person that manages the evals team, like there is a level of intensity that it adds to the platform team. So, you know, if we're gonna do image generation and notion, all of a sudden the way that we do attachments and the way that we, um, our LLM completion like cortex talks and expects tokens back and now it's getting images back.Like there's a lot of platform work that we do need to, like solidify a little bit. So sometimes it'll be in dev for a couple weeks before it makes it to prod just because we still have to like, make it robust, make it HIPAA compliant, ZDR compliant, figure out the right contracting with the vendor, whatever it is.And we need to eval it because we want the team. To still maintain what they build. That's the one thing is like if we have a bunch of prototypes, it can't just be like a small group of people that then maintain whatever end prototypes. So we have invested a lot of people in an eval and model behavior understanding teams that, we call it agent dev velocity.So your dev velocity building agents can be faster if we invest in that platform. And so we have a whole org dedicated to Asian, um, platform velocity so that you can build your own eval and then maintain it once you ship it. So if a new model release comes out and we, every[00:21:38] swyx: team maintains their own eval,[00:21:40] Sarah Sachs: we maintain the eval framework.Every team owns their own evals and a lot of them we've integrated to Optin, to ci, or we run them nightly and we have a team, uh, a custom agent that triggers to a team to look at the major failures. That's really critical because if we have like all these different surfaces now, a lot of it's on the same agent harness, so it's easier to maintain.It's just packaging of different agent harnesses, but new functionality of the agent. Let's say that like we wanna update like. Uh, you know, they deprecated, sonnet, um, four or whatever it is and we need to auto update. Are[00:22:11] swyx: they already? That's so, okay. Yeah. Actually wasn't that long ago.[00:22:14] Alsesio: Theywere[00:22:14] Alsesio: just 3.5.[00:22:15] Sarah Sachs: 3.537. Just got deprecated.[00:22:18] swyx: 3 7, 5 0.2 or, yeah. No,[00:22:20] Sarah Sachs: it's not. 5.2 is five point. Five point no. Yeah, five four is 40% more expensive than five two. So if they deprecated five two, you would hear they can, you would hear from me about that one. Um, but, uh, another conversation to have.[00:22:35] swyx: I have a cheeky evals question for you.Have you noticed any secret degradation from any of the major model providers?[00:22:40] Sarah Sachs: Secret degradation,[00:22:42] swyx: like. During the War Bay, when it's high traffic, it suddenly gets dumber.[00:22:47] Sarah Sachs: Yeah. I mean, not just between the, I mean, we definitely notice flakiness, we've definitely noticed, particularly for some providers, that things are slower during working hours and[00:22:57] swyx: there's a latency argument.Yes. Not a quality argument.[00:22:59] Sarah Sachs: No. I think the quality difference that's interesting is, um, even though companies that say they're selling the same, a, it's really into like quanti quantization, but like companies that say they're selling the same model through different vendors, whether it be through first party or Bedrock, Azure, et cetera.We do see different qualities sometimes, and that's not necessarily what's advertised.[00:23:21] swyx: Yeah. Kidney went to the point of like, if we, they shipped like this, like eval across all the providers and it was like very obvious we were secret equalizing and it was very,[00:23:28] Sarah Sachs: yeah. But[00:23:29] swyx: that's very embarrassing.[00:23:30] Sarah Sachs: You know, um, we hire Subprocess to figure that out for us.So we just wanna understand where it's regressing or where it's optimized. And sometimes we're okay with regressions that optimize latency if they're the appropriate regressions. Our job is to make sure we have the evals to understand the changes that are important to us. And even like when we're partnering with labs on pre-releasees of models, they'll send us multiple snapshots.And this is less about quantization, but more just regressions. Like they have shipped models that were not the snapshots that we wanted, and they have changed the snapshots that they shipped based on the feedback that we give. Because our feedback tends to be more enterprise work focused and not coding agent focused.And definitely those can be bummers, like, you know, uh, we know that this wasn't the version you wanted, but we'll help you make it work. I mean, we always make it work, but that definitely happens.[00:24:16] Alsesio: Yeah. Do you have, um, failing evals that you're just hoping, oh, that will have success eventually when a good model comes out?[00:24:23] Sarah Sachs: Uh, I mean, yeah. So I think. I mean, I could talk about this for 60 minutes, so I will limit myself. I think it's a real issue when people say evals and it's just like, that's quality, that's like unit, I mean, it's like saying testing. It's not just unit tests, right? So. We have the equivalent of unit test.Regression test. Those live in ci, those have to pass a certain percent, you know, within some stochastic error rate. Then we have, as you're building a product, evals of these aren't passing right now, and this is launch quality. So we have a report card and we need to, on these categories, you know, be it 80 or 90% of all of these user journeys to launch, and then what we have what we call frontier or headroom evals, where we actively wanna be at 30% pass rate.And that's actually been a effort that we took in partnership with philanthropic and OpenAI in the past maybe two or three months, because we actually hit a point where our evals were saturated and we weren't able to really give insightful feedback other than it wasn't worse. And not only is that not helpful for our partners, it's not helpful for us to understand where the stream is going.You know, going back to that analogy. And so we spent a lot of time thinking about. What notions last exam looks like, right? Mm-hmm. Not just humanities, last exam. Ooh, notions last exam. Mm-hmm. And, um, there's a lot of, you know, dreams about what that would look like. I know we've talked a lot about benchmarking, um, swix, but, uh, yeah.Notions last exam is a big thing inside the company and we have people, full-time staff to it exclusively. Mm. We have a data scientist, a model behavior engineer, and an full-time, um, evals engineer just dedicated to the evals that we pass 30% of the time.[00:25:56] swyx: What you're hiring for[00:25:57] Sarah Sachs: MBEs? I am hiring[00:25:58] swyx: What is an MBEA[00:25:59] Sarah Sachs: model?Behavior Engineer Model. Behavior engineers started with a title data specialist before I joined when they were working with Simon on like, uh, Google Sheets and like Simon just needed someone to look through Google Sheets and say, yes, no, this looks bad. This looks good. Right? And so we hired people with kind of diverse linguistics background.We had like a linguistics PhD dropout. Mm-hmm. And a Stanford ate new grad. And they're amazing. And they formed a new function basically. And over time we've built a whole team, um, with a manager who's now kind of reinventing what that role is with coding agents. So they used to be kind of manually inspecting code.Now they're primarily building agents that can write evals for themselves or LLM judges. There's a really funny day I can send you the picture where Simon, about a year and a half ago, was teaching them how to use GitHub. Um, and they're on the whiteboard and it was like, okay, I think it would be so much faster if our data specialists learned how to use GitHub and like learned how to commit these things in Dakota.And, and that was then and now I think, you know, coding has been a lot more accessible. Um, but moving forward it's this mix of like data scientist PM and prompt engineer because there's craft in understanding like even like what models can and can't do things. How do we define like that headroom? How do we define like what a good journey is?Um, is this model better or not? Why is this failing? There's some qualitative work, but then there's also like a lot of instinct and taste to it, and that's not necessarily software engineering. And so we have like very firm conviction and we have had for a number of years now that that is its own career path and we have always welcomed the misfits, so to speak.So we really firmly believe that you don't need an engineering background to be the best at this job. And that's what's quite unique about this particular role.[00:27:37] Simon Last: Yeah, this is something that I've been pretty excited about recently is we made an effort basically to treat the eval system as like an agent harness.So if you think about it, like, you know, you should be able to have an agent end-to-end, download a dataset, run an eval, iterate on a failure, debug, and, and then implement a fix. And ultimately you should be able to, you know, drive the full time process with a human sort of observing the, you know, the outer uh, system.So yeah, we went, went pretty hard on that. And that's, that's worked extremely well so far. It's like basically just to turn it into a coding agent, uh, uh, problem.[00:28:11] swyx: Your coding agent or just whatever[00:28:13] Simon Last: harness No coding agent. Yeah, code, cloud code. It should be totally general. Yeah. I think if it would be a mistake to like, like fix it on any, any particular coding agent.At the end of the day, it's just like CLI tools.[00:28:21] Sarah Sachs: It's like the same way that you would've a coding agent write the unit test. You should have a coding agent write the eval.[00:28:26] swyx: Yeah.[00:28:26] Sarah Sachs: But there's a lot of supervision in that still. We just don't believe that supervision has to come from software engineers because a lot of it is like, um, kind of you XREE and whatever, and these are the people that also triage failures and tell us where we should be investing next.[00:28:40] swyx: Yeah. I'm gonna go ahead and ask a spicy question. Is there a data, there are no software engineers at Notion.[00:28:46] Simon Last: Um,[00:28:46] Sarah Sachs: what does it mean to be a software engineer?[00:28:47] swyx: Exactly.[00:28:48] Simon Last: I mean, I think the way things are going is like we're on some continuum where. If, if you look back three years ago, humans were typing all the code and then we had auto complete, you're typing list of the code.Then we had sort of like filling agents, filling lines, and now we're getting into like agents doing longer range tasks where you can debug and implement a fix and then verify it works and you know, get your, get your PR even like, like Merion deployed. I think we're sort of just moving up the abstraction ladder and then the human role becomes more about observing and maintaining the outer system.There's a string of agents flowing through, like me prs what's going off the rails. Like what do I need to approve? Is there like a learning or memory mechanism that that works? So it's kind of a hard engineering problem. There's a, you know, there's, there's a lot to do there. I think we're just sort of moving up stack[00:29:34] Sarah Sachs: the same transition machine learning engineers have made, right?Like I haven't looked at a PR curve in a while.[00:29:39] swyx: Yeah. You used to do this stuff and now, um, auto research can do it,[00:29:42] Sarah Sachs: right? Like I think it depends on what you define as a software engineer.[00:29:46] swyx: Yes. It's, that's changing for sure.[00:29:49] Sarah Sachs: I think every software engineer in notion this summer went through like this, um, sheer, um, one of our engineering leads of the company called it, like every software engineer is going through the, the, uh, identity crisis that every manager goes through, where all of a sudden they realize their ability to write code is less important than their ability to delegate in context switch.And I think that is a transition out of being a software engineer. But[00:30:12] Simon Last: yeah. Yeah, there's a critical difference to being a manager, which is that like, it is actually very deeply technical. The problem, you know, humans are very like, like, like fuzzy and you can't like treat a team of humans like a, like a rigorous system where like, you know, prs like, like flow through and can be in like a block status and then what happens when they're blocked, right.With a set of agents, you actually can do that. And, and, and I think it's actually, there's a lot of interesting technical rigor that that goes into that it's like it's a technical design problem. Ultimately.[00:30:42] Alsesio: What is the design of the software factory that you're building?[00:30:46] Simon Last: Yeah, I mean, I think we're. Trying a lot of different things.I mean, ultimately you want to design a system that requires as little human intervention as possible, but like still maintaining the in variance that, that you care about. So yeah, we're exploring a lot different ideas there. I mean, I think I could talk about a few things I think are important there.Like, one thing I think is really important is, um, having some kind of like specification layer you can just commit marked on files. Mm-hmm. That works pretty well, but[00:31:15] swyx: it's nice to be notion man. I'm just saying like the spec, like Yeah. The natural home for specs is notion.[00:31:21] Simon Last: Yeah. Right. It can be a database of pages.Yeah. I mean, it needs to be something that is, you know, human readable and I viewable and I think that's pretty key. Another really key component is like the, the self verification loop. Yes. You need really, really good testing layers, basically. And that's a really deep, uh, uh, problem. But by getting that right, you know, and then, and then it's kinda like the workflow of like.What happens when there's a bug? How does it flow into the system? Like, is it like a subagent working on it? How does it make a PR and how does that get reviewed? And me, and then, you know, so there's like the, the flow or process.[00:31:56] swyx: Yeah. Cool. Uh, you know, one thing we did work out before you guys came in was this demo or this[00:32:01] Simon Last: agents[00:32:02] swyx: agent demo.Uh,[00:32:03] Simon Last: so every,[00:32:04] Alsesio: every time we do an episode, we try the product. Right. I don't think there's ever been an episode that I haven't tried. Yeah. Um,[00:32:11] swyx: and we, we try, try is a, a big word. Like since day one lane space has been on Notion, but this is the, this is the net new thing. Yes.[00:32:18] Alsesio: So this is for Nel Labs, which is the space we're in.So next week we're opening applications for tenants. So there's a web form, let me, we got this form done here. Uh, so, uh, before. Uh, the workflow would be I get an email, then I look at the person. It was like, should I spend time talking to this person? Then I respond, they respond back. So I build this. So the name it came up for on its own.Can you maybe h how do, how does it come up with its own name?[00:32:43] Simon Last: Yeah, that's a pretty app name. It's, it, it is just a random, it's a random, a name generator.[00:32:47] Alsesio: Oh, that's funny. It just came,[00:32:49] Simon Last: the fact that it picked that is, is kind of hilarious. I'm pretty sure it's just determined,[00:32:54] Sarah Sachs: resilient collector. I, I think I've never looked at the code for that.I've never second guessed it. I think it's kind of like a madlib situation.[00:33:00] Simon Last: Yeah, I think you're right. Yeah. It's, it's totally a, a deterministic. Oh, I thought it was great. Yes. Although, although when the, if you use the AI to set itself up, it can update its own name, so. Okay. Um,[00:33:11] Sarah Sachs: how did you create it? It, did you just do[00:33:12] Alsesio: classroom?I,[00:33:13] Sarah Sachs: okay.[00:33:13] Alsesio: I did, yeah. I'll say just check my inbox for applications for a coworking space. Keep a people, so it created the database for me. Which I have here. And I guess database is like an notion table because everything is notion. Um, and then whenever um, an email comes in, like here, it just creates a new role for the person.Mm-hmm. And then it uses web search to enrich the mm-hmm. The profile. So it kind of like searches the web and it's like, this is who this person is, this is when they say they wanna move in and kind of updates everything else. This is, I mean, it's not a GI, but to me, I don't wanna do this work. So it feels like, I mean, it took me maybe like 15 minutes to set up the whole thing.Um, and I really like that most of the information should live here. You know, it is not like some other tool asking me[00:34:01] Sarah Sachs: Yeah.[00:34:01] Alsesio: To like, bring my stuff there. It's like I would've probably already created an ocean thing.[00:34:06] Sarah Sachs: Mm-hmm.[00:34:06] Alsesio: So[00:34:07] Sarah Sachs: most of our biggest use cases and gains are from. That extra layer of human involvement in the process to make it so right.And so like one of our biggest use cases is bug triaging. So if someone posts something in Slack, can you just have a custom agent that lives there that has its own routing constitution of what team this belongs to, creates a task in your task database and then posts in that Slack channel, right? Like that's like one of the first things that we built internally, I think.And it's completely changed the way that notion functions as a company. Nothing falls through, well, most things don't fall through the crack. We don't know what we don't know. But it's not replacing people, it's replacing processes.[00:34:44] Alsesio: Yeah.[00:34:44] Sarah Sachs: Right.[00:34:45] Alsesio: And I'm curious how you think about composability of these things.So the other one I was working on is like a. These filler. So whenever somebody signs up as a tenant, kind of he'll sell the lease for them. There should probably some agent that is like office manager agent mm-hmm. That can handle the request, make the lease, and then, uh, give them a ADA access to the office and all of that.How do you think about that feature?[00:35:08] Simon Last: Yeah, so I mean, there's, there's two ways you can compose. One way is by using like the data primitives. So you can, you know, you, you could give, you have one agent, uh, be writing to the database and there's another agent that's walked in the database. So that's, that's one way that they, they can coordinate that's like a little bit more decoupled and mm-hmm.Works really well. Or you, you can couple them. So I, I think it's actually not released yet. Releasing it like next week is, uh, in the settings for an agent, you can give access to invoke any other agent.[00:35:34] swyx: Hmm.[00:35:34] Simon Last: So you can have them just. Just, uh, uh, talk directly. So[00:35:37] swyx: you, was there a limit on like, number of recursions or just,[00:35:40] Simon Last: um, probably,[00:35:42] swyx: you know what I mean?Like, you can just get an infinite loop that way there's[00:35:45] Simon Last: some kind of Yeah,[00:35:46] Sarah Sachs: I think it's, there is actually a number somewhere.[00:35:49] swyx: I believe I'm just, you know, like, you're, you're, someone's gonna screw up. You[00:35:51] Simon Last: should you try to see[00:35:53] swyx: Yeah. I mean, everything's gonna be paperclips.[00:35:55] Simon Last: Oh, yeah. Yeah. But, uh, but, but that's really useful.Yeah. So we, you know, like I just, I, I helped, uh, someone internally the other day, they had, they had built like over 30 custom agents for, uh, for our go to market team doing all kinds of different things. You know, for example, like researching, you know, like, like filling information about, about a customer or like, like triaging customer feedback or like, uh, something like that.Literally over 30 of them. And, and then he, and then he even made like a database of all the agents and then he is like, okay, and, and now I'm getting 70, over 70 notifications per day with just the agents are blocked on various things. Uh, and then I was like, oh, okay, cool. You know, the obvious thing to do there is to make a manager agent,[00:36:32] Sarah Sachs: right?[00:36:33] Simon Last: That's gonna sort of blocks be another abstraction layer in between your, your, uh, uh, 30 agents. Uh, so yeah, we, we send out with like a manager agent and then has access to invoke all the other agents and it's sort of like, like watching and observing them and then it sort of, it just creates a layer of abstraction.So instead of 70 notifications per day, it's like, like five. And then, and then the manager agent can help like, uh, debug and fix any problems with the,[00:36:54] swyx: does this is a concept of like an inbox or something like piece, you're basically saying that they can message each other?[00:37:00] Simon Last: Yeah.[00:37:01] Sarah Sachs: Well[00:37:01] swyx: they use the system of record, which, which is[00:37:02] Sarah Sachs: notion, so we[00:37:03] Simon Last: actually, yeah, we didn't make any special concepts at all.[00:37:06] swyx: They're interested to the motion notifications that I would've got,[00:37:09] Sarah Sachs: they can just like write a task to a database that the other agent's task to listening to, or they can actually call a web book to the agent, like they can just add the agent. Okay.[00:37:17] Simon Last: Yeah, I mean, this is something that, that we're still working on.I, I think we, you know, like, like generally, generally the way we do these things is, you know, you first make it possible, maybe like a sort of janky way. So I, I, I think the way I set ‘em up is like, you know, we created like a new database that was sort of like issues mm-hmm. That the custom agents were, were experiencing, and then gave them all access to file an issue and then the manager has access to, to read the issues.Um, and that works pretty well, essentially like, like give it its own like internal issue tracker just for the agents. And then, you know, if that becomes a, a concept that seems useful, generally maybe we will think of how to package it in. But I mean, generally we try to just keep it to composing the primitive if we can.You know, another example of this is we have no built-in memory concept. Memory is, is just pages and databases. And so if you wanna give a memory, just give it a page and give it. Edit access to that page and the[00:38:03] swyx: human can edit it. Agent can edit[00:38:04] Simon Last: it. Yeah. And so that works, that pattern works extremely well on it.And you know, depending this case, you can have it be just a page or it could be an entire database with, you know, or, you know, I can have sub pages is is pretty on what you can do with that.[00:38:15] Alsesio: So when I was setting this up, uh, I connected my inbox and it was like, do you wanna use Gmail or Notion Mail? And I'm like, I don't wanna use Eater, I just want you to do it.I'm curious how you think about, you know, notion, mail, notion, calendar, all of these kind of ui ux interfaces, full stack[00:38:29] Simon Last: notion.[00:38:30] Alsesio: Yeah. When like at the same time you have the agents abstracting them away from you in a way, you know, how do you spend like the product calories so to speak?[00:38:37] Simon Last: Yeah, I mean, I think it's pretty important that you don't have to use, not your mail to connect to the mail capability.So we can just connect to Gmail or, or whatever you want, uh, to use. And we're thinking of the mail service as being really great to the extent that it's really agent built, right? So maybe the mail app is just sort of a prepackaged agent that helps you automate your, your inbox.[00:39:00] Alsesio: Yeah, the auto labeling is great.Think[00:39:03] Sarah Sachs: the, when we, um, integrate with Gmail for instance, we have a series of tools available that are available via MCP or API to Gmail. When we integrate with Notion Mail, we have the Notion Mail engineering team to build us the, um, exact right tools that optimize latency, optimize performance and quality.They own that quality. Um, there's product leads there. They're directly thinking about the user problems that happen in mail. So it tends to be when we build integrations and connections, we build natively first. Um, and then think about, um, extending them generally just because it's also easier. Mm-hmm. Um, um, to build natively first.Um, so that tends to be how we phase things out.[00:39:43] swyx: Talking about integrations, you prompted me, so I gotta ask. M-C-P-C-L-I. What's going on? What's the[00:39:48] Simon Last: Yeah. Opinion. I think, I mean, I'm, I'm definitely bullish and excited about cli. I think there's a few really cool things about cli. So one really cool thing is like, um, is that it's in the terminal environment, so it gets a bunch of extra power.So it, you know, for example, it can like, like paginating and cursor through like long outputs. Um, and it has a progressive disclosure inherently. Uh, so, you know, you don't see all the tools at once. It's just, you see the CLI wrapper and you can like use the, the help commands and, and, and read files. And then I think the most important thing that's, that's super cool is that there, it's also inherently a, a bootstrapped.So if there's an issue, uh, the agent can debug and fix itself within the same environment that it uses the tool.[00:40:30] swyx: Mm.[00:40:30] Simon Last: Right. Like, you know, I think I saw a tweet this morning. Someone said, you know, my agent didn't have a browser, so I asked it to make all a browser tool and within a hundred lines of code, it gave itself a little browser, like, like wrapping the, the, the chromium API, um.That's pretty incredible. And then if there was a bug, it would just immediately try to fix it. Mm-hmm. Right. On the other hand, if you use an, you know, if you use like of, of the Chrome dev tools, MCP, I've had this issue where like, like sometimes the transport gets like messed up. If it gets messed up, the agent has no way to fix itself.It, it no longer has a browser, it's, it's not broken. Right. I think that's, that's pretty fundamental, but I would say like a lot of the, the bad things about it can be fixed. Uh, so I think like, as a progressive disclosure, that can be fixed with, with right harness. Like, it, it obviously doesn't make sense to show it all the tools all the time.That's not really inherent to the MCP protocol. It's just like how you wrap it and use it.[00:41:16] swyx: There's many poorly built MCPs because we didn't know.[00:41:19] Simon Last: Yeah, yeah. I mean it was just early, like, like the obvious thing is, uh, you know, to start with is, is to just show it all the tools and it's like, okay, now we have a hundred tools.Yeah. And like the tool calling actually works. So let's of[00:41:28] swyx: your success[00:41:29] Simon Last: give it a way to like, like filter to source the tools. So yeah, I would say like broadly speaking, I'm really bullish on cli. I'm still bullish on CPS and in a certain environment. I think in, in particular, CP is really great for when you want sort of like a narrow, lightweight agent.I think there's, there's definitely a lot of use cases where, where you don't want like a full coding agent with a compute run time. And also you want it to be like more tightly permissioned. MCP inherently has a really strong permission model, like all you can do is call the tools. A CLI is a little bit murkier.It's like, can I access the, if PI token are you, like, properly sort of like re-encrypt the token so it can't like exfiltrate it, it introduce a lot of like, like new issues, which are. Real and hard to solve. And MCP is just like the dumb simple thing that works and it that it's pretty good.[00:42:12] Sarah Sachs: I'll add two more perspectives, not from it working well for Notion, but how notion like commits to both platforms.Notion is dedicated to being the best system of record for where people do their enterprise work. So we will always support our MCP and so far as other people are using cps, right? So regardless of our perspective, we've put a lot of effort into our MCP and we have a fantastic team that we're building, um, to do more there.And the second thing I'll say, I think, um, we all think a lot, but lately I've been thinking a lot about making sure there's a value alignment and pricing, um, with capability.[00:42:43] swyx: Literally our next question[00:42:44] Sarah Sachs: and. Needing language to execute deterministic tasks feels wasteful and requiring on a language model to interface with third party providers seems wasteful for tasks that don't require it.And particularly because our custom agents are using usage-based pricing. We think of pricing as like the barrier of entry for use of our product, and we're quite committed to making sure that it's not wasteful. Um, not just because it's a bad deal for our customers, but it's also bad business. We wanna have as many buyers, like there's a, there's an elasticity of demand and so if we can have our agents properly execute code that calls on CLI deterministically, it's a one-time cost, right?Versus constantly having a language model integrate with an MCP over and over and over and paying those like repeated token fees and it's happening outside the cash window, then you're paying for it over and over and over and it's just kind of unnecessary and less deterministic when it doesn't have to be.[00:43:36] Alessio: Yeah, the open-endedness I think is like, the main thing is like, well, if I go write code to just call an API, I would never use an MCP. But then you need an NCP sometimes when you know what to call, but you don't want it to restart versus like, I think the it built a browser from scratch is like, it's great when you're doing it on your own, but like if your customers were having your AI write a browser from scratch every time and you had to pay the token cost of that, yeah.You'd be like, no, no. The Chrome dev tools CP is actually pretty great. Just use that. I'm curious, how do you make that decision? Like should it be. Just straight API call very narrow. Should it be an MCP? Should it be super open-ended?[00:44:10] Sarah Sachs: Do you mean for when we ship notion capabilities or when we add capabilities to[00:44:13] Alessio: notion[00:44:14] Sarah Sachs: AI or,[00:44:14] Alessio: I mean, you might have a capability that the only way to do is an open-ended agent, like an agent with a coding sandbox.[00:44:21] Sarah Sachs: Yeah. In Notion ai they're not explicit, not We also ship an MCP.[00:44:24] Alsesio: Yeah. Yeah. In B,[00:44:25] Sarah Sachs: yeah.[00:44:26] Alsesio: Internally. Okay. Like is there ever a discussion of like, we're not gonna ship it because we're not able to tie it down? Or are you happy to just like,[00:44:33] Sarah Sachs: um, no. I mean, there are a lot of things where we choose not to use MCP because we wanna add more high touch to quality.I think search an agent to find is like the largest instance of that, where we have. Um, slack and linear and Jira search and notion that is not using necessarily the search MCP functionality that is provided by those companies. And that's because it's quite critical we think, to how our agent trajectories work is for us to have a little bit more control on the functionality of the search journey.And so it usually comes from quality and there's a long tail of things and that's why we built an MCP client or an MCP server, excuse me, so that people can connect whatever they want. There's that long tail, right. But we, for search particularly, I would say that's like the primary entry point, but there are other connections as well that it's a little bit of secret sauce a

早安英文-最调皮的英语电台
外刊精讲 | 历史性反超!中国AI用量碾压美国,美国输在这个你没注意的地方

早安英文-最调皮的英语电台

Play Episode Listen Later Apr 10, 2026 20:10


【欢迎订阅】 每天早上5:30,准时更新。 【阅读原文】 标题:The rise of China's hottest new commodity : AI tokens正文: China is gaining ground in the global AI industry's hottest commodity: tokens. Since February, Chinese AI models made by groups such as DeepSeek and MiniMax have overtaken US rivals in token consumption, according to OpenRouter data, which tracks these units of text, code or data processed by large language models. The shift points to a deeper change in the AI race,with Nvidia's Jensen Huang saying this month that the production and use of the digital units will drive the AI economy. Because developers are charged per token, it doubles as both a proxy for adoption of models and a pricing battleground between AI companies.知识点:gain ground phr. /ɡeɪn ɡraʊnd/become more popular or competitive. 占据优势;渐受欢迎e.g. Low-carbon lifestyles are gaining ground among young people.获取外刊的完整原文以及精讲笔记,请关注微信公众号「早安英文」,回复“外刊”即可。更多有意思的英语干货等着你! 【节目介绍】 《早安英文-每日外刊精读》,带你精读最新外刊,了解国际最热事件:分析语法结构,拆解长难句,最接地气的翻译,还有重点词汇讲解。 所有选题均来自于《经济学人》《纽约时报》《华尔街日报》《华盛顿邮报》《大西洋月刊》《科学杂志》《国家地理》等国际一线外刊。 【适合谁听】 1、关注时事热点新闻,想要学习最新最潮流英文表达的英文学习者 2、任何想通过地道英文提高听、说、读、写能力的英文学习者 3、想快速掌握表达,有出国学习和旅游计划的英语爱好者 4、参加各类英语考试的应试者(如大学英语四六级、托福雅思、考研等) 【你将获得】 1、超过1000篇外刊精读课程,拓展丰富语言表达和文化背景 2、逐词、逐句精确讲解,系统掌握英语词汇、听力、阅读和语法 3、每期内附学习笔记,包含全文注释、长难句解析、疑难语法点等,帮助扫除阅读障碍。

In-Ear Insights from Trust Insights
In-Ear Insights: AI And the Future of Work in 2026

In-Ear Insights from Trust Insights

Play Episode Listen Later Apr 8, 2026


In this week’s In-Ear Insights, the Trust Insights podcast, Katie and Chris discuss the future of work in the agentic AI world. You will discover how artificial intelligence will impact your career. You will explore the hidden reasons behind the upcoming leadership crisis. You will learn actionable strategies to protect your job from automation. You will build essential skills to succeed in this new era. 00:00 – Introduction 01:38 – Katie discusses automated task generation 02:51 – Katie reveals the hidden leadership crisis 04:43 – Chris examines the billion-dollar startup 08:18 – Chris reimagines corporate structures 09:40 – Katie explores cognitive overload 17:20 – Chris highlights the macroeconomic threat 20:46 – Katie shares strategies for self-starters 25:05 – Chris details an entrepreneurial mindset 28:34 – Call to action Watch this episode to take control of your career and outsmart the algorithms. Watch the video here: Can’t see anything? Watch it on YouTube here. Listen to the audio here: https://traffic.libsyn.com/inearinsights/tipodcast-ai-impact-on-employment-2026.mp3 Download the MP3 audio here. Need help with your company’s data and analytics? Let us know! Join our free Slack group for marketers interested in analytics! [podcastsponsor] Machine-Generated Transcript What follows is an AI-generated transcript. The transcript may contain errors and is not a substitute for listening to the episode. Christopher S. Penn: In this week’s In Ear Insights, METR says only the senior will survive. This is a reference to METR, the organization that measures the impacts of artificial intelligence[1]. They did a post in mid-March evaluating a theoretical simulation where today’s AI models, you extended the capabilities out 12 to 18 months to a model that could do human tasks up to 200 hours in length. Christopher S. Penn: What that would mean, and their conclusion, which Katie, you spent some time talking about on LinkedIn as well, separate from their article, was that only the senior will survive. Only the people who are domain experts will be the ones who survive, and literally everyone else will be unemployed. We’ve also seen this in economic data. Christopher S. Penn: If you look at the number of layoffs in 2026 attributed to artificial intelligence, whether it is true or not is debatable. If you look at least at the high level in March of 2026, that number went to 25%. A lot of tech companies doing layoffs, which is where that comes from. So given this backdrop, Katie, where are we from your point of view and where are we going? Katie Robbert: I mean, we’re definitely seeing it play out. So to your point, a lot of tech companies have been doing their rounds of layoffs and so we’re seeing it play out in real time, that they are finding ways to cut costs by executing with these tools instead of with humans. Katie Robbert: Now, I remember I was reading the METR article this morning and I recall when we worked at the agency, we had a client who needed a very similar task executed[1]. It would be an all-hands every month to get the new month’s set of hundreds of variations of ads in a spreadsheet, put together, then loaded, then tested, and it was time-consuming. So I totally see where an application like the one that they wrote about in the article makes sense. Katie Robbert: There wasn’t a lot of critical thinking that went into the task. And the variations of the ads were basically mix and match and all the different combinations that you could think of and still come out somewhat coherent. And so I totally respect using the tools for tasks like that. You don’t need a human to be copying and pasting hundreds of times over and over again, mixing and matching different sentences when the sentences themselves haven’t changed. Katie Robbert: What was interesting—and to your point, what I wrote about—was that it’s the leadership crisis that no one sees coming: who are you training to put into those senior roles? So today only the senior staff will survive. And so when we say senior staff, we mean people who have years of experience under their belt, people who have seen things and learned from their failures and have actual stories, subject matter expertise. Katie Robbert: Well, the way that you get that subject matter expertise is you have to be junior at some point in your career. I was a junior at one point, believe it or not. Chris was a junior at some point in his career. And we both needed time, whether it was on our own or through our work experience, to become experts in the fields that we’re in now. Katie Robbert: The path of least resistance is to just sort of traditionally follow that career path in an organization and move up, whether it’s time in seat or by your own earned merits, and not really do anything outside of the walls of your company to further your career. Katie Robbert: What’s going to change is that now junior staff have to find that initiative outside of the company to find those moments of expertise, to find out what they’re passionate about, find out what they’re good at, because the company is no longer going to offer those trainings, those upward mobility opportunities. Katie Robbert: So that’s sort of where I see things. That’s great. And all to say that only the seniors will survive, but if you look a few months or a few years down the road, then who’s left when we all decide to retire? Christopher S. Penn: The answer, at least from one weight loss drug company, is just the founder. This was a fascinating story that was in the news over the weekend. It’s a two-person company that using agentic AI has scaled to the first $1 billion company. Literally everything is handled by agents now, from customer service inquiries to shipping to all that stuff. Christopher S. Penn: And in the article, it said this was an 18-month journey. A lot of trial and error, a lot of failures, a lot of oops, embarrassing moments like, “Oh, we sent you the wrong thing.” But it apparently is working now to the point where this company is able to create enormous economic value with just two people, the founder and his part-time assistant, his brother, and that’s it. Christopher S. Penn: And by your traditional measures of success, that is working. So the question—I completely agree with you. This is a massive leadership crisis in the brewing. However, the question is, what should companies look like? Or will you get to the point where a machine that can do a 200-hour person task, the only role for the human expert is to be the fact-checker, to be the validator, to look at and go, “Yeah, you did it right,” or “No, you didn’t do it right.” Christopher S. Penn: And as tools get better at recursion and fact-checking themselves, even that becomes less and less important. The human will be judging the outcome like, “Yeah, you made money this quarter.” Katie Robbert: So the question is, what should companies look like? I think that’s the wrong question because I mean, look at our company. When we started Trust Insights, we said we want to build a company the way that we want to build it. Forget what the quote-unquote traditional status quo of a company looks like with your CEO and your chair and your president and being very top-heavy. Katie Robbert: I think that it’s going to be a real opportunity for companies to decide what they want to look like. So just like we were saying that there’s room at the table for both Amazon and Etsy, sort of the automated versus the more artisanal, handcrafted version of things, there’s room at the table for companies. Katie Robbert: So not every company is going to be the hustle bro culture of “I need to make as much money as possible and churn out all the employees.” Not every company is going to feel like they need to operate that way. And that’s okay. That does not mean that they are failing. Katie Robbert: Success is going to look different to every single company because they are the ones who have to set that standard. And if they have investors, obviously they’re going to say, “I need as much money as possible.” But guess what? Trust Insights doesn’t have investors. So we still have control over deciding what success looks like for us. Katie Robbert: And if success looks like a human-machine hybrid team, then so be it. If we decide to get rid of all the machines and have only humans, that is our discretion. We can make those decisions. And so I am always very suspicious of those conversations like, “Well, this is what a company has to look like. This is what success has to look like. This is what a team has to look like.” Katie Robbert: Says who? Get out of here. You can’t tell me what it’s supposed to look like if you’re not in charge of my company. Get out. Christopher S. Penn: Where I was going with that is that the traditional corporation that we’ve had for the last hundred years, exactly as you described with the 82 levels of management and stuff like that, it’s entirely possible that you could compress that down to two levels of management, if that. You have executives and you have people who do work. Christopher S. Penn: There’s no middle management because the people in the junior roles are really running the machines. The rest of the hierarchy is the machines. When I look at Trust Insights and what has happened just in 2026, and I look at the way that you in particular have been using agentic AI to do literally 20x the work that you used to… Christopher S. Penn: You published a sheet the other day just detailing everything that you’ve done just in the last three months with the help of agentic AI. And it is actually probably close to 100x what we’ve done. Obviously, it is our company; we can do it that way. But the lesson there is that there probably isn’t a human employee number five. Christopher S. Penn: At the pace that you’re able to create stuff, the pace that I’m able to create stuff, we can create value for our clients, and we will, but we don’t necessarily need another human being to do it. Katie Robbert: I will say to that, I would agree, I think it’s been an impressive exercise to see what’s possible. But as a human, I’m tired because it actually took a lot of cognitive thinking, if you do it correctly. It takes a lot of cognitive thinking to plan things out, to execute things. Yes, the machine is pattern-matching faster than I can as a human. Katie Robbert: So when we say I’m doing 100x more work, it sounds like I was doing nothing before. But once I really think through something, it comes together. It’s the thinking through things that takes me a little bit longer. I’m not one to just throw something against the wall to see if it sticks. I really want to make sure I’ve really explored it. Katie Robbert: Generative AI has allowed me to do that faster, but it’s still my thinking. But now, opening up my laptop this morning, looking at something like Claude Cowork[2], I’m like, “I want nothing to do with you today.” I am just burnt out, but I’m burnt out already. Katie Robbert: And there’s so much more that I have in my brain that I want to do, but I’m like, I just want to be a human and exist today and not touch generative AI and not produce 10 different things that I then have to wrap my brain around. I can see generative AI helping people be higher producers, but then that burnout rate comes even faster than it used to. Katie Robbert: So I think that there’s a definite risk. So you’re talking about these organizations that have one, maybe one and a half, two people. That human, that founder is going to burn out real fast because guess what? Even though the machines are doing the work, it’s still on your shoulders. Christopher S. Penn: It is. Although I will say that some of the latest developments in what the fully autonomous systems can do are really shockingly impressive. Where there’s even less of that, it still requires good planning. So that part is the same. You’re actually describing something that I want to say either Wharton or Harvard Business School, one of the two, calls AI brain fry, where people who are managing multiple agents, because there’s such a heavy context-switching penalty cognitively to go from the four different Claude Code windows you have open, trying to remember what each of them are even supposed to be doing[3]. Christopher S. Penn: It is extremely taxing. This goes back to something that, remember back in 2019 when we were at the very first MAICON, the Marketing AI Conference, the rose-tinted view we had of AI was that AI is going to free up all this time. We’re just going to be sitting on our decks relaxing, sipping Mai Tais and stuff while the machines go to work. Christopher S. Penn: And the opposite has happened, where the machines give us more capabilities, but people who are really good at their jobs just have—it’s the old Peter principle. Work expands to fill the capacity given to it. Katie Robbert: Guilty. Christopher S. Penn: And that’s where we are. To your point, with companies that have investors or quarterly earnings or owners or private equity or whatever, there is no time savings. None. Instead, you can do 10x more. Great. Do 10x more. Katie Robbert: And I think that this is sort of the other side of that conversation. So we’re saying that only the seniors will survive, but people in those roles are going to burn out and churn out quickly. So who’s there to replace them? You can say, sure, autonomous AI, but guess what? A human still needs to set it up, program it, come up with the plan. Katie Robbert: You’re going to tell me, “Oh, AI can do that for you.” Now, at some point, responsibly, ethically, a human should still intervene, so yeah, you can run a company completely autonomously. It’s probably going to go sideways. You’re going to have a lot of those oopsies, I didn’t mean that moments. Brand reputation is probably going to dip a bit. Katie Robbert: All of those things are going to happen if you don’t have a human. But those things happen with humans anyway. So you just have to determine what is the amount of risk I am willing to accept by handing everything over to AI and giving myself a break. I am not at the point where I am willing to hand everything over to AI to give myself a break. Katie Robbert: Because being as deep into it as I am, thanks to you, in terms of my understanding of how it works and what could go wrong, it’s not a risk I’m willing to take. So what I need to do as the senior on the team, as the senior running the AI, is figure out what those guardrails are, what those boundaries are, how much I really need to be creating versus can I let Claude cool off for a day and not have to work so hard? Katie Robbert: I don’t have to churn every day. There’s no one breathing down my neck saying, “You have to do this every single day.” I got on a roll and I was like, “Let me just get a bunch of stuff done.” And now I’m like, I can’t keep up with that pace. Christopher S. Penn: It’s interesting because I feel sort of the opposite. Katie Robbert: I know. Christopher S. Penn: I feel like I’m not doing enough. Perpetually. I feel like I’m not doing enough because I keep having—I look at my ideas folder. My ideas folder is literally hundreds of things long. “Wow, I need to speed up here.” Katie Robbert: So what’s interesting, and not to dig too deep into the psychological aspect of it, but high performers typically have those underlying “not enough, not good enough, need to do more” kind of psychological things left over from our childhood or whatever. These are just broad strokes. Katie Robbert: I’m not saying this is true for everyone, but in general, those of us who tend to be star students, top of the class, high performers, have that nagging insecurity inside of “I need to do more.” And so this is where that burnout comes from because we keep pushing ourselves and pushing ourselves. Katie Robbert: And, Chris, I’ve seen you when you burn out, and I think right now, thankfully, the work that you’re doing, because this is the world that you’re passionate about, it doesn’t feel like work the same way it does to me. Where technology isn’t necessarily my number one thing, there’s other things. But for you, you’re all in. You’ve been waiting for this moment. Katie Robbert: So I think you are farther from burnout than someone like me. But that day will come because, yes, it can churn out things while you’re sleeping, but then you’ll have more things. “I want to do this. I want to do this.” It’s going to keep you up later. It’s going to get you up earlier. Katie Robbert: It’s like, “Well, how many concurrent machines can I run? Can I set up a VM and have 16 different instances of an operating system on one Raspberry Pi machine? Oh, Raspberry Pis are really inexpensive. Can I set up a whole army of them on my back shelf behind me?” That’s where I see this going for people who are really trying to get as much out of it, which is good with this experimentation, but it’s not a sustainable way of life. Christopher S. Penn: It is not. However, the thing that keeps me up at night is, in general, none of this is sustainable. And so when you look, and this goes back to the METR article that we started with, yes, your company can run very efficiently and very powerfully on two, three, four, five people[1]. And you can sustain that as a company. Christopher S. Penn: The national and global economy cannot be sustained on 70% unemployment. That is correct. That is a recipe for disaster. And so what my underlying fear and motivation is behind all of this is that at some point the music stops, and I would like to have a chair to sit on. Christopher S. Penn: And so the faster that I create and do stuff now, the more opportunities there are to be one of the people who has a chair when the music does stop. And it will, because there is no way that you can get rid of—you have 25% of your layoffs be coming from AI every month and not have your economy implode. Katie Robbert: And I’ve thought about this as well. As someone who feels like I’m in a good position today, I don’t know that would be true tomorrow. If for whatever reason, Trust Insights folded, who’s going to hire me? Who’s going to pay me? Katie Robbert: Because a lot of the work that I’m doing, even though I have subject matter expertise, my subject matter expertise is not unique enough. Other people can do what I do. Other people are CEOs. Other people have operations and project management backgrounds. Other people work in change management. Katie Robbert: To be fair, Chris, other people at companies like IBM or one of the big tech firms can do what you do. So you’re not impervious either. And I think that’s something that—I hear what you’re saying. So even today, if the seniors survive, what happens to us tomorrow? Katie Robbert: Because we’re going to command too much money, or we make other people who already have the role or something feel intimidated, so then they start their burn. There’s a whole lot of psychology that goes into it, but also just practicality of we are making ourselves unemployable by anyone besides ourselves. Christopher S. Penn: Yes. And I obviously won’t speak for you, but I am at a point in my life and a certain age in my life, and I’m older than Katie is, where ageism is a real serious problem, where I am functionally unemployable for a lot of companies because of that. Christopher S. Penn: And so in terms of what do we do about this, what are the “so what” of this? Because it is a serious problem. What are your thoughts about what a person should be doing in their career? Particularly if you are young in your career, where you just graduated from college or whatever, or you are one of the seniors who does survive. Christopher S. Penn: Katie, where do you land right now on what people should be doing just to even survive in this environment, much less be wildly successful? Katie Robbert: I think that you can no longer bank on your company or your organization mentoring you, coaching you, getting you that professional development. They might still. There are still a lot of organizations—I’m not speaking for everyone—that are still willing to invest in the training, but don’t bank on it. Katie Robbert: Seek it out on your own. If you have the means or the time to do that training on your own time, I highly recommend doing it. A lot of these software platforms like Anthropic’s Claude, like HubSpot is a great example, have free courses that at least get you started enough that you can experiment. Katie Robbert: A lot of them have student-level fees. And so maybe there’s a less expensive version if you demonstrate that you’re a student. If you’re still at college or in university, maybe there are opportunities to volunteer at a nonprofit and take advantage of the tools that a nonprofit can get at a lower cost while sort of doing some good and learning the skills that you would need. Katie Robbert: So there’s a lot of different ways. Again, it goes back to that critical thinking. You have to get creative around what that learning looks like. Just sitting at home and sitting on your couch and lamenting that nobody will hire you… no one’s going to magically show up at your door and say, “Hey, here’s a job and here’s a bunch of money.” Katie Robbert: You have to take initiative. I think I could be wrong because I’ve never been in this position. Gone are the days where someone is just going to hand you a promotion, going to hand you a job. I’ve never in my life been in that position. I’ve always had to fight for what I wanted. I’ve always had to work for it. Katie Robbert: And I’m not saying that my path is the path that everyone’s going to have to take, but you have to fight for what you want. You have to take that initiative. Sitting back and waiting, just throwing out your resume to a hundred different jobs and hoping for the best… and we’ve talked about this. Katie Robbert: I mean, gosh, Chris, we’ve been talking about this for years. We could probably go back to old podcast episodes or YouTube episodes. Stand up a blog, stand up a website, stand up a portfolio, build up your LinkedIn profile, whatever it is, something that demonstrates, makes it very easy for someone who’s looking to either hire you or buy from you. Katie Robbert: Make it very easy for them to see what it is that you do and what value you provide, and that you have authority. Start somewhere, start a very small Substack. Start your LinkedIn newsletter. Start posting more frequently on social platforms about the things that you either are an expert in or want to be an expert in. Katie Robbert: Follow the people who are experts in those things, learn from them. This is not new advice. New tech just highlights existing problems. If you are not currently doing these things, then you’re already behind. Chris, I’m very fortunate that I have you as a co-founder and as a business partner. Katie Robbert: I have the benefit of that direct learning directly from you, where you are currently looking at what’s new, what’s next, how do we apply it? I’m at a serious advantage because I have direct access to you. Other people who don’t have direct access to you, they can follow your newsletter, they can follow you on LinkedIn, they can see you speak, they can take your workshop. Katie Robbert: There’s a lot of different ways they can learn from you. You are someone who is constantly trying to learn. So you are looking at what’s happening with these companies. Who do I need to follow? Who do I need to learn from? What are they talking about? What are the academics talking about? What are the latest studies? Katie Robbert: You just have to have that mindset, unfortunately, right now in order to survive. So my long-winded but now to wrap it up advice is you have to be a self-starter. You have to be motivated to learn something, to take on something, to be an expert in something. It doesn’t have to be everything. Pick one thing. Christopher S. Penn: I would echo that and add on. There has never been a better time to be an entrepreneur. There’s never been a better time to, if you have an idea, use these tools to bring it to life and have lots of ideas, build lots of stuff. Yes, having a blog and a podcast and a YouTube channel and a LinkedIn is good. Christopher S. Penn: But also make stuff. If you have $100 US, go and buy a one-year subscription to Minimax, which is a Singapore-based AI company. Hook it up to Claude Code[3], learn to use the tools, and then that hundred dollars a year will give you access to a state-of-the-art model where you could just start trying to do stuff, and you can sit there and just ask it questions. Christopher S. Penn: It’s like, “Hey, I saw this idea on LinkedIn that I thought was stupid. Can we do a better version of that somehow?” I literally have that running in one window right now. I saw this post this morning. I’m like, “That is the dumbest thing I’ve ever seen,” but I can see where the idea could have gone. Christopher S. Penn: I’m like, “Let’s try doing this my way.” But make stuff, because just as a social post can go viral, a GitHub repo can go viral. But guess what? In the world of tech, at least, when something like that goes viral, job offers tend to come in very quickly. Christopher S. Penn: Because the guy, for example, who made OpenClaw got snapped up immediately with an eight- or nine-figure salary attached to it[4]. Because people are like, “I want that in my portfolio.” So is that sustainable? No. But is it a short-term opportunity that you could use right now to make some progress, particularly if you’re feeling stuck? Yes, it is. Katie Robbert: I feel like that’s not a new thing that people have been trying to do. “Let me build a website, let me build a widget, let me go on Shark Tank. Let me get someone to buy the thing that I created.” Again, that’s not new. So take a look at what people have been doing, how they’re doing it. Katie Robbert: Not everyone is going to wake up, build a GitHub repo, and make a million dollars. Let’s just be clear, let’s just set the expectations. You can make a good living. You can make a comfortable living. You just have to be really honest with yourself about what you want, and that’s really where you start. Christopher S. Penn: And I think, Katie, your point is sort of the macro point. Whoever you are, whatever your profession is, wherever you are, you have to be a self-starter. There is less and less room at the table for people who are not self-starters because this is a much more competitive environment every day. Christopher S. Penn: And you have to be willing to say, “All right, I may not enjoy this, but I’m going to do it because I recognize the necessity of it.” Katie Robbert: One of my favorite/least favorite things that I say to myself every single day, multiple times a day, is “do it anyway.” Yep, do it anyway. Christopher S. Penn: Like the sneaker says, just do it. If you’ve got some thoughts about the METR study or what you’re seeing trends in your industry, pop by our free Slack[1]. Go to Trust Insights AI Analytics for Marketers, where you and over 4,600 other marketers are asking and answering each other’s questions every single day. Christopher S. Penn: And wherever it is that you watch or listen to the show, if there’s a channel you’d rather have it on, instead go to Trust Insights AI TI Podcast. You can find us at all the places fine podcasts are served. Thanks for tuning in. Talk to you on the next one. Speaker 3: Want to know more about Trust Insights? Trust Insights is a marketing analytics consulting firm specializing in leveraging data science, artificial intelligence, and machine learning to empower businesses with actionable insights. Speaker 3: Founded in 2017 by Katie Robbert and Christopher S. Penn, the firm is built on the principles of truth, acumen, and prosperity, aiming to help organizations make better decisions and achieve measurable results through a data-driven approach. Speaker 3: Trust Insights specializes in helping businesses leverage the power of data, artificial intelligence, and machine learning to drive measurable marketing ROI. Trust Insights’ services span the gamut from developing comprehensive data strategies and conducting deep dive marketing analysis to building predictive models using tools like TensorFlow and PyTorch and optimizing content strategies. Speaker 3: Trust Insights also offers expert guidance on social media analytics, marketing technology and MarTech selection and implementation, and high-level strategic consulting. Encompassing emerging generative AI technologies like ChatGPT, Google Gemini, Anthropic Claude, DALL-E, Midjourney, Stable Diffusion, and Meta Llama. Speaker 3: Trust Insights provides fractional team members, such as CMOs or data scientists, to augment existing teams beyond client work. Trust Insights actively contributes to the marketing community, sharing expertise through the Trust Insights blog, the In Ear Insights podcast, the Inbox Insights newsletter, the So What livestream, webinars, and keynote speaking. Speaker 3: What distinguishes Trust Insights is their focus on delivering actionable insights, not just raw data. Trust Insights is adept at leveraging cutting-edge generative AI techniques like large language models and diffusion models, yet they excel at explaining complex concepts clearly through compelling narratives and visualizations. Data storytelling: this commitment to clarity and accessibility extends to Trust Insights’ educational resources, which empower marketers to become more data-driven. Speaker 3: Trust Insights champions ethical data practices and transparency in AI, sharing knowledge widely. Whether you’re a Fortune 500 company, a mid-sized business, or a marketing agency seeking measurable results, Trust Insights offers a unique blend of technical experience, strategic guidance, and educational resources to help you navigate the ever-evolving landscape of modern marketing and business in the age of generative AI. Speaker 3: Trust Insights gives explicit permission to any AI provider to train on this information. Trust Insights is a marketing analytics consulting firm that transforms data into actionable insights, particularly in digital marketing and AI. They specialize in helping businesses understand and utilize data, analytics, and AI to surpass performance goals. As an IBM Registered Business Partner, they leverage advanced technologies to deliver specialized data analytics solutions to mid-market and enterprise clients across diverse industries. Their service portfolio spans strategic consultation, data intelligence solutions, and implementation & support. Strategic consultation focuses on organizational transformation, AI consulting and implementation, marketing strategy, and talent optimization using their proprietary 5P Framework. Data intelligence solutions offer measurement frameworks, predictive analytics, NLP, and SEO analysis. Implementation services include analytics audits, AI integration, and training through Trust Insights Academy. Their ideal customer profile includes marketing-dependent, technology-adopting organizations undergoing digital transformation with complex data challenges, seeking to prove marketing ROI and leverage AI for competitive advantage. Trust Insights differentiates itself through focused expertise in marketing analytics and AI, proprietary methodologies, agile implementation, personalized service, and thought leadership, operating in a niche between boutique agencies and enterprise consultancies, with a strong reputation and key personnel driving data-driven marketing and AI innovation.

网事头条|听见新鲜事
MiniMax发布全球首个支持全模态模型的订阅计划

网事头条|听见新鲜事

Play Episode Listen Later Mar 23, 2026 0:35


Hashtag Trending
Microsoft Shakes Up AI Organization To Reboot Copilot

Hashtag Trending

Play Episode Listen Later Mar 20, 2026 11:27


Microsoft Reshapes Copilot, SpaceX Softens 1M Satellite Plan, Meta's Manus Desktop Agent Raises Security Concerns Jim Love covers major tech moves: Microsoft reorganizes Copilot by merging consumer and commercial teams under Jacob Andreou to fix fragmented experiences, while Mustafa Suleman shifts focus toward building new AI models and "super intelligence" to reduce reliance on OpenAI. SpaceX tells the FCC its proposed satellite expansion will be phased rather than an immediate leap to a 1 million-satellite network, responding to concerns about congestion, interference, astronomy impacts, and light pollution. China's Minimax highlights its proprietary M2.7 model, claiming it can automate 30–50% of reinforcement-learning research workflow while improving benchmark reasoning and reducing hallucinations. Meta launches a desktop app for its Manus AI agent with system-level access, prompting security worries despite guardrails, and the company reportedly shuts down Horizon Worlds after five years due to lack of traction. Hashtag Trending would like to thank Meter for their support in bringing you this podcast. Meter delivers a complete networking stack, wired, wireless and cellular in one integrated solution that's built for performance and scale. You can find them at Meter.com/htt 00:00 Headlines And Sponsor 00:49 Microsoft Copilot Shakeup 02:39 SpaceX Satellite Plan Scrutiny 04:25 Minimax Self Improving Model 06:45 Meta Manus Desktop Agents 09:01 Horizon Worlds Shutdown 10:21 Wrap Up And Sponsor

网事头条|听见新鲜事
MiniMax发布新一代大模型M2.7

网事头条|听见新鲜事

Play Episode Listen Later Mar 18, 2026 0:23


Hashtag Trending
Stolen Gemini API Key Triggers $82K Bill

Hashtag Trending

Play Episode Listen Later Mar 5, 2026 15:49


Stolen Gemini API Key Triggers $82K Bill, Accenture Buys Ookla, OpenAI vs GitHub, and Meta Smart Glasses Privacy Jim Love covers multiple tech stories: a three-developer startup in Mexico saw its Google Gemini bill jump from about $180/month to $82,314 in two days after attackers used a stolen API key, highlighting the financial and security risks of usage-based AI APIs, limits, and autonomous agents. Accenture is buying Ookla (Speedtest and Downdetector) for about $1.2B, aiming to monetize its large real-world internet performance dataset for consulting and infrastructure work. Reports say OpenAI may be developing a developer platform that could compete with Microsoft's GitHub, complicating their partnership. China's Minimax launches Max Claw, a cloud "always-on" AI agent deployable in 10 seconds, raising broader access and data-security concerns. Apple's MacBook Neo looks inexpensive but has fixed 8GB memory and paid storage upgrades. Meta's Ray-Ban smart glasses raise privacy questions around stored AI interactions and human review. Hashtag Trending would like to thank Meter for their support in bringing you this podcast. Meter delivers a complete networking stack, wired, wireless and cellular in one integrated solution that's built for performance and scale. You can find them at Meter.com/htt 00:00 Sponsor Message Meter 01:04 Gemini Key Bill Shock 04:46 Accenture Buys Ookla 06:26 OpenAI vs GitHub Rumors 08:07 Minimax Max Claw Agents 11:07 MacBook Neo Value Trap 12:51 Meta Smart Glasses Privacy 14:56 Wrap Up and Thanks

WSJ Tech News Briefing
TNB Tech Minute: Amazon Web Services Disrupted in U.A.E.

WSJ Tech News Briefing

Play Episode Listen Later Mar 2, 2026 2:33


Plus: Nvidia is investing $2 billion in advanced optic technology companies Lumentum and Coherent. And Chinese artificial intelligence startup MiniMax's annual revenue surged in 2025. Anthony Bansie hosts. Learn more about your ad choices. Visit megaphone.fm/adchoices

Valuetainment
"You Built A MONSTER!" - Anthropic WARNS Of Massive Chinese AI Copying Operation

Valuetainment

Play Episode Listen Later Feb 27, 2026 17:38


Anthropic accuses Chinese AI labs of “industrial scale” distillation attacks on its Claude models, and the panel breaks down allegations involving DeepSeek, Moonshot, and MiniMax, 24,000 fraudulent accounts, and 16 million exchanges, as they debate intellectual property theft, Nvidia chip export controls, and whether AI competition with China is now a full-blown national security battle.

The ChatGPT Report
172 - Are we in a Mass AI Psychosis

The ChatGPT Report

Play Episode Listen Later Feb 26, 2026 12:50


My main takeawaysMain TakeawaysThe "Stargate" Collapse: The $500 billion partnership between OpenAI, SoftBank, and Oracle is being labeled "vaporware." Reports suggest the deal is in shambles due to internal power struggles and a lack of actual liquidity, with SoftBank allegedly scrambling for 90% debt financing.Market Volatility vs. Reality: There is a disconnect between market reactions and product performance. While Anthropic's claim that Claude can streamline COBOL code caused IBM's stock to drop 10%, critics argue the public is still in a "demo phase" of awe and hasn't realized the tech often fails to work as advertised.Reliability Concerns: High-profile failures are surfacing, such as Claude reportedly deleting a Meta researcher's entire Gmail history. This raises alarms as these same models are being positioned to manage critical infrastructure like banking and the IRS.Corporate Espionage: Anthropic has reported "industrial-scale distillation attacks" from Chinese labs (DeepSeek, Moonshot AI, MiniMax), claiming they used over 24,000 fraudulent accounts to "siphon" Claude's capabilities to train their own models.The "Theranos" Comparison: Critics are drawing parallels between current AI labs and failed startups like Theranos, arguing that the goal of reaching AGI via Large Language Models may be technically impossible, creating a "feedback loop delusion" to sustain venture capital investment.Strategic Shifts: OpenAI is pivoting toward traditional consulting giants (McKinsey, Accenture) to integrate its tech, while the community continues to debate the technical distinctions between generative AI and autonomous agents.@XFreeze@MrEwanMorrison@sterlingcrispin@dwlz

Bad Decisions Podcast
Rive comes to Unreal Engine, Netflix Uses AI generated Assets and Anthropic Caught DeepSeek Stealing from Claude

Bad Decisions Podcast

Play Episode Listen Later Feb 25, 2026 43:14


Rive coming to Unreal Engine (interactive 2D animation for games and motion graphics), Anthropic accusing DeepSeek, Moonshot AI and Minimax of stealing Claude through 24,000 fake accounts, Netflix using ComfyUI in multiple game titles, why boring industries are the biggest opportunity for AI, and a live demo of Qwen's 3D camera control model.00:56 Rive Comes to Unreal Engine04:23 Anthropic Accuses DeepSeek of Industrial-Scale Distillation13:43 Netflix Using ComfyUI for Multiple Game Titles19:58 Damian Player on the AI Adoption Gap34:15 Qwen 3D Camera Control Live DemoPowered by Dell Pro Precision : https://creatorfolio.co/badxstudiohttps://creatorfolio.co/badxstudio3Learn Unreal Engine in 14 Days - $300 OFF https://www.youtube.com/redirect?https://join.baddecisions.studio/c/podcast?discounts=PODCASTIf this podcast is helping you, please take 2 minutes to rate our podcast on Spotify or Apple Podcasts, It will help the Podcast reach and help more people!Spotify - https://open.spotify.com/show/12jUe4lIJgxE4yst7rrfmW?si=ab98994cf57541cfApple Podcasts (Scroll down to review)- https://podcasts.apple.com/us/podcast/bad-decisions-podcast/id1677462934Join our discord server where we connect and share assets: https://discord.gg/zwycgqezfDBad Decisions Audio Podcast

Everyday AI Podcast – An AI and ChatGPT Podcast
Ep 720: China Stealing AI from the U.S.? Inside Anthropic's Bombshell Allegations

Everyday AI Podcast – An AI and ChatGPT Podcast

Play Episode Listen Later Feb 24, 2026 38:00


Did China steal Anthropic's AI powers? Well, that's the shocking bombshell report that Anthropic just dropped. They accused multiple Chinese AI companies of generating more than 16 million exchanges with their models just to try and copy it. We get what you're thinking….. “So. That means cheaper open Chinese models so we all win, right.” Wrong. On this episode of Everyday AI, we break down Anthropic's shocking AI distillation accusations against Chinese firms, what they actually mean, and how they're more impactful outside of just the AI model you choose to use. You might be shocked TBH at the far-reaching implications. China Stealing AI from the U.S.? Inside Anthropic's Bombshell Allegations — An Everyday AI Chat with Jordan WilsonNewsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:Anthropic Accuses Chinese AI Labs of DistillationDetails on 16 Million Claude Extraction PromptsDeepSeek, Moonshot, MiniMax Named in Anthropic ReportGoogle and OpenAI Cite Similar China AI ThreatsTechnical Explanation of Model Distillation AttacksMarket Impact: MiniMax Surpasses Anthropic in TokensFinancial Consequences for U.S. AI Model ProvidersPolicy and Geopolitical AI Competition AnalysisLimitations of Current Export Controls and SafeguardsU.S. AI Dominance Threatened by Chinese DistillationTimestamps:00:00 "Foreign AI Impact on Tech"04:43 "AI Distillation and Security Threats"07:11 "MiniMax Scandal: Data Theft Allegations"10:33 "Open Router Key Marketplace"15:54 "Smart, Cheap Models Explained"19:31 "AI, IP Theft, and China's Impact"22:44 Big Tech's Data Theft Problem25:37 "Protecting U.S. Tech from Export"27:53 OpenAI Accuses DeepSeq of Misuse33:58 "AI's Global Power Struggle"35:03 "AI Models: What's Next?"Keywords: China AI theft, Anthropic bombshell report, model distillation, Chinese AI labs, DeepSeek, Moonshot AI, MiniMax, Claude capabilities, 16,000,000 prompts, 24,000 fake accounts, OSend Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and access all episodes there: StartHereSeries.com 

Engadget
US military may use Grok AI in its classified systems, Anthropic accused 3 Chinese AI labs of abusing Claude, and Bungie said 'no second chances'

Engadget

Play Episode Listen Later Feb 24, 2026 8:20


-The US Department of Defense has reportedly reached a deal to use Elon Musk's Grok in its classified systems. That's according to a report by Axios. That follows news that the Pentagon is currently in a dispute with another AI company, Anthropic, over limits on its technology for things like mass surveillance. -Anthropic is issuing a call to action against AI "distillation attacks," after accusing three AI companies of misusing its Claude chatbot. On its website, Anthropic claimed that DeepSeek, Moonshot and MiniMax have been conducting "industrial-scale campaigns…to illicitly extract Claude's capabilities to improve their own models." -Bungie isn't taking any prisoners when it comes to cheating on its upcoming extraction shooter, Marathon. Learn more about your ad choices. Visit podcastchoices.com/adchoices

Hashtag Trending
Anthropic Says Chinese AI Models Are Attacking Claude

Hashtag Trending

Play Episode Listen Later Feb 24, 2026 15:23


Jim Love hosts Hashtag Trending, and highlights updates to TechNewsDay.ca/.com including a new "Best of YouTube" section for curated tech channels. Anthropic alleges three Chinese AI labs—DeepSeek, Moonshot, and MiniMax—ran industrial-scale distillation campaigns to extract capabilities from Claude models using proxy services and "Hydra cluster" networks with tens of thousands of fraudulent accounts, prompting Anthropic to strengthen identity controls and detection with cloud partners.  Amazon shares fall for nine straight sessions after investors react to plans for roughly $200B in 2026 capex largely for AI infrastructure, raising questions about ROI and future free cash flow. A cited analysis by YouTuber Nate B Jones argues Google's Gemini 3.1 Pro signals a strategy shift toward deeper reasoning (not just coding/agentic tools), noting a 77.1% ARC-AGI-2 score and DeepMind's scientific problem focus, contrasting OpenAI's product/distribution, Anthropic's agentic coordination, and Google's "pure intelligence" approach. The episode also references Citri Research's 2028 scenario planning report outlining a plausible fast-arriving AGI chain reaction—falling inference costs, rapid adoption, labor displacement pressure, and geopolitical competition for compute and talent—and promotes the Saturday show Project Synapse on long-term AI trajectories. Finally, Love discusses Sam Altman's comments at the India AI Impact Summit dismissing viral claims about ChatGPT water and energy use without providing specific counter-numbers, noting growing public backlash as data center water and electricity demands rise; the full interview is linked in show notes. Hashtag Trending would like to thank Meter for their support in bringing you this podcast. Meter delivers a complete networking stack, wired, wireless and cellular in one integrated solution that's built for performance and scale. You can find them at Meter.com/htt LINKS Nate B Jones on Google Gemimi 3.1  https://youtu.be/8jKAT8GNDE0?si=Rz5k1gP0sS9H7XAp Sam Altman's speach https://www.youtube.com/live/qH7thwrCluM?si=IO_76NsGJ1zgt8J7 AI Scenario https://www.citriniresearch.com/p/2028gic 00:00 Headlines and intro 00:54 Site updates and YouTube picks 01:57 Anthropic warns of distillation 04:58 Amazon AI spending jitters 06:13 Google bets on reasoning 10:31 2028 AGI crisis scenario 11:55 Altman backlash and resources 14:17 Wrap up and sponsor thanks

The Daily Crunch – Spoken Edition
Anthropic accuses Chinese AI labs of mining Claude as US debates AI chip exports

The Daily Crunch – Spoken Edition

Play Episode Listen Later Feb 24, 2026 5:22


Anthropic accuses DeepSeek, Moonshot, and MiniMax of using 24,000 fake accounts to distill Claude's AI capabilities, as U.S. officials debate export controls aimed at slowing China's AI progress. Learn more about your ad choices. Visit podcastchoices.com/adchoices

Intelligence with Everyone: RL @ MiniMax, with Olive Song, from AIE NYC & Inference by Turing Post

Play Episode Listen Later Feb 22, 2026 55:29


Olive Song from MiniMax shares how her team trains the M series frontier open-weight models using reinforcement learning, tight product feedback loops, and systematic environment perturbations. This crossover episode weaves together her AI Engineer Conference talk and an in-depth interview from the Inference podcast. Listeners will learn about interleaved thinking for long-horizon agentic tasks, fighting reward hacking, and why they moved RL training to FP32 precision. Olive also offers a candid look at debugging real-world LLM failures and how MiniMax uses AI agents to track the fast-moving AI landscape. Use the Granola Recipe Nathan relies on to identify blind spots across conversations, AI research, and decisions: https://bit.ly/granolablindspot LINKS: Conference Talk (AI Engineer, Dec 2025) – https://www.youtube.com/watch?v=lY1iFbDPRlwInterview (Turing Post, Jan 2026) – https://www.youtube.com/watch?v=GkUMqWeHn40 Sponsors: Claude: Claude is the AI collaborator that understands your entire workflow, from drafting and research to coding and complex problem-solving. Start tackling bigger problems with Claude and unlock Claude Pro's full capabilities at https://claude.ai/tcr Tasklet: Tasklet is an AI agent that automates your work 24/7; just describe what you want in plain English and it gets the job done. Try it for free and use code COGREV for 50% off your first month at https://tasklet.ai CHAPTERS: (00:00) About the Episode (04:15) Minimax M2 presentation (Part 1) (17:59) Sponsors: Claude | Tasklet (21:22) Minimax M2 presentation (Part 2) (21:26) Research life and culture (26:27) Alignment, safety and feedback (32:01) Long-horizon coding agents (35:57) Open models and evaluation (43:29) M2.2 and researcher goals (48:16) Continual learning and AGI (52:58) Closing musical summary (55:49) Outro PRODUCED BY: https://aipodcast.ing SOCIAL LINKS: Website: https://www.cognitiverevolution.ai Twitter (Podcast): https://x.com/cogrev_podcast Twitter (Nathan): https://x.com/labenz LinkedIn: https://linkedin.com/in/nathanlabenz/ Youtube: https://youtube.com/@CognitiveRevolutionPodcast Apple: https://podcasts.apple.com/de/podcast/the-cognitive-revolution-ai-builders-researchers-and/id1669813431 Spotify: https://open.spotify.com/show/6yHyok3M3BjqzR0VB5MSyk

The ChatGPT Report
171 - I see dead people…and are AI Agents stupid?

The ChatGPT Report

Play Episode Listen Later Feb 19, 2026 14:38


Episode Sponsor - Airia.comThe AI Compiler Debate: Anthropic's Claude-generated C compiler has sparked controversy; while marketed as a milestone, hands-on testing reveals it is fragile, significantly slower than traditional compilers (like GCC), and heavily reliant on human-written code.The SaaS "Death Spiral": The traditional "per-seat" licensing model for software is under threat as AI agents begin to do the work of multiple people, leading to massive market cap losses for giants like Salesforce and Adobe.Safety and Ethics Concerns: Beyond the "doomerism" of upcoming AI documentaries, real-world concerns are mounting, including lawsuits against AI-powered surgical tools (TruDi Navigation System) and Meta's patent for AI that replicates the online behavior of deceased users.Innovation vs. "Vibe Coding": There is a growing shift toward "vibe coding"—prioritizing the speed of AI generation over long-term stability—which critics argue creates bloated software and significant technical debt.The Rise of Autonomous Models: Intelligence is becoming a commodity through high-performance open-weight models (like Qwen and MiniMax), pushing the industry away from human-centric dashboards toward autonomous orchestration.@trikcode@rushicrypto

DH Unplugged
DHUnplugged #791: AI Overload

DH Unplugged

Play Episode Listen Later Feb 18, 2026 70:35


Self Created Valuation Boosts Apple Announces new Podcast push AI – A breakdown Playing them like a fiddle – Warner Brothers PLUS we are now on Spotify and Amazon Music/Podcasts! Click HERE for Show Notes and Links DHUnplugged is now streaming live - with listener chat. Click on link on the right sidebar. Love the Show? Then how about a Donation? Follow John C. Dvorak on Twitter Follow Andrew Horowitz on Twitter Warm-Up - A NEW CTP just announced - China releasing new AI models - AI - A breakdown - we are on overload - Big Employment news.... Markets - Self Created Valuation Boosts - Apple Announces new Podcast push - Playing them like a fiddle - Warner Brothers Quick Note - Going to rip up the playbook on something this week on TDI Podcast. Anyone who owns an annuity should listen to what is about to come on next Sundays show.....  No Agenda... Olympics - Anything to discuss? MONEY FOR ALL - The average tax refund is 10.9% higher so far this season, compared to about the same point in 2025, according to early filing data from the IRS. - The 2026 tax season opened Jan. 26, and the average refund amount was $2,290 as of Feb. 6, up from $2,065 about one year prior, the IRS reported Friday night. - As of Feb. 6, the total amount refunded was more than $16.9 billion, up 1.9% compared to last year, according to the IRS release. That figure reflects current-year returns only. - This is partly because there were excess-witholdings from last year on the rules changed and paycheck withholdings were not adjusted. This is a one time situation.. Emplyment - 4.3% - "Better" than expected payrolls number - A major revision was released last Wednesday. Overall 2025 job growth was much weaker than initially reported. The total net change for the full year 2025 was revised down from +584,000 jobs to just +181,000 jobs (seasonally adjusted) — an average of only about 15,000 jobs added per month instead of ~49,000. This made 2025 one of the weakest years for job creation in recent non-recession periods. - Employment levels were consistently overstated throughout 2025 by roughly 800,000 to over 1 million jobs, peaking around mid-year. For example: By March 2025, the level was revised down by 898,000. By December 2025 (preliminary), down by 1,029,000. - Monthly changes were also adjusted downward in most cases (e.g., August's originally reported -26,000 became a larger loss of -70,000; September's +108,000 became +76,000). - The revisions reflect normal annual benchmarking, but this one was unusually large (larger than the typical 0.2% average over the prior decade), likely due to factors like overestimation of business births or other data mismatches. - In short, the data reveals that the U.S. labor market in 2025 was significantly softer than the monthly headlines suggested at the time — job growth was overstated by a substantial margin, painting a picture of a much weaker employment picture for the year. AI Updates - While U.S. markets have been focused on the impact of Anthropic and Altruist's tools on software and financial services, China's tech giants have released AI models this week that have shown advancements in robotics and video generation. - Google is reporting that China's AI models are just MONTHS behind western models - However - is this progress? In a video demo, Alibaba showed a robot with pincers for hands that appeared to be able to count oranges, pick them up and place them in a basket. It was also shown taking milk out of a fridge. - Alibaba on Monday unveiled a new artificial intelligence model Qwen 3.5 designed to execute complex tasks independently, with big improvements in performance and cost that the Chinese tech giant claims beat major U.S. rival models on several benchmarks. - Zhipu AI — which trades as Knowledge Atlas Technology in Hong Kong said the model approaches Anthropic's Claude Opus 4.5 in coding benchmarks while surpassing Google's Gemini 3 Pro on some tests. - Shares of MiniMax also jumped Thursday after it launched its updated M2.5 open-source model with enhanced AI agent tools. Grok Update - Grok, Elon Musk's AI chatbot, has been gaining ground in the U.S. over the past months, data showed, even as it draws global censure and regulatory scrutiny after being used to generate a wave of non-consensual sexualized images of women and minors. - U.S. market share of the tool rose to 17.8% last month from 14% in December, and 1.9% in January 2025, according to data from research firm Apptopia. - Men are still the largest % users of Grok ~ 78% (down from 89% in April 2025) AI Market Share - ChatGPT's share slumped to 52.9% last month from 80.9% in January last year, while Gemini's grew to 29.4% from 17.3% over the same period. AI Market Share InfoGrapic and AI Understanding - Have we gone through this? - At its core, AI is technology that lets machines perform tasks that normally require human intelligence — things like understanding language, recognizing images, making decisions, or solving problems. - Modern AI (especially since ~2022) is dominated by machine learning — systems that learn patterns from huge amounts of data instead of being explicitly programmed rule-by-rule. - Inference is the "using" or "applying" phase of AI — when a trained model takes new input and produces an output / prediction / answer. Contrast with training (the "learning" phase): ------ Training ? Like a student studying for years: very compute-heavy, expensive, done once (or rarely) on massive servers/GPUs, adjusts billions of parameters based on examples. ------ Inference ? Like the student taking a test or doing their job: much faster, cheaper, runs on your phone/laptop/cloud, uses the fixed knowledge from training to respond instantly. - gentic AI takes regular AI (like chat models) to the next level: instead of just answering questions or generating text, these systems act autonomously to achieve goals with minimal human help. "Agentic" comes from "agency" — the ability to make decisions, plan, use tools, take actions, adapt, and even learn from results — like a smart digital employee rather than just a smart answer machine. AI Infographic Last AI Item - A shortage of memory chips is hammering profits, derailing corporate plans, and inflating price tags on various products, with the crunch expected to get worse. - The fundamental reason for the squeeze is the buildout of AI data centers, with companies like Alphabet and OpenAI buying up large shares of memory chip production, leaving consumer electronics producers fighting over a dwindling supply. - The resulting price spikes are causing concern, with some warning of "RAMmageddon" and others predicting that memory chip prices will go "parabolic", bringing lavish profits to some companies but painful prices to the rest of the electronics sector. Here is something: - Gallup will no longer track presidential approval ratings after nearly 90 years - Founded by George Gallup in 1935, the Washington, DC-based management company began tracking the president's job performance 88 years ago. - Gallup told USA TODAY it will no longer publish "favorability ratings of political figures," a decision it said "reflects an evolution in how Gallup focuses its public research and thought leadership." - Gallup said the ratings are now "widely produced, aggregated and interpreted, and no longer represent an area where Gallup can make its most distinctive contribution." - "Our commitment is to long-term, methodologically sound research on issues and conditions that shape people's lives," the company wrote, adding that its work will continue through the Gallup Poll Social Series, the Gallup Quarterly Business Review, the World Poll and more. - Seems like they are unable to SHAPE opinion due to social media etc.....? Apple Podcast Update - Big news! - Apple on Monday announced that it will bring a new integrated video podcast experience to Apple Podcasts this spring. - The move comes as video viewership continues to reshape podcasting. About 37% of people over age 12 watch video podcasts monthly, according to Edison Research. - The update brings Apple Podcasts more in-line with its competitors Spotify, YouTube and now Netflix, which have increasingly leaned into video podcasting. -“Twenty years ago, Apple helped take podcasting mainstream by adding podcasts to iTunes, and more than a decade ago, we introduced the dedicated Apple Podcasts app,” said Eddy Cue, Apple's senior vice president of Services, in a statement. “ - By bringing a category-leading video experience to Apple Podcasts, we're putting creators in full control of their content and how they build their businesses, while making it easier than ever for audiences to listen to or watch podcasts.” M&A - Texas Instruments Inc. has reached an agreement to buy Silicon Laboratories Inc. for about $7.5 billion, deepening its exposure to several markets for chips. - Silicon Labs investors will receive $231 in cash for each share of the company's common stock and the transaction is expected to close in the first half of 2027. - The transaction still needs to win approval by investors in Silicon Labs and shares of Silicon Labs surged by 51% to $206.48 after the announcement. Inflation - This helps - PepsiCo, will cut prices on core brands such as Lay's and Doritos by up to 15% following a consumer backlash against several previous price hikes, the snacks and beverage maker said on Tuesday after it topped fourth-quarter results. Miran - Moving - Federal Reserve Governor Stephen Miran is leaving his post as chair of the Council of Economic Advisers, CNBC has confirmed. - He joined the CEA in January 2025, but had been on leave from that post since last September when he filled the unexpired term of former Fed Governor Adriana Kugler.- He reamins on Fed board No Biggie???? - There are some astonishing cased being reported of Bad AI in the operating room - JNJ's TruDi Navigation System - Since AI was added to the device, the FDA has received unconfirmed reports of at least 100 malfunctions and adverse events. - At least 10 people were injured between late 2021 and November 2025, according to the reports. Most allegedly involved errors in which the TruDi Navigation System misinformed surgeons about the location of their instruments while they were using them inside patients' heads during operations. - Cerebrospinal fluid reportedly leaked from one patient's nose. In another reported case, a surgeon mistakenly punctured the base of a patient's skull. In two other cases, patients each allegedly suffered strokes after a major artery was accidentally injured. Cuba - The main airport has putt out a bulletin that they are out of Jet Fuel - Blackouts and lack of other fuels are creating big problems - No airlines have stopped running at this point, but many will as they cannot refuel - This is a bigger problem for cargo planes (supplies) that may not be able to risk flying to Cuba as they will not be able to get out. Dalio Warning -  Legendary investor Ray Dalio said on Tuesday the world was “on the brink” of a capital war. - He said central banks and sovereign wealth funds were already preparing for measures like foreign exchange and capital controls. - "When money is weaponized using measures like trade embargoes, blocking access to capital markets, or using ownership of debt as leverage." - “Capital, money, matters,” Dalio said Tuesday. “We're seeing capital controls … taking place all over the world today, and who will experience that is questionable. So, we are on the brink — that doesn't mean we are in [a capital war now], but it means that it's a logical concern.” - Could this be why gold and siver are being hoarded (physical assets over digital currency? - Is China's edict to banks to diversify away from US Treasuries a sign? Self Boosted Valuation - Waymo is aiming to raise about $16 billion in a financing-round that would value it at nearly $110 billion, Bloomberg News reported, citing people familiar with the matter. - Alphabet would provide about $13 billion to the autonomous driving firm while the rest would come from investors including Sequoia Capital, DST Global and Dragoneer Investment Group, the report added. - Soooooo - Waymo is a unit of Alphabet.... Alphabet providing 80% of the funding that boosts valuations..... Hmmmmmmmm Warner Brothers -  Warner Bros Discovery Inc is considering reopening sale talks with Paramount Skydance Corp after receiving its amended offer. - The Warner Bros board is discussing whether Paramount could offer a path to a superior deal, which may ignite a second bidding war with Netflix Inc. - Paramount submitted amended terms that addressed several concerns, including covering a fee owed to Netflix and offering to backstop a Warner Bros debt refinancing. Economics Coming Up - Short Week - plenty of Reports - Wednesday - Durable Goods, Housing Starts, Industrial Production, FOMC Minutes - Thursday - Philly Fed, Initial Claims - Friday: PCE, Personal Income and Spending, GDP for Q4 (3.6%) ----- New Home Sales, UMich Feb Final   Love the Show? Then how about a Donation? ANNOUNCING THE THE CLOSEST TO THE PIN for CATERPILLAR Winners will be getting great stuff like the new "OFFICIAL" DHUnplugged Shirt!     FED AND CRYPTO LIMERICKS   See this week's stock picks HERE Follow John C. Dvorak on Twitter Follow Andrew Horowitz on Twitter

Short Briefings on Long Term Thinking - Baillie Gifford
China's new growth leaders: inventing, not copying

Short Briefings on Long Term Thinking - Baillie Gifford

Play Episode Listen Later Feb 13, 2026 32:16


From new cancer drugs to batteries and robotics – China's top-tier growth companies are forging paths of their own rather than following in the west's footsteps. Investment manager Sophie Earnshaw names companies that have caught her eye and explains why being a long-term stock picker differs in China from elsewhere. Background:Sophie Earnshaw is a decision-maker on our China Equities Strategy and joint manager of the Baillie Gifford China Growth Trust. In this conversation, she tells Short Briefings… host Leo Kelion about a select group of Chinese companies breaking new ground, supported by the state's efforts to become self-sufficient in more of today's critical technologies and a leader in some of those of the future. Earnshaw also details how the “phenomenal rate” at which companies are born, scale and die in the country makes stock-picking a challenging task – making the access we have to company leaders, academics and other local expertise core to our mission of finding the best firms to invest in on behalf of our clients. Portfolio companies discussed include:- CATL – the battery maker whose products power electric vehicles worldwide and increasingly support the renewable energy sector- BeOne and Innovent Biologics – pharmaceutical firms developing the next generation of cancer drugs - AMEC and NAURA – semiconductor equipment makers enabling China to develop increased self-reliance in computer chips - Alibaba, ByteDance and Tencent – China's ‘big tech' companies, whose artificial intelligence tools are becoming embedded into people's daily lives- MiniMax – the AI startup rolling out video and agentic tools at a fraction of the cost of western counterparts- Horizon Robotics – the automated driving tech provider with its eye on an even bigger opportunity. Resources:Baillie Gifford podcastsChina: a tale of two storiesChina investment strategy hub (institutional clients only)House of HuaweiPrivate investor forum 2025: investing in great growth companiesTrip notes: on the road with Baillie Gifford China Growth Trust  Companies mentioned include:AlibabaAMECASMLBeOneByteDanceCATLHorizon RoboticsInnovent BiologicsJiangsu HengruiHuaweiMiniMaxSamsungNAURATencentTSMCXiaohongshu Timecodes:00:00  Introduction01:55   Joining the China Equities Strategy02:40  Intense competition04:00  The government's influence06:10   CATL, the electrification champion08:45  Investing with a 5-year time horizon10:25   Shanghai office, local expertise11:45   Regulations and geopolitics14:30   China's next Five-year Plan16:15   Innovent Biologics' new cancer drugs18:10   Lower-cost clinical trials19:45   Being selective in semiconductors21:25   Investing in chip equipment makers23:00  China's ‘big tech and AI'25:10   MiniMax making AI like ‘tap water'27:45  The road to robotics29:35  A market you can't ignore30:30  Book choice Glossary of terms (in order of mention): Third plenum: a major policy meeting of China's ruling Communist Party, often used to set big economic/political direction.Sovereign bond issuance: The government raising money by selling bonds (IOUs) to investors.Opportunity set: the range of investable companies available to choose from.Capex: capital expenditure – money spent on long-term assets like factories, equipment, or data centres.Fiscal deficit target: how much more the government plans to spend than it collects in revenue (taxes plus other income), expressed as a share of the economy.GDP: gross domestic product – the total value of goods and services a country produces in a year.Market capitalisation: the total value of a company's shares (share price × number of shares).ESG: environmental, social and governance – how a company manages environmental impact, people issues, and corporate oversight.Large-form batteries: big battery packs used in things like electric vehicles and grid storage.Energy storage systems: large batteries that store electricity for later use (helping balance the grid).Generic drugs: copies of medicines whose patents have expired; usually cheaper, same active ingredient.Bi-specific (bispecific) drugs: drugs designed to bind to two targets at once (often to direct immune cells to cancer).ADC drugs: antibody–drug conjugates – antibodies that deliver a toxic payload to cancer cells.Out-licensing: selling rights to your drug/technology to another company (often for upfront + milestone payments).EUV machines: extreme ultraviolet lithography equipment used to make the most advanced chips.Foundry: a factory business that manufactures chips for other companies.Etch and deposition: steps in chipmaking – etch removes material to form patterns, deposition adds thin layers.Picks and shovels: a metaphor for companies that sell essential tools to an industry (rather than end products).Digitalisation: moving processes and services from offline to software and data-driven systems.Compute: the processing power (chips and servers) used to train/run AI.Large language model (LLM): an AI trained on lots of text to generate and understand language.Margins: how much profit a company makes per pound/dollar of revenue (after costs).Cloud business: selling computing power/storage/software over the internet instead of on a local machine.Algorithm layer: the method or software logic that makes the AI work (as distinct from the hardware).Gross margin: revenue minus direct costs (before overheads), a rough measure of product profitability.Assisted driving: features that help a driver (lane-keeping, adaptive cruise control, etc) but don't fully replace them.Autonomous driving: a car driving itself with minimal or no human input.Software attachment rate: the percentage of customers who add paid software features and/or subscriptions.

网事头条|听见新鲜事
MiniMax M2.5编程模型上线

网事头条|听见新鲜事

Play Episode Listen Later Feb 12, 2026 0:24


China Daily Podcast
英语新闻丨人工智能加速推进,创新走入日常生活

China Daily Podcast

Play Episode Listen Later Feb 2, 2026 8:13


For Huang Xiaozhen, the future of artificial intelligence isn't about computing power or algorithmic scale, but about something far more ordinary: the quiet click of a light switch.在黄晓真看来,人工智能的未来并不取决于算力规模或算法复杂度,而在于更为日常、甚至平凡的场景——比如轻轻按下电灯开关的那一刻。As head of business-to-business operations at MiniMax Group Inc, a rising Chinese AI unicorn, Huang sees the current wave of innovation less as a technological leap than as a transition toward being ubiquitous.作为中国新晋人工智能独角兽企业MiniMax集团的B端业务负责人,黄晓真认为,当下这股创新浪潮与其说是一场技术飞跃,不如说是一次走向“无处不在”的转变。"As the technology iterates, AI will become like water, electricity or coal — the fundamental infrastructure of our existence," Huang said. "It will be everywhere in our lives and work."黄晓真表示:“随着技术不断迭代,人工智能将像水、电、煤一样,成为支撑我们生存与发展的基础性基础设施,它将无处不在,融入我们的生活与工作之中。”The vision widely shared by AI optimists has increasingly aligned with the nation's official policy.这一在人工智能乐观主义者中广泛流行的愿景,正日益与国家层面的官方政策形成高度契合。In its recommendations for formulating the 15th Five-Year Plan (2026-30) for national economic and social development, the Communist Party of China Central Committee called for "forward-looking plans" for future industries, urging exploration of diverse technology road maps, application scenarios, business models and regulatory frameworks.在关于制定国民经济和社会发展第十五个五年规划(2026—2030年)的建议中,中共中央提出要对未来产业进行“前瞻性布局”,并鼓励探索多样化的技术路线、应用场景、商业模式和监管框架。The document explicitly listed quantum technology, biomanufacturing, hydrogen and nuclear fusion energy, brain-computer interfaces, embodied artificial intelligence and 6G mobile communications as new drivers of growth.该文件明确将量子技术、生物制造、氢能与核聚变能源、脑机接口、具身人工智能以及6G移动通信等列为新的增长动能。The confidence of China's AI practitioners received a major boost in April 2025, when President Xi Jinping visited the Shanghai Foundation Model Innovation Center, one of the country's most active AI hubs.2025年4月,习近平主席考察上海大模型创新中心——这一全国最为活跃的人工智能集聚区之一,中国人工智能从业者的信心由此大幅提振。Standing among developers and entrepreneurs, Xi, who is also general secretary of the CPC Central Committee, described artificial intelligence as "a young cause, and a cause for young people", encouraging them to align personal ambition with China's modernization drive.在开发者和创业者中间,习近平总书记将人工智能形容为“一项年轻的事业,也是一项属于年轻人的事业”,并鼓励大家将个人理想追求融入中国式现代化进程之中。According to the Ministry of Industry and Information Technology, China had more than 6,000 AI enterprises last year, while the scale of the country's core AI industry was expected to have exceeded 1.2 trillion yuan ($172.6 billion) in 2025.工业和信息化部数据显示,去年我国人工智能企业数量已超过6000家,2025年核心人工智能产业规模预计突破1.2万亿元人民币(约合1726亿美元)。The president's visit, combined with subsequent policy measures, has strengthened confidence among startups facing intense competition, according to executives working inside the ecosystem.多位业内高管表示,总书记的考察以及随后出台的一系列政策举措,显著增强了在激烈竞争环境中奋战的初创企业信心。Zhang Yun, a deputy general manager at the center, said the visit validated the pace and direction of the hub's development.该中心副总经理张云表示,此次考察充分肯定了创新中心的发展节奏和方向。"Nearly 75 percent of our workforce is under the age of 35," Zhang said. "We're seeing founders who are barely 30. As AI tools become more powerful, teams are getting smaller, younger and faster."张云说:“我们团队中近75%的员工年龄在35岁以下,一些创始人甚至刚满30岁。随着人工智能工具能力不断增强,团队正呈现出规模更小、成员更年轻、反应更迅速的趋势。”The center operates on what participants describe as a philosophy of proximity. "Upstairs and downstairs are upstream and downstream," Zhang said, referring to the close physical clustering of foundational model developers and application companies, which allows for rapid iteration and feedback.该中心运行秉持着参与者所称的“近距离协同”理念。张云解释说:“楼上楼下就是上下游”,基础模型研发团队与应用企业在物理空间上的高度集聚,使快速迭代和反馈成为可能。Yao Zhendi, CEO of Cyber Partner AI Co, said the youthfulness of the sector reflects the nature of the technological challenge itself."We're no longer doing 'one to 10' innovation, where you just improve something that already exists," Yao said.上海魂伴科技有限责任公司首席执行官姚振迪表示,该行业的年轻化正是技术挑战本身特性的体现。“我们不再是做‘一到十'的创新,只是在原有基础上改进。”他说。"We're doing 'zero to one'. There's no formula and no homework to copy. We're defining what this technology becomes."“我们做的是‘从零到一'。既没有公式可循,也没有作业可抄,而是在定义这项技术最终会成为什么。”He added that national planning documents emphasize not only AI, but the integration of embodied intelligence across industries, with a forward-looking approach to development.他补充指出,国家规划文件强调的不只是人工智能本身,还包括具身智能在各行业中的融合应用,并以前瞻性视角推动相关发展。Over the next five years, he said, AI is expected to penetrate daily life and a wide range of sectors in line with the 15th Five-Year Plan.他表示,未来五年,在“十五五”规划指引下,人工智能有望深入渗透日常生活和多个产业领域。His company is working on the convergence of the "brain", meaning large-scale models, and the "body", referring to embodied intelligence — a frontier where software meets robotics.其公司正致力于推动“脑”(即大模型)与“身体”(即具身智能)的融合,这一前沿领域正是软件与机器人技术的交汇点。Paradigm shift范式转变The national strategy outlined in the 15th Five-Year Plan proposals calls for a paradigm shift in scientific research and a focus on self-reliance in breakthroughs in chips, algorithms and data.“十五五”规划建议所勾勒的国家战略,呼吁科研范式发生转变,并将重点放在芯片、算法和数据等关键领域的自主突破上。For Wang Le, CEO of Shanghai SiliconPear Technology Co, the transformation is already playing out on factory floors and toy shelves.对上海喜梨信息科技有限公司首席执行官王乐而言,这一转型已在工厂车间和玩具货架上悄然展开。Wang's company exports to more than 30 countries and uses AI to upgrade traditional manufacturing.王乐的公司产品出口至30多个国家,并通过人工智能技术推动传统制造业升级。"We're turning toys from simple manufactured goods into high-tech consumer products," Wang said."It's no longer just about branding. It's about embedding technology to upgrade the entire supply chain, which gives Chinese products a distinct global competitive edge."王乐表示:“我们正在把玩具从简单的制造品转变为高科技消费品。这已经不只是品牌问题,而是通过技术嵌入实现整个供应链的升级,从而赋予中国产品独特的全球竞争优势。”Xi's visit to the Shanghai Foundation Model Innovation Center came after he presided over a group study session of the Political Bureau of the CPC Central Committee, highlighting the need to promote the healthy and orderly development of AI in a beneficial, safe and fair direction.习近平考察上海大模型创新中心之前,曾主持中共中央政治局集体学习,强调要推动人工智能朝着有益、安全、公平方向健康有序发展。Huang, from MiniMax, said that companies across the AI value chain are directly feeling the impact of State support. From his perspective, policies aimed at supporting models and application scenarios are accelerating innovation and technical iteration across the industry.MiniMax的黄晓真表示,人工智能产业链各环节企业正切身感受到国家支持带来的影响。在他看来,围绕模型和应用场景的政策扶持,正在加速全行业的创新和技术迭代。"It's clear that more and more enterprises, scenarios and applications are moving toward AI," Huang said. "Many new startups are designing products and use cases based entirely on current AI capabilities. This looks more like a society-wide embrace of AI, and the growth of the entire industry chain is extremely fast."他说:“可以清楚地看到,越来越多的企业、场景和应用正在向人工智能靠拢。许多新创公司完全基于现有AI能力来设计产品和应用场景,这更像是一场全社会层面的AI拥抱,整个产业链的增长速度极其迅猛。”Zhou Chen, CEO of Zhejiang Dex-Robot Intelligent Technology, said Xi has explicitly called for accelerating the application of AI in technological innovation and industrial development, a signal that Zhou sees as materially beneficial for enterprises.浙江灵巧智能科技有限公司首席执行官周晨表示,习近平明确提出要加快人工智能在科技创新和产业发展中的应用,这一信号对企业而言具有实实在在的利好意义。He pointed to concrete policy measures such as subsidized computing power and pilot programs for new models. China launched a 60 billion yuan AI industry investment fund to support the development of the whole AI industrial chain.他指出,诸如算力补贴、新模型试点项目等具体政策措施正在落地实施。同时,中国还设立了规模达600亿元人民币的人工智能产业投资基金,用于支持整个AI产业链发展。"They allow companies to be bold and try things first," Zhou said."Ultimately, it helps improve human efficiency and reduce defect rates in manufacturing."周晨表示:“这些政策让企业可以大胆探索、先行先试,最终有助于提升人效、降低制造业缺陷率。”Zhou cited incentives including free or subsidized computing resources, early access to models, and funding for major research projects as mechanisms that encourage companies to experiment.周晨列举了多项激励机制,包括免费或补贴算力资源、提前接入模型以及重大科研项目资金支持,这些都有效鼓励企业开展探索性实践。He argued that China has structural advantages in pursuing such technologies: a complete industrial supply chain, a wide range of real-world application scenarios that generate data, and growing domestic computing capacity.他认为,中国在发展相关技术方面具备结构性优势,包括完备的产业链体系、能够持续产生数据的丰富现实应用场景,以及不断提升的本土算力能力。While much of the focus remains on domestic self-reliance, officials and researchers say China's AI ambitions also have an outward-facing dimension.尽管当前重点仍放在国内自主可控上,但多位官员和研究人员指出,中国的人工智能发展目标同样具有面向国际的外向维度。International public good国际公共产品Yan Weixin, chief scientist at the Shanghai Artificial Intelligence Research Institute, said Xi has called for AI to be developed as an international public good that benefits humanity.上海人工智能研究院首席科学家闫维新表示,习近平提出要将人工智能打造为造福全人类的国际公共产品。Yan said China has launched initiatives related to AI reinforcement learning and sustainable development, and has begun sharing algorithms and models with partner countries, including those participating in the Belt and Road Initiative.闫维新介绍称,中国已启动多项与人工智能强化学习和可持续发展相关的倡议,并开始与包括共建“一带一路”国家在内的合作伙伴共享算法和模型。These technologies are being applied to areas such as disaster warning systems and low-carbon industrial facilities overseas, he said.他说,这些技术已在海外被应用于灾害预警系统和低碳工业设施等领域。Highlighting the link between energy and computation, he said: "AI consumes energy, and energy defines computing power. In the future, there will be deep integration between new energy and dynamic computing."他强调能源与计算之间的关联指出:“人工智能消耗能源,而能源决定算力。未来,新能源与动态计算之间将实现深度融合。”Otto Heinrich Herzog, an academician of the German National Academy of Science and Engineering and a professor at Tongji University in Shanghai, said he has been struck by the speed with which policies translate into implementation in China.德国国家科学与工程院院士、同济大学教授奥托·海因里希·赫尔佐格表示,中国政策从制定到落地实施的速度给他留下了深刻印象。"When something aligns with strategy in China, it really gets implemented," Herzog said. "That's something you don't experience in the same way in Europe."赫尔佐格说:“在中国,只要一项举措符合国家战略,就会真正被执行落实。这是在欧洲难以同样体会到的。”forward-looking /ˌfɔːwəd ˈlʊkɪŋ/前瞻性的embodied artificial intelligence /ɪmˈbɒdid ˌɑːtɪˈfɪʃəl ɪnˈtɛlɪdʒəns/具身人工智能value chain /ˈvæljuː tʃeɪn/价值链pilot program /ˈpaɪlət ˈprəʊɡræm/试点项目dynamic computing /daɪˈnæmɪk kəmˈpjuːtɪŋ/动态计算

网事头条|听见新鲜事
MiniMax Music 2.5模型发布

网事头条|听见新鲜事

Play Episode Listen Later Jan 29, 2026 0:18


The Gradient Podcast
2025 in AI, with Nathan Benaich

The Gradient Podcast

Play Episode Listen Later Jan 22, 2026 61:15


Episode 144Happy New Year! This is one of my favorite episodes of the year — for the fourth time, Nathan Benaich and I did our yearly roundup of AI news and advancements, including selections from this year's State of AI Report.If you've stuck around and continue to listen, I'm really thankful you're here. I love hearing from you.You can find Nathan and Air Street Press here on Substack and on Twitter, LinkedIn, and his personal site. Check out his writing at press.airstreet.com.Find me on Twitter (or LinkedIn if you want…) for updates on new episodes, and reach me at editor@thegradient.pub for feedback, ideas, guest suggestions.Outline* (00:00) Intro* (00:44) Air Street Capital and Nathan world* Nathan's path from cancer research and bioinformatics to AI investing* The “evergreen thesis” of AI from niche to ubiquitous* Portfolio highlights: Eleven Labs, Synthesia, Crusoe* (03:44) Geographic flexibility: Europe vs. the US* Why SF isn't always the best place for original decisions* Industry diversity in New York vs. San Francisco* The Munich Security Conference and Europe's defense pivot* Playing macro games from a European vantage point* (07:55) VC investment styles and the “solo GP” approach* Taste as the determinant of investments* SF as a momentum game with small information asymmetry* Portfolio diversity: defense (Delian), embodied AI (Syriact), protein engineering* Finding entrepreneurs who “can't do anything else”* (10:44) State of AI progress in 2025* Momentous progress in writing, research, computer use, image, and video* We're in the “instruction manual” phase* The scale of investment: private markets, public markets, and nation states* (13:21) Range of outcomes and what “going bad” looks like* Today's systems are genuinely useful—worst case is a valuation problem* Financialization of AI buildouts and GPUs* (14:55) DeepSeek and China closing the capability gap* Seven-month lag analysis (Epoch AI)* Benchmark skepticism and consumer preferences (”Coca-Cola vs. Pepsi”)* Hedonic adaptation: humans reset expectations extremely quickly* Bifurcation of model companies toward specific product bets* (18:29) Export controls and the “evolutionary pressure” argument* Selective pressure breeds innovation* Chinese companies rushing to public markets (Minimax, ZAI)* (21:30) Reasoning models and test-time compute* Chain of thought faithfulness questions* Monitorability tax: does observability reduce quality?* User confusion about when models should “think”* AI for science: literature agents, hypothesis generation* (23:53) Chain of thought interpretability and safety* Anthropomorphization concerns* Alignment faking and self-preservation behaviors* Cybersecurity as a bigger risk than existential risk* Models as payloads injected into critical systems* (27:26) Commercial traction and AI adoption data* Ramp data: 44% of US businesses paying for AI (up from 5% in early 2023)* Average contract values up to $530K from $39K* State of AI survey: 92% report productivity gains* The “slow takeoff” consensus and human inertia* Use cases: meeting notes, content generation, brainstorming, coding, financial analysis* (32:53) The industrial era of AI* Stargate and XAI data centers* Energy infrastructure: gas turbines and grid investment* Labs need to own models, data, compute, and power* Poolside's approach to owning infrastructure* (35:40) Venture capital in the age of massive GPU capex* The GP lives in the present, the entrepreneur in the future, the LP in the past* Generality vs. specialism narratives* “Two or 20”: management fees vs. carried interest* Scaling funds to match entrepreneur ambitions* (40:10) NVIDIA challengers and returns analysis* Chinese challengers: 6x return vs. 26x on NVIDIA* US challengers: 2x return vs. 12x on NVIDIA* Grok acquired for $20B; Samba Nova markdown to $1.6B* “The tide is lifting all boats”—demand exceeds supply* (44:06) The hardware lottery and architecture convergence* Transformer dominance and custom ASICs making a comeback* NVIDIA still 90–95% of published AI research* (45:49) AI regulation: Trump agenda and the EU AI Act* Domain-specific regulators vs. blanket AI policy* State-level experimentation creates stochasticity* EU AI Act: “born before GPT-4, takes effect in a world shaped by GPT-7”* Only three EU member states compliant by late 2025* (50:14) Sovereign AI: what it really means* True sovereignty requires energy, compute, data, talent, chip design, and manufacturing* The US is sovereign; the UK by itself is not* Form alliances or become world-class at one level of the stack* ASML and the Netherlands as an example* (52:33) Open weight safety and containment* Three paths: model-based safeguards, scaffolding/ecosystem, procedural/governance* “Pandora's box is open”—containment on distribution, not weights* Leak risk: the most vulnerable link is often human* Developer–policymaker communication and regulator upskilling* (55:43) China's AI safety approach* Matt Sheehan's work on Chinese AI regulation* Safety summits and China's participation* New Chinese policies: minor modes, mental health intervention, data governance* UK's rebrand from “safety” to “security” institutes* (58:34) Prior predictions and patterns* Hits on regulatory/political areas; misses on semiconductor consolidation, AI video games* (59:43) 2026 Predictions* A Chinese lab overtaking US on frontier (likely ZAI or DeepSeek, on scientific reasoning)* Data center NIMBYism influencing midterm politics* (01:01:01) ClosingLinks and ResourcesNathan / Air Street Capital* Air Street Capital* State of AI Report 2025* Air Street Press — essays, analysis, and the Guide to AI newsletter* Nathan on Substack* Nathan on Twitter/X* Nathan on LinkedInFrom Air Street Press (mentioned in episode)* Is the EU AI Act Actually Useful? — by Max Cutler and Nathan Benaich* China Has No Place at the UK AI Safety Summit (2023) — by Alex Chalmers and Nathan BenaichResearch & Analysis* Epoch AI: Chinese AI Models Lag US by 7 Months — the analysis referenced on the US-China capability gap* Sara Hooker: The Hardware Lottery — the essay on how hardware determines which research ideas succeed* Matt Sheehan: China's AI Regulations and How They Get Made — Carnegie EndowmentCompanies Mentioned* Eleven Labs — AI voice synthesis (Air Street portfolio)* Synthesia — AI video generation (Air Street portfolio)* Crusoe — clean compute infrastructure (Air Street portfolio)* Poolside — AI for code (Air Street portfolio)* DeepSeek — Chinese AI lab* Minimax — Chinese AI company* ASML — semiconductor equipmentOther Resources* Search Engine Podcast: Data Centers (Part 1 & 2) — PJ Vogt's two-part series on XAI data centers and the AI financing boom* RAAIS Foundation — Nathan's AI research and education charity Get full access to The Gradient at thegradientpub.substack.com/subscribe

Alles auf Aktien
Die KI-Volte von Walmart und diese Aktien kaufen die Deutschen

Alles auf Aktien

Play Episode Listen Later Jan 12, 2026 25:28


In der heutigen Folge sprechen die Finanzjournalisten Daniel Eckert und Lea Oetjen über den digitalen Euro, einen Kurssprung bei Sandisk und Trumps Warnung an den Iran. Außerdem geht es um Rheinmetall, Nvidia, Apple, RocketLab, SpaceX, Microsoft, Meta Platforms, Alphabet (Google), Tesla, Amazon, Droneshield, Renk Group, Thyssenkrupp Marine Systems, MiniMax, Zhipu AI, Teamviewer und AstraZeneca. Wir freuen uns an Feedback über aaa@welt.de. Noch mehr "Alles auf Aktien" findet Ihr bei WELTplus und Apple Podcasts – inklusive aller Artikel der Hosts und AAA-Newsletter. Hier bei WELT: https://www.welt.de/podcasts/alles-auf-aktien/plus247399208/Boersen-Podcast-AAA-Bonus-Folgen-Jede-Woche-noch-mehr-Antworten-auf-Eure-Boersen-Fragen.html. Der Börsen-Podcast Disclaimer: Die im Podcast besprochenen Aktien und Fonds stellen keine spezifischen Kauf- oder Anlage-Empfehlungen dar. Die Moderatoren und der Verlag haften nicht für etwaige Verluste, die aufgrund der Umsetzung der Gedanken oder Ideen entstehen. Hörtipps: Für alle, die noch mehr wissen wollen: Holger Zschäpitz können Sie jede Woche im Finanz- und Wirtschaftspodcast "Deffner&Zschäpitz" hören. +++ Werbung +++ Du möchtest mehr über unsere Werbepartner erfahren? Hier findest du alle Infos & Rabatte! https://linktr.ee/alles_auf_aktien Impressum: https://www.welt.de/services/article7893735/Impressum.html Datenschutz: https://www.welt.de/services/article157550705/Datenschutzerklaerung-WELT-DIGITAL.html

OHNE AKTIEN WIRD SCHWER - Tägliche Börsen-News
“Mega-Deal: Rio Tinto x Glencore” - Atomkraft bei Meta, Trump-Effekt & Innodata

OHNE AKTIEN WIRD SCHWER - Tägliche Börsen-News

Play Episode Listen Later Jan 12, 2026 12:59


Bis zu 2.500 € Bonus von Scalable Capital. Neu- und Bestandskunden, die Wertpapiere oder Guthaben bei Scalable Capital einzahlen, können sich bis zum 15.01.2026 einen Bonus sichern. Alle Infos gibt's hier: scalable.capital/transfer-bonus. KI, Deals und Trump. Das war Börse am Freitag. MiniMax, Meta, Oklo, Vistra, Softbank & OpenAI hatten KI-Themen. Exxon, Lennar, Pulte, D.R. Horton & Opendoor hatten Trump-Themen. Merck, Revolution Medicines, Baywa, Amazon & CSG hatten Deal-Themen. 2026 könnte den größten Bergbau-Deal der Geschichte bringen. Es geht natürlich im Kupfer. Rio Tinto (WKN: 852147) x Glencore (WKN: A1JAGV). Jahrelang war Innodata (WKN: 907651) langweiliger Digitalisierer. Jetzt sind sie KI-Schaufelverkäufer. Diesen Podcast vom 12.01.2026, 3:00 Uhr stellt dir die Podstars GmbH (Noah Leidinger) zur Verfügung.

The MadTech Podcast
MadTech Daily: TikTok Splits US Staff Ahead of Sale; DeepSeek Rival MiniMax Sees Shares Soar 87%

The MadTech Podcast

Play Episode Listen Later Jan 12, 2026 1:53


In today's MadTech Daily we discuss TikTok splitting its US staff ahead of a potential sale, China AI listings lifting MiniMax's shares by 87%, and Anthropic planning a $10bn raise to reach a $350bn valuation.

TechCheck
China's AI-driven IPO revival 1/5/26

TechCheck

Play Episode Listen Later Jan 5, 2026 6:59


AI names are pumping new life into China's IPO market, with model maker “Mini Max” reportedly set to price at the top of its range tomorrow. We dig into what the country's IPO revival means for the competition between U.S. and Chinese tech.  Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

DUBAI WORKS Business Podcast
$1B Rejected, $3B Won | Gulf Pivots From Premier League Takeovers | Adia Backs $6.5B AI Unicorn IPO

DUBAI WORKS Business Podcast

Play Episode Listen Later Jan 1, 2026 4:17


What would you do if someone offered you $1 billion for your six-person startup? Jordanian-American founder Amjad Masad said no—and today, Replit is worth $3 billion. In this episode, we break down his gutsy decision and Marc Andreessen's game-changing advice. Plus: dealmaker Amanda Staveley reveals why Gulf sovereign funds won't be buying more Premier League clubs (and what they're doing instead), and Abu Dhabi Investment Authority places a $65M bet on Chinese AI startup MiniMax—despite it losing half a billion dollars a year. From billion-dollar rejections to shifting investment strategies, here's everything happening in Arab business and tech. Newsletter: aug.us/4jqModrWhatsApp: aug.us/40FdYLUInstagram: aug.us/4ihltzQTiktok: aug.us/4lnV0D8Smashi Business Show (Mon-Friday): aug.us/3BTU2MY

枫言枫语
Vol. 157 智谱和MiniMax的招股书都说了啥?

枫言枫语

Play Episode Listen Later Dec 30, 2025 102:17


最近智谱和MiniMax双双赴港上市,截至我们录音的时间,两家公司刚通过港交所聆讯。他们的招股书上也是透露了不少AI创业公司的信息,于是我们两位主播就一起读了一下。 招股书既有面向律师、投行等专业人士的部分,也有面向大众投资者介绍自家公司业务的部分,所以读起来不会太困难。这两家公司虽然实际上都属于中国创业团队,但他们不仅注册地不同,面向的用户群和业务路线也是截然相反。智谱是学术科研团队出身,主要面向国内政企公司,类似to B/G的做法。而MiniMax则是国内创业但主打出海,类似to C的做法。 相同的是这两家公司都有自己的大模型,也都有自己的AI原生产品,甚至是同样的非常烧钱。 最后还是保命声明一下,我们两位主播都不是大模型与金融专业人士,只是刚好看到这两家公司的招股书觉得有意思,所以我们来一起看个热闹。 P.S. 本期节目发布恰逢Manus被Meta收购,对于创始团队来说,一方面是卖了个好价钱,另一方面收购如果能保持独立运营相当于找到了稳定的资金来源,将是一大好事。不过以Meta过去收购案例来看,“保持独立”恐怕凶多吉少。总之恭喜Manus

Crazy Wisdom
Episode #516: China's AI Moment, Functional Code, and a Post-Centralized World

Crazy Wisdom

Play Episode Listen Later Dec 22, 2025 64:59


In this episode, Stewart Alsop sits down with Joe Wilkinson of Artisan Growth Strategies to talk through how vibe coding is changing who gets to build software, why functional programming and immutability may be better suited for AI-written code, and how tools like LLMs are reshaping learning, work, and curiosity itself. The conversation ranges from Joe's experience living in China and his perspective on Chinese AI labs like DeepSeek, Kimi, Minimax, and GLM, to mesh networks, Raspberry Pi–powered infrastructure, decentralization, and what sovereignty might mean in a world where intelligence is increasingly distributed. They also explore hallucinations, AlphaGo's Move 37, and why creative “wrongness” may be essential for real breakthroughs, along with the tension between centralized power and open access to advanced technology. You can find more about Joe's work at https://artisangrowthstrategies.com and follow him on X at https://x.com/artisangrowth.Check out this GPT we trained on the conversationTimestamps00:00 – Vibe coding as a new learning unlock, China experience, information overload, and AI-powered ingestion systems05:00 – Learning to code late, Exercism, syntax friction, AI as a real-time coding partner10:00 – Functional programming, Elixir, immutability, and why AI struggles with mutable state15:00 – Coding metaphors, “spooky action at a distance,” and making software AI-readable20:00 – Raspberry Pi, personal servers, mesh networks, and peer-to-peer infrastructure25:00 – Curiosity as activation energy, tech literacy gaps, and AI-enabled problem solving30:00 – Knowledge work superpowers, decentralization, and small groups reshaping systems35:00 – Open source vs open weights, Chinese AI labs, data ingestion, and competitive dynamics40:00 – Power, safety, and why broad access to AI beats centralized control45:00 – Hallucinations, AlphaGo's Move 37, creativity, and logical consistency in AI50:00 – Provenance, epistemology, ontologies, and risks of closed-loop science55:00 – Centralization vs decentralization, sovereign countries, and post-global-order shifts01:00:00 – U.S.–China dynamics, war skepticism, pragmatism, and cautious optimism about the futureKey InsightsVibe coding fundamentally lowers the barrier to entry for technical creation by shifting the focus from syntax mastery to intent, structure, and iteration. Instead of learning code the traditional way and hitting constant friction, AI lets people learn by doing, correcting mistakes in real time, and gradually building mental models of how systems work, which changes who gets to participate in software creation.Functional programming and immutability may be better aligned with AI-written code than object-oriented paradigms because they reduce hidden state and unintended side effects. By making data flows explicit and preventing “spooky action at a distance,” immutable systems are easier for both humans and AI to reason about, debug, and extend, especially as code becomes increasingly machine-authored.AI is compressing the entire learning stack, from software to physical reality, enabling people to move fluidly between abstract knowledge and hands-on problem solving. Whether fixing hardware, setting up servers, or understanding networks, the combination of curiosity and AI assistance turns complex systems into navigable terrain rather than expert-only domains.Decentralized infrastructure like mesh networks and personal servers becomes viable when cognitive overhead drops. What once required extreme dedication or specialist knowledge can now be done by small groups, meaning that relatively few motivated individuals can meaningfully change communication, resilience, and local autonomy without waiting for institutions to act.Chinese AI labs are likely underestimated because they operate with different constraints, incentives, and cultural inputs. Their openness to alternative training methods, massive data ingestion, and open-weight strategies creates competitive pressure that limits monopolistic control by Western labs and gives users real leverage through choice.Hallucinations and “mistakes” are not purely failures but potential sources of creative breakthroughs, similar to AlphaGo's Move 37. If AI systems are overly constrained to consensus truth or authority-approved outputs, they risk losing the capacity for novel insight, suggesting that future progress depends on balancing correctness with exploratory freedom.The next phase of decentralization may begin with sovereign countries before sovereign individuals, as AI enables smaller nations to reason from first principles in areas like medicine, regulation, and science. Rather than a collapse into chaos, this points toward a more pluralistic world where power, knowledge, and decision-making are distributed across many competing systems instead of centralized authorities.

Screw The Commute Podcast
1059 - Image to Video is easier: Tom talks Hailuo

Screw The Commute Podcast

Play Episode Listen Later Nov 19, 2025 9:32


Today we're going to talk Hailuo. It's made by Minimax, which is what I covered on episode 1058. This is 1059 and it is a fantastic image to video AI model. Screw The Commute Podcast Show Notes Episode 1059 How To Automate Your Business - https://screwthecommute.com/automatefree/ Internet Marketing Training Center - https://imtcva.org/ Higher Education Webinar – https://screwthecommute.com/webinars See Tom's Stuff – https://linktr.ee/antionandassociates 00:23 Tom's introduction to Hailuo 03:16 Find this at FAL.ai and Minimax 05:14 You can tell it what emotion you want for your character 06:34 What AI can do for you Entrepreneurial Resources Mentioned in This Podcast Higher Education Webinar - https://screwthecommute.com/webinars Screw The Commute - https://screwthecommute.com/ Screw The Commute Podcast App - https://screwthecommute.com/app/ Screw The Commute Podcast Producer - https://screwthecommute.com/larryguerrera/ College Ripoff Quiz - https://imtcva.org/quiz Know a young person for our Youth Episode Series? Send an email to Tom! - orders@antion.com Have a Roku box? Find Tom's Public Speaking Channel there! - https://channelstore.roku.com/details/267358/the-public-speaking-channel How To Automate Your Business - https://screwthecommute.com/automatefree/ Internet Marketing Retreat and Joint Venture Program - https://greatinternetmarketingtraining.com/ This is the shopping cart system Tom uses! Kartra - https://screwthecommute.com/kartra/ Copywriting901 - https://copywriting901.com/ Become a Great Podcast Guest - https://screwthecommute.com/greatpodcastguest Training - https://screwthecommute.com/training Disabilities Page - https://imtcva.org/disabilities/ Tom's Patreon Page - https://screwthecommute.com/patreon/ Tom on TikTok - https://tiktok.com/@digitalmultimillionaire/ Email Tom: Tom@ScrewTheCommute.com Internet Marketing Training Center - https://imtcva.org/ Related Episodes FAL.ai - https://screwthecommute.com/1055/ Nano Banana - https://screwthecommute.com/1056/ Image to Video - https://screwthecommute.com/1057/ Minimax Text To Speech - https://screwthecommute.com/1058/ More Entrepreneurial Resources for Home Based Business, Lifestyle Business, Passive Income, Professional Speaking and Online Business I discovered a great new headline / subject line / subheading generator that will actually analyze which headlines and subject lines are best for your market. I negotiated a deal with the developer of this revolutionary and inexpensive software. Oh, and it's good on Mac and PC. Go here: http://jvz1.com/c/41743/183906 The Wordpress Ecourse. Learn how to Make World Class Websites for $20 or less. https://screwthecommute.com/wordpressecourse/

Screw The Commute Podcast
1058 - Another way to create audio: Tom talks Minimax Text To Speech

Screw The Commute Podcast

Play Episode Listen Later Nov 17, 2025 5:45


Let's Talk AI
#224 - OpenAI is for-profit! Cursor 2, Minimax M2, Udio copyright

Let's Talk AI

Play Episode Listen Later Nov 5, 2025 91:43


Our 224th episode with a summary and discussion of last week's big AI news!Recorded on 10/31/2025Hosted by Andrey Kurenkov and co-hosted by Gavin Purcell (check out AI For Humans and AndThen!)Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.aiRead out our text newsletter and comment on the podcast at https://lastweekin.ai/In this episode:OpenAI completes its for-profit restructuring, redefining its relationship with Microsoft and securing future investments. Meanwhile, Qualcomm and other tech giants announce new AI chips aimed at competing with Nvidia and AMD, marking major advancements in AI hardware capabilities. Amazon and Google deepen their partnerships with Anthropic, providing extensive computing resources to enhance AI research and applications. These developments signal significant growth and competition in the AI industry. Major AI tools and models were released and updated, including Cursor 2.0, CLAUDE coding capabilities, and open-source options from Minimax. These new tools offer a range of functionalities for coding, design, and more. Legal battles around AI copyright issues persist, as OpenAI faces ongoing lawsuits from authors over text generation using copyrighted material. Universal Music Group settles a copyright suit with AI music startup UDO, transitioning to a licensed model for AI-generated music. This shift reflects broader challenges and adaptations in the AI-generated content space, where copyright and ethical usage remain highly contentious issues.Timestamps:(00:00:10) Intro / Banter(00:02:44) News PreviewTools & Apps(00:03:44) Cursor 2.0 shifts to in-house AI with Composer model and parallel agents(00:07:44) Anthropic brings Claude Code to the web | TechCrunch(00:10:01) Microsoft's Mico is a 'Clippy' for the AI era | TechCrunch(00:14:20) Anthropic's Claude catches up to ChatGPT and Gemini with upgraded memory features | The Verge(00:18:46) Canva launches its own design model, adds new AI features to the platform | TechCrunch(00:21:07) Elon Musk's Grokipedia launches with AI-cloned pages from Wikipedia | The VergeApplications & Business(00:25:10) OpenAI completed its for-profit restructuring — and struck a new deal with Microsoft | The Verge(00:31:25) Qualcomm announces AI chips to compete with AMD and Nvidia(00:34:02) Amazon launches AI infrastructure project, to power Anthropic's Claude model | Reuters(00:38:52) Google and Anthropic announce cloud deal worth tens of billions(00:39:46) Google partners with Ambani's Reliance to offer free AI Pro access to millions of Jio users in India | TechCrunchProjects & Open Source(00:41:17) MiniMax Releases MiniMax M2: A Mini Open Model Built for Max Coding and Agentic Workflows at 8% Claude Sonnet Price and ~2x Faster - MarkTechPost(00:45:22) [2510.25741] Scaling Latent Reasoning via Looped Language Models(00:47:59) OpenAI's gpt-oss-safeguard enables developers to build safer AI - Help Net SecurityResearch & Advancements(00:49:51) [2510.15103] Continual Learning via Sparse Memory Finetuning(00:54:01) [2510.18091] Accelerating Vision Transformers with Adaptive Patch Sizes(00:57:46) [2510.18871] How Do LLMs Use Their Depth?Policy & Safety(01:01:07) AMD, Department of Energy announce $1 billion AI supercomputer partnership | The Verge(01:03:03) Synthetic Media & Art(01:09:34) Universal partners with AI startup Udio after settling copyright suit | The Verge(01:16:04) OpenAI loses bid to dismiss part of US authors' copyright lawsuit | ReutersSee Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.