Podcasts about gtc

  • 396PODCASTS
  • 979EPISODES
  • 44mAVG DURATION
  • 1DAILY NEW EPISODE
  • Jun 10, 2026LATEST

POPULARITY

20192020202120222023202420252026


Best podcasts about gtc

Show all podcasts related to gtc

Latest podcast episodes about gtc

The Unleashing Leaders Podcast
Perseverance in the Face of Adversity: From Geopolitics to the Gut Check with George Tagg Jr. | Ep 56

The Unleashing Leaders Podcast

Play Episode Listen Later Jun 10, 2026 47:12


Setbacks don't send a calendar invite. They show up at 6 a.m. on a Teams call. In a hospital room. On a battlefield. George Tagg Jr. has met adversity at every one of those addresses. And every single time, he found a way forward. Attorney. Former DOJ Nazi hunter. A State Department and DOD official who negotiated in the Caucasus, was on the ground in Kyiv when Russia's invasion began, and managed the deconfliction line between U.S. and Russian forces in Syria. George has served across multiple presidential administrations. He is now founder and CEO of GTC 360 Advisors. And writing a book on resilience, including a chapter on dealing with bullies. Starting with Putin. But if you ask George, his hardest moments were never on a battlefield. Losing his grandparents young. Then his father, one of the most magnetic lobbyists in DC during the 90s, gone suddenly to a heart attack six months after 9/11. George was 20. He could have gone the other direction. Instead, he picked up the legacy, held onto the lessons, and kept going. That is what this episode is really about. Not the geopolitics. The process. A repeatable, human way of turning your worst moments into the thing that moves you forward. If you have ever faced a moment that felt unsurvivable, this one is for you.   Additional Resources: Connect with George on LinkedIn Learn more about GTC 360 Advisors Attend Unleashing Leaders University! Sign up for our newsletter! Learn more about Unleashing Leaders Follow Unleashing Leaders on LinkedIn Connect with Lee on LinkedIn Follow Unleashing Leaders on Facebook Follow Unleashing Leaders on Instagram   Key Takeaways: Lick Your Wounds, Then Move: why giving yourself grace after a setback is not weakness, it is the first step The Blamestorm Trap: how blaming others feels good for a minute and keeps you stuck indefinitely The 180 Turn: George's three-phase framework from retrospective grief to forward-facing momentum The Rabbi Rule: why your results will never speak for themselves and the connective tissue that keeps you standing The Rule of Threes: why trying something once tells you almost nothing and how to find what reignites you Compounding Works Both Ways: small investments in action compound powerfully  

Pi Tech
News: осінь на ринку LLM?; як не треба робити метріки; локальний інференс майбутнього

Pi Tech

Play Episode Listen Later Jun 10, 2026 64:37


У цьому випуску обговорюємо зміни в настроях AI-індустрії. Компанії все рідше говорять про повну заміну людей ШІ і все частіше позиціонують його як інструмент для підвищення продуктивності. Говоримо про NVIDIA RTX Spark та зростання інтересу до локального інференсу, проблеми використання токенів як метрики ефективності, реальну вартість впровадження AI для бізнесу та можливе охолодження ринку. Також розбираємо заяви про домінування ботів у вебтрафіку, проблеми AI-підтримки в Meta, новини від Google та концепцію JEPA від Яна Лекуна як одну з потенційних альтернатив сучасним LLM. 00:40 — огляд GTC та нових продуктів NVIDIA 04:00 — порівняння NVIDIA Spark з Mac Studio 11:57 — тенденції в AI та локальному інференсі  18:29 — вплив цін на компоненти на ринок технологій 22:12 — перспективи локального інференсу та прогрес у великих моделях 28:45 — судові справи та авторські права в AI 32:29 — боти перевищують людей у мережевому трафіку 35:34 — презентація Google та нові технології помічників 39:19 — купівля кодів та нові можливості для розробників 42:53 — архітектура та нові концепції в ML 48:09 — абстракція в моделюванні та предикції 52:11 — потенціал LLM та їх правильне використання 56:13 — адаптація до нових реалій в програмуванні 01:02:25 — культ суперменів у IT

The Military Money Manual Podcast
PCS Season Payoff: How Military Families Turn Moving Expenses Into Free Vacations #232

The Military Money Manual Podcast

Play Episode Listen Later Jun 8, 2026 47:28


20,000 Dollars, 4 Cards, 400,000 Points. Spencer Reese of the Military Money Manual podcast sits down with Taryn from The Military Travelers to break down how military families can turn PCS moving expenses into free travel — without spending a single extra dollar. From sign-up bonuses to Hyatt Globalist status, this episode is packed with practical strategies for the upcoming PCS season. Topics Covered PCS Season Spending Strategy — Why PCS moves generate $10,000–$20,000+ in spendable expenses and how to redirect that spend to hit credit card sign-up bonuses GTC vs. Personal Cards — The frustrating reality of the Government Travel Card and why (for most branches) personal cards make more sense Branch-Specific Rules — Important caveats for Navy families and other branches regarding on-post lodging forms and card usage policies The Math of Points — Why putting PCS spend on one existing card (like the Amex Platinum at 1X) leaves massive value on the table compared to opening new cards Business Cards for Military Families — How a PPM qualifies you as a sole proprietor, why business cards don't hit your personal credit report, and how the Chase 5/24 rule makes them essential for bigger families Current Elevated Offers — Hot cards highlighted: Chase Sapphire Reserve (150K points), Atmos Summit (100K points + 3X on foreign transactions), United Club, Marriott Brilliant, Hilton Aspire, and Delta Reserve TLE Maximization — Using all 21 days of Temporary Lodging Expense, loyalty numbers, and stacking hotel promos during your move Hyatt Globalist Corporate Challenge — How a .mil email address unlocks Globalist status after 20 nights in 90 days (valid through February 2028) Multiple Cards Strategy — Holding multiple Hilton Aspires and Chase Sapphire Reserves, and the tip to align annual free night certificate expiration dates Military Spouse Eligibility — A reminder that spouses are fully eligible for annual fee waivers under MLA/SCRA Resources Mentioned The Military Travelers — themilitarytravelers.com | Facebook Group: "The Military Travelers" | Instagram: @themilitarytravelers Travel Freely (travelfreely.net) — Zach's beginner's guide to business cards and sole proprietor applications IRS EIN Application — Free Employer Identification Number at irs.gov (2-minute online form) Hyatt Globalist Corporate Challenge — Search "Hyatt Globalist Corporate Challenge" and enroll with a .mil email Spencer and Jamie offer one-on-one Military Money Mentor sessions. Get your personal military money and personal finance questions answered in a confidential coaching call. militarymoneymanual.com/mentor Over 22,000 military servicemembers and military spouses have graduated from the 100% free, Ultimate Military Credit Cards Course available at militarymoneymanual.com/umc3 Military Money Manual may receive compensation from JPMC. Opinions expressed here are author's alone, not those of any bank, credit card issuer, airlines or hotel chain. If you want to maximize your military paycheck, check out Spencer's 5 star rated book The Military Money Manual: A Practical Guide to Financial Freedom on Amazon or at shop.militarymoneymanual.com. If you have a question you would like us to answer on the podcast, please reach out on instagram.com/militarymoneymanual.

Acting Up with GTC
AUGTC S3 E6: What, Like It's Hard? Bringing Legally Blonde to Life

Acting Up with GTC

Play Episode Listen Later Jun 3, 2026 55:49


tickets legally blonde gtc acting up legally blonde the musical
有話好說
台灣民眾薪情差 韓媒評"乞丐超人"?(2026/06/01)

有話好說

Play Episode Listen Later Jun 1, 2026 29:10


黃仁勳旋風來台,參加Computex 也帶來新的AI技術跟訂單,台灣150家廠商供應鏈成黃仁勳最強後盾,AI經濟蓬勃,台灣經濟成長超越南韓,卻被南韓形容,台灣有不少乞丐超人,盯著便利商店打折品,人均所得低於南韓,落差真有這麼大嗎?AI旋風帶來經濟成長與出口成長,是否也造就台灣K型經濟,貧富差距拉大?歡迎加入《尖鋒對話》,一起深度探討。

Acting Up with GTC
AUGTC S3 E6: What, Like It's Hard? Bringing Legally Blonde to Life

Acting Up with GTC

Play Episode Listen Later Jun 1, 2026 55:49


tickets legally blonde gtc acting up legally blonde the musical
Tech Update | BNR
Nvidia komt nu juist met chips voor laptops en desktop-pc's onder naam RTX Spark

Tech Update | BNR

Play Episode Listen Later Jun 1, 2026 3:58


Nvidia heeft tijdens de eigen goed-nieuwsshow GTC, gehouden rond techbeurs Computex in Taiwan, nieuwe chips aangekondigd die juist voor apparaten van eindgebruikers zelf bedoeld zijn. Het betreft de RTX Spark, voorzien van zowel een CPU als een GPU, om te concurreren met AMD en Intel. Joe van Burik vertelt erover in deze Tech Update. Verder in deze Tech Update: Softbank, bekend van investeringen in OpenAI, gaat tientallen miljarden in AI-datacenters in Frankrijk steken See omnystudio.com/listener for privacy information.

M觀點 | 科技X商業X投資
SP2. 輝達 GTC 直播抽顯卡 + 軟體股回神了 | M觀點特別篇

M觀點 | 科技X商業X投資

Play Episode Listen Later May 31, 2026 60:28


SP2. 輝達 GTC 直播抽顯卡 + 軟體股回神了 | M觀點特別篇 --- (00:00)第一個話題:輝達 GTC 直播抽顯卡 (20:29)第二個話題:軟體股回神了 --- M觀點資訊 --- 科技巨頭解碼: https://bit.ly/2XupBZa M觀點 Telegram - https://t.me/miulaviewpoint M觀點 IG - https://www.instagram.com/miulaviewpoint/ M觀點Podcast - https://bit.ly/34fV7so M報: https://bit.ly/345gBbA M觀點YouTube頻道訂閱 https://bit.ly/2nxHnp9 M觀點粉絲團 https://www.facebook.com/miulaperspective/ 任何合作邀約請洽 miula@outlook.com -- Hosting provided by SoundOn

聽天下:天下雜誌Podcast
【天下零時差06.01.26】輝達GTC、Computex登場;國際油價漲,台灣通膨為何低?;聯準會公布最新褐皮書

聽天下:天下雜誌Podcast

Play Episode Listen Later May 31, 2026 6:11


週一天下零時差關注以下財經大事: 一、輝達執行長黃仁勳在GTC與Computex將為台廠帶來什麼利多? 二、聯準會公布最新褐皮書,美伊戰爭如何影響美國經濟情勢? 三、國際油價高漲,為何台灣通膨這麼低? 文:郭家宏 製作團隊:錢玉紘、鄭子鴻 *閱讀零時差,點這看全文

聽天下:天下雜誌Podcast
【阿榕伯胡說科技Ep.76】5月科技大事解析:黃仁勳再度訪台、聯發科股價噴發、SpaceX上市倒數

聽天下:天下雜誌Podcast

Play Episode Listen Later May 28, 2026 59:55


這集節目是5/25 胡說科技YouTube直播內容,想看影像版歡迎點擊連結:https://cww.psee.ly/959e8p 5月科技大事,你最關心哪件事? 一、輝達財報公布,黃仁勳再度訪台 二、聯發科股價大爆發,市值逼近7兆 三:SpaceX上市倒數,地表規模最大IPO 阿榕伯直播講給你聽。 錯過阿榕伯X群聯電子潘健成首場實體閉門對談!還有精彩回放可以看:https://bit.ly/4ntlMLY 關於《胡說科技》: 天下獨家推出《胡說科技》,這是第一份為關注產業趨勢讀者打造的科技專欄頻道。由天下總主筆陳良榕,半導體資深記者,耕耘科技報導多年,親自操刀,每週出刊。《胡說科技》結合對產業的敏感度,提供獨家、深刻的觀點與分析,不談花俏技術,不用行話堆疊,每週一篇,希望讀者在短時間掌握影響決策的科技動向。 *到官網看最新文章,立即訂閱: https://bit.ly/3TuL8eb *意見信箱:bill@cw.com.tw -- Hosting provided by SoundOn

Entreprendre dans la mode
[CLIP] World Models, Real-Time Video and the Decade Ahead | Jamie Umpherson (Runway)

Entreprendre dans la mode

Play Episode Listen Later May 22, 2026 12:24


Chip Stock Investor Podcast
Lumentum: 90% Revenue Growth, a $2 Billion Nvidia Investment, Triple Digits Coming — and the Dilution Story Nobody Is Covering

Chip Stock Investor Podcast

Play Episode Listen Later May 13, 2026 10:01


Over a year ago, CSI did a three-part deep dive on co-packaged optics after Nvidia dedicated an entire segment of its GTC keynote to the technology — naming Lumentum and Coherent as the primary beneficiaries. The analysis was right. They did not buy.That mistake is now worth talking about directly.Lumentum just reported fiscal Q3 2026 revenue up 90% year over year. Q4 guidance implies triple-digit year-over-year growth. Nvidia made a $2 billion investment in both Lumentum and Coherent, and separately announced a major fiber optic cable manufacturing expansion with Corning. Co-packaged optics products have not even begun shipping in volume yet — that catalyst hits in December 2026. The case for Lumentum continues to build.But this is CSI, and true conviction in a business means covering what could go wrong as well as what is going right. There is a significant dilution story unfolding that every Lumentum shareholder needs to understand before adding to a position.When the stock was trading at roughly one-tenth of its current price, Lumentum raised cash by issuing convertible notes — a type of debt that converts to equity when the stock reaches certain price milestones. The stock has now blown through those milestones. All of that convertible debt is now eligible to convert into stock at terms that are extremely favorable for the debt holders and extremely expensive for existing shareholders. The result: shares outstanding are expected to increase by approximately 20% over the next two quarters. Nick and Kasey explain the full mechanics clearly — why it happened, what it costs, and whether the revenue acceleration can outrun the dilution.Also covered: the Qorvo fab acquisition in North Carolina that adds indium phosphide manufacturing capacity in two to three years, and what operating leverage looks like when a company goes from negative margins to all-time highs in the span of a few quarters.What we cover:— Why CSI did the deep dive on co-packaged optics and still did not buy — the honest lesson— Lumentum fiscal Q3 2026: 90% revenue growth — what drove it and what comes next— Q4 guidance: triple-digit YoY growth before CPO products even ramp— Nvidia's $2B investment in Lumentum and Coherent — the supply chain signal— Nvidia and Corning fiber optic expansion — Nvidia's hands all over the supply chain— Co-packaged optics — the December 2026 catalyst that has not landed yet— Operating leverage in action: from negative margins to all-time highs— Convertible notes explained: why ~20% share dilution is coming in 2026— Qorvo North Carolina fab acquisition — InP capacity coming in two to three years— The bottleneck in laser module manufacturing and why Lumentum dominates itSponsored by fiscal.ai — 25% off any paid plan through May 14 only. Use our link: fiscal.ai/csiDisclosure: Nick and Kasey hold positions in Lumentum and Coherent. This content is for general information only and is not individual investment advice. All investing involves risk.chipstockinvestor.com

Startup Island TAIWAN Podcast
EP3-37 | What Makes AI Leaders Different? The Taiwan Connection Behind Jensen Huang and the AI Boom

Startup Island TAIWAN Podcast

Play Episode Listen Later May 12, 2026 22:51


NVIDIA GTC has outgrown the label of "tech conference" — it's now the annual convergence point of the entire AI ecosystem, where chips, software, robotics, and autonomous systems share the same floor, with an energy so dense that finding a parking spot takes two hours. In this episode, Silicon Valley-based veteran tech journalist and consultant Michelle Cheng joins host Uly fresh from GTC to give us a firsthand read on what the conference signals for the industry ahead.Michelle traces the deep ties between Jensen Huang and TSMC founder Morris Chang, and how Jensen's visits to Taiwan consistently bring the entire AI supply chain together at what the media has dubbed the "trillion-dollar banquet." Drawing on her experience covering Silicon Valley through the optical networking era, she maps today's AI infrastructure boom against the longer arc of industry cycles — demand never disappears, it just takes time to mature. She captures the mood of GTC with a drink called "Innovation Cocktail" she picked up at an after party: a blend of excitement and anticipation that defines where Silicon Valley stands right now.The episode closes on Jensen's own framing from the keynote — intelligence is becoming infrastructure — and Michelle's clear-eyed take: the window into the AI ecosystem is open, and the companies that find their position early tend to benefit the most when the cycle matures.NVIDIA GTC 已經不只是一場技術研討會——它是整個 AI 產業的年度聚合點,晶片、硬體、軟體、機器人、自駕車全在同一個屋簷下,能量之強連停車位都要搶兩個小時。這集邀請長期駐點矽谷、曾創辦科技媒體矽谷辦公室的資深記者兼顧問 Michelle Cheng,從剛結束的 GTC 現場出發,帶我們感受這場盛會的真實溫度。節目中,Michelle 從黃仁勳與台積電創辦人張忠謀的早期淵源,談到每次黃仁勳訪台召集台灣供應鏈夥伴的「兆元晚宴」,具體說明台灣在全球 AI 供應鏈中的結構性角色。她也從親身追蹤過的光纖網路世代出發,對照今天的 AI 基礎設施熱潮,指出產業週期的規律:需求從未消失,只是需要時間成熟。Michelle 用一杯在 GTC after party 喝到、名為「Innovation Cocktail」的調酒,形容當下矽谷興奮與期待交織的氛圍,並引用黃仁勳在 GTC 的核心論點作結——「Intelligence is becoming infrastructure」,AI 生態系的入場窗口正在開啟,能早期找到自己定位的公司,往往才是下一輪週期的最大受益者。

Get IT: Cybersecurity insights for the foreseeable future.
NVIDIA GTC 2026: AI, Robotics and Future Trends in Tech

Get IT: Cybersecurity insights for the foreseeable future.

Play Episode Listen Later May 12, 2026 21:09


Join KJ Burke, as he dives into the key highlights from NVIDIA's GTC 2026 conference. Discover the latest in AI advancements, robotics, space computing and enterprise strategies shaping the future of technology. To learn more, visit cdw.ca Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

SHOCK2 PODCAST
Podcast Special – Gratis Comic Tag 2026 (MGN Inside)

SHOCK2 PODCAST

Play Episode Listen Later May 7, 2026


Am Samstag, dem 9. Mai, findet der Gratis Comic Tag 2026 statt. Felicitas, Felix und Christoph stellen in dieser besonderen MGN-Episode alle 22 Hefte des GTC 2026 vor! Von Donald Duck über verschiedene Superhelden, bis zu Humor st für jeden Geschmack und für die ganze Famile etwas dabei! Achtung in kürze verlosen wir auf der SHOCK2 Webseite auch einen kompletten Satz mit allen 22 Heften! Alle Infos zum Gratis Comic Tag 2026 https://mag.shock2.info/tipp-gratis-comic-tag-2026-alle-infos-schon-jetzt/

CCA On the Air
CCA Accelerator Special Feature: Transforming Advising at Greenville Technical College

CCA On the Air

Play Episode Listen Later May 5, 2026 23:06


In this episode of CCA on the Air: Transforming Advising at Greenville Technical, we're joined by Brett Barclay (Brett.Barclay@gvltec.edu), the Dean of Academic Advising at Greenville Technical College in South Carolina. GTC has participated in CCA's Accelerator, a part of the Gates Foundation Intermediaries for Scale Investment, for nearly three years. Through this project, GTC has focused intently on advising transformation—listen to this episode to find out what they've done! 

Acting Up with GTC
AUGTC: S3 E5: The Power of Empathy: Why "To Kill a Mockingbird" Still Matters

Acting Up with GTC

Play Episode Listen Later May 4, 2026 42:02


Step behind the curtain with us on this powerful episode of Acting Up with GTC!

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0
Shopify's AI Phase Transition: 2026 Usage Explosion, Unlimited Opus-4.6 Token Budget, Tangle, Tangent, SimGym — with Mikhail Parakhin, Shopify CTO

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Apr 22, 2026 72:25


Early bird discounts for the San Francisco World's Fair, the biggest AIE gathering of the year, end today - prices will go up by ~$500 tonight so do please lock in ASAP!From near-universal AI tool adoption inside Shopify to internal systems for ML experimentation, auto-research, customer simulation, and ultra-low-latency search, Mikhail Parakhin joins us for a deep dive into what it actually looks like when a 20-year-old, $200B software company goes all-in on AI. We cover why Shopify has become much more vocal about its internal stack, what changed after the December model-quality inflection, and why the real bottleneck in AI coding is no longer generation, but review, CI/CD, and deployment stability.We also go inside Tangle, Tangent, SimGym, which are three major AI initiatives that Shopify is doing to make experimentation reproducible, optimization automatic, customer behavior simulatable, and search and catalog intelligence faster and cheaper at scale. Along the way, Mikhail explains UCP, Liquid AI, and why token budgets are directionally right but often measured badly, why AI-written code can still increase bugs in production, what makes Shopify's customer simulation defensible, and what he learned from the Sydney era at Bing.We discuss:* Mikhail's path from running a major Microsoft business unit spanning Windows, Edge, Bing, and ads to becoming CTO of Shopify* Why Shopify is talking more publicly about AI now, and why staying at the frontier has become necessary for the company* Shopify's internal AI adoption curve, the December inflection, and why CLI-style tools are rising faster than traditional IDE-based tools* Why Jensen Huang is directionally right on token budgets, but raw token count is still the wrong way to evaluate engineering output* Why the real unlock is not more agents in parallel, but better critique loops, stronger models, and spending more on review than generation* Why AI coding can still lead to more bugs in production even if models write cleaner code on average than humans* Why Shopify built its own PR review flow, and why Mikhail thinks most off-the-shelf review tools miss the point* How PR volume, test failures, and deployment rollback are becoming the real bottlenecks in the agent era* Why Git, pull requests, and CI/CD may need a new metaphor once code is written at machine speed* What Tangle is, and how Shopify uses it to make ML and data workflows reproducible, collaborative, and production-ready from the start* Why Tangle is different from Airflow, and why content-addressed caching creates network effects across teams* What Tangent is, and how Shopify is using auto-research loops to optimize search, themes, prompt compression, storage, and more* Why Tangent is becoming a democratizing tool for PMs and domain experts, not just ML engineers* Why AutoML finally feels real in the LLM era, and where auto-research still falls short today* Why Tangle, Tangent, and SimGym become much more powerful when combined into one system* What SimGym is, why simulated customers only work if you have real historical behavior, and why Shopify's data gives it a moat* How SimGym evolved from comparing A/B variants to telling merchants what to change on a single live storefront to raise conversions* Why customer simulation is so expensive, from multimodal models to browser farms to serving and distillation costs* How Shopify models merchant and buyer trajectories, runs counterfactuals, and thinks about interventions like discounts, campaigns, and notifications* Why category-level behavior is so different across commerce, and why ideas like Chinese Restaurant Processes are showing up again in practice* Shopify's new UCP and catalog work, including runtime product search, bulk lookups, and identity linking* Why Shopify is using Liquid AI, and why Mikhail sees it as the first genuinely competitive non-transformer architecture he has used in practice* Where Liquid already works inside Shopify today, from low-latency query understanding to large-scale catalog and Sidekick Pulse workloads* Whether Liquid could become frontier-scale with enough compute, and why Shopify remains pragmatic and merit-based about model choice* Who Shopify is hiring right now across ML, data science, and distributed databases* The Sydney story at Bing, why its personality was not an accident, and what Mikhail learned from deliberately shaping AI character early onMikhail Parakhin* LinkedIn: https://www.linkedin.com/in/mikhail-parakhin/* X: https://x.com/MParakhinTimestamps00:00:00 Introduction: Mikhail Parakhin, Microsoft, and Shopify00:01:16 Why Shopify Is Talking More About AI00:02:29 Internal AI Adoption at Shopify and the December Inflection00:06:54 Token Budgets, Jensen Huang, and Why Usage Metrics Can Mislead00:10:55 Why Shopify Built Its Own AI PR Review System00:12:38 AI Coding, More Bugs, and the Real Deployment Bottleneck00:14:11 Why Git, PRs, and CI/CD May Need to Change for Agents00:18:24 Tangle: Shopify's Reproducible ML and Data Workflow Engine00:21:19 Why Tangle Is Different from Airflow00:26:14 Tangent: Auto Research for Optimization and Experimentation00:30:07 How Tangent Democratizes Experimentation Beyond ML Engineers00:33:06 The Limits of Auto Research00:36:36 Why Tangle, Tangent, and SimGym Compound Together00:37:20 SimGym: Simulating Customers with Shopify's Historical Data00:42:47 The Infra Behind SimGym00:46:00 Why SimGym Gets Better with Real Customer History00:47:30 Counterfactuals, HSTU, and Modeling Merchant Trajectories00:51:55 CRPs, Clustering, and Category-Level Customer Behavior00:53:30 UCP, Shopify Catalog, and Identity Linking00:55:07 Liquid AI: Why Shopify Uses Non-Transformer Models00:59:13 Real Shopify Use Cases for Liquid01:03:00 Can Liquid Scale into a Frontier Model?01:09:49 Hiring at Shopify: ML, Data Science, and Databases01:10:43 Sydney at Bing: Personality Shaping and AI Character01:13:32 Closing ThoughtsTranscript[00:00:00] swyx: Okay. We're here in the studio, a remote studio, with Mikhail Parakhin, CTO of Shopify. Welcome.[00:00:08] Mikhail Parakhin: Thank you. Welcome.[00:00:10] swyx: I don't even know if I should introduce you as CTO of Shopify. I feel like you have many identities. Uh, you led sort of the, the Bing ML team, I guess, uh, uh, or ads team. I, I don't know, I don't know, uh, you know, it's, uh, people va-variously refer you as like CEO or, or, uh, I don't know what that, that, that said previous role at Microsoft was.[00:00:29] Mikhail Parakhin: Uh, that was... Yeah, my previous role w- at Microsoft was the-- I actually was the CEO of one of Microsoft's business units, which included, as I, you know, as we discussed, all the things that people like to laugh about, uh, including Windows and Edge and Bing and ads and everything.[00:00:47] swyx: Yeah, yeah. What a, what a, what a wild time.You've obviously, uh, done a lot since you landed at Shopify. Uh, one of the reasons I reached out was because you started promoting more sort of internal tooling, uh, primarily Tangle, but also a lot of people have seen and adopted Tobi's QMD, uh, and obviously, I think, uh, Shopify has always been sort of leading in terms of, uh, engineering.I think more-- it's just more recent that you guys have been more vocal about your sort of AI adoption. Is that, is that true?[00:01:16] Mikhail Parakhin: Well, I think AI tools in general are fairly recent development, uh, and we've-- Shopify, you know, at this stage of its development, we're developing AI in-in-house and other, uh, building tools that use AI and, you know, interfacing with the wider AI community, uh, you know, are on the sort of the, uh, runaway trajectory.So it just did by sort of natural byproduct. We, we talk about it more also. We just, uh, just even yesterday, Andrej Karpathy was famous in tweeting about, oh, are there some, uh, ways, uh, that, that you can organize your agents to store the data and then, uh, look up the data so that you don't have to research or, or lose context every- Yestime. And a little bit tongue in cheek, I tweeted that, “Hey, we've, we've done it much earlier, and we even have different approaches, Tobi and I.” Tobi, of course, is a big fan of QMD, and I'm more of a SQL, SQLite fan. But, uh, yeah, very similar things that we've already done here. The point is, yeah, we're very dynamic, you know, explosively growing company, and we have to be at the forefront of AI adoption, obviously.[00:02:29] swyx: Yeah. Yeah. Um, you, your team kindly prepared some slides actually that we were gonna bring up on to, uh, the screen. I think I can, I can screen share, and then we can kind of go through some of the shocking stats that maybe, maybe put some numbers to what exactly is going on. So here we have, uh- An internal AI tool adoption chart.What are we looking at here? What ?[00:02:54] Mikhail Parakhin: Yeah, this is very interesting statistics. Uh, this is number of daily active workers, you know, think of, uh, DAO, basically the active users of-[00:03:05] swyx: Yeah ...[00:03:05] Mikhail Parakhin: AI tool as a percentage of all the people in the company, right? And then- Yeah ... different AI tools. And, uh, you could see two things here is that one is the green is total.Uh, green is just total. So you could see that it approaches really % by now. It's hard not to do your job now without interacting deeply, at least with one tool. You could see another interesting thing is just as many people commented in December was the phase transition when suddenly models gotten good enough that, that everything took off and started growing.Uh, it, it was many people noticed that the thing is that small improvements accumulated into this big change in Sep- December roughly timeframe.[00:03:52] swyx: Yeah.[00:03:52] Mikhail Parakhin: The other thing I would claim you could see is that, uh, CLI-based tools and tools that don't require you to look at the code becoming more popular, and you could see, yeah, various versions of, uh, Cloud Code and Codex and Pi and internal development tools taking off.Uh, exactly, yeah, uh, and blue is our River, just internal agent for coding, where tools, uh, that require IDEs such as, uh, GitHub, Copilot or Cursor, they're not exactly shrinking, but they're not growing as fast. Like, uh, red, red line is, is the IDE kind of tools. So you could see that they're, they're not experiencing as, as fast of a growth.[00:04:37] swyx: As I understand it, basically, every employee has their choice, right? Of choose whatever tool you use, and then you're just kind of doing a, a daily sur-survey or something.[00:04:47] Mikhail Parakhin: Exactly. And, uh, we- Yeah ... the, the push is to get your job done, you can use any tool, and we effectively fund unlimited tokens for everybody.Uh, we, we do, we do try to control the models that, uh, people use, but from the bottom, not from top. Like we basically say, “Hey, please don't use anything less than Opus four point six.”[00:05:09] swyx: Oh .[00:05:10] Mikhail Parakhin: Some people, some people end up using GPT five point four extra high. Some people use Opus four point six. Um, uh, you know, uh, there are some, uh, there are plus and minuses in going for full one million context window versus not.But, uh, we try to discourage people from using anything less than that.[00:05:28] swyx: Yeah, yeah. Got it, got it. Uh, I mean, uh, that's, you know... The, the next chart here, it really kind of shows the expansion and the sort of December twenty twenty-five inflection, right? That, uh, people are using a lot of tokens. I think it's also really interesting that no one was kind of abusing it in twenty twenty-five.Like it was- Had comparatively, uh, to this year, there was almost no growth. I mean, it's still like, you know, probably, probably gave fifty percent.[00:05:56] Mikhail Parakhin: Yeah. This is just a different scale. It's still exponential- Yeah, yeah ...growth at just a different- ...rate of expansion. Uh, there was inflection point, and Sean, I would claim the, the super interesting part here is that you could see that the distribution becoming more and more skewed.Yes. The top percentiles grow faster. So that means- Yeah ...the people in the top ten percentile, they, their consumption grows faster than seventy-five and so forth. So, uh, the distribution skews more and more towards the highest users, which is... I don't know what it tells me. It's like it feels not ideal, to be honest.Or maybe it's okay. We'll see.[00:06:36] swyx: Why does it feel not ideal? Is, is it because of, um, quantity over quality, or what's the concern?[00:06:42] Mikhail Parakhin: Because take it to the limit. That means, you know, if, if this rate of separation continued- Ah, yes ...a year, there will be one person consuming all the tokens. So it's just, it's kinda strange.[00:06:54] swyx: Yeah, I mean, um, uh, I, I think internal like teaching and all that, uh, will, will help sort of distribute things more widely. But in, in the early days, of course, the people who are sort of more AI-pilled will obviously find more ways to use it than the people who are less AI-pilled. Maybe let's, let's call it that.I'll just, I'll just kinda quickly, uh, pause from the, the... You know, we will go back to the rest of the slides, but I just wanna, um, review, you know, there are a lot of CTOs of, of large companies like yourself where they're all considering some kind of token budget, right? Like I think it's something, something that Jensen Huang has been talking about, where like if your 200K engineer is not using 100K of tokens every year, like they're, they're underutilizing coding agents.Of course, Jensen Huang would say that, but like it seems a very quantity over quality approach and like some, some people are basically saying like, well, is this comparable to judging engineer quality by lines of code, right? Which we also know is like kind of flawed, but better than nothing. So I, I don't know if you have like a sort of management take here on, on how to view this kind of, uh, metrics.[00:08:02] Mikhail Parakhin: Well, I mean, you're, you're baiting me. I, I like... This is my favorite topic. Uh, if you let me, I'll probably talk for two hours on just this. I have a lot of things to say. Like I do think Jensen gotten a lot of bad press saying, “Oh, of course you're, you know, this, uh, the- ...the cake seller says you don't need enough cakes.”You know? Like, of course. Uh, but, uh, I actually, uh, think that's undeserved. I think he, he's actually right. Uh, I do think- He,[00:08:33] swyx: he's directionally correct.[00:08:35] Mikhail Parakhin: Yeah. Yeah. He's directionally correct for sure. Uh-[00:08:37] swyx: Who knows what the right number is? Yeah.[00:08:39] Mikhail Parakhin: The thing that I do Uh, want to say, and this is something that we learned through trial and error and very important is like two things.One is that it's not about just consuming tokens. Uh, you can consume tokens and, and in fact, the anti-pattern is running multiple agents, too many agents in parallel that don't communicate with each other. That's almost useless, uh, compared to just fewer agents and burns tokens very efficiently. Uh, setting up the right critique loop, especially with the high quality models, where one agent does something, the other one, ideally with a different model, critiques it, uh, suggests ways to improve it, the agent redoes it with this critique and, and so it takes much longer.So people don't like it because latency goes up. You know, they, they have to wait until this debate is happening. But, uh, the quality of the code is much higher. And another thing, just since you mentioned like, look, uh, uh, yeah, the overall budget is just like, uh, lines of codes. Lines of codes are exploding for everybody right now, or partially because AI is really mover balls, but partially just because AI can write a lot more code, you know, doesn't get tired.And so you have to have to have a very strong narrow waist during PR review. Otherwise, just the number of bugs will go through the roof. It's, uh, it's this unexpected consequence of the just volume trumping everything. I would claim by now good model writes code on average with fewer bugs than, than the average human.But since they write so much more of it, like more of it will make it into production. So you have to- You still[00:10:26] swyx: have[00:10:26] Mikhail Parakhin: more bugs. Yeah. Have to have a very rigorous PR reviews, also automated of course. But, uh, yeah, that to spend a lot budget there. Like this, this for me, for me, actually, the important metric is the ratio of budget spent during code generation versus, uh, spent, uh, expensive tokens like GPT, uh, five point four Pro or, uh, uh, Deep Think from Gemini, you know, checking on PR reviews.[00:10:55] swyx: Yeah, totally. Uh, I noticed in your chart you didn't have any review tools. Do you just use like, like let's say a Claude code to review tools? Or do you have another set of review tools like the Greptiles, the Code Rabbits, uh, Devin Reviews has a review tool. I don't know if you've had those specialist review tools.[00:11:13] Mikhail Parakhin: You are a little bit jumping on my store tool right now because the graphs I was only showing public tools. Uh, uh, the-- I haven't found a good PR review tool that, that does what I think should be done. And, uh, partially my, my thinking is because it's so... It just goes against both what people feel like emotionally they prefer and, uh, some of the, uh, you know, frankly Even business models that, that the companies run.At peer review tool, uh, time, you want to run the largest models. That means, I don't know, Codex or, or, uh, Cloud Code is not gonna cut it. You need to have pro-level models if you really want to, uh, stand the tide of bots from going into production. And you need us to spend a lot of time, the models taking turns, but you don't want, like, a big swarm of, uh, of, uh, agents.So in fact, you end up in a different dual-dualistic world where you generate not that many tokens. You, in fact, generate few tokens, but it takes f-a long time because these are expensive models taking turns rather than many, many agents trying to do many things in parallel. So that's, that's why I feel like I haven't found good tools, so we are using our own for peer review for now.[00:12:33] swyx: Yeah. Yeah. I mean, uh, I think a lot of companies are building their own, uh, especially to their needs, right?[00:12:38] Mikhail Parakhin: Mm-hmm.[00:12:38] swyx: Um, I, uh, you also have a chart here going back to the slides on, uh, PR merge growth, where we're now at thirty percent, uh, month on month rather than ten percent. Uh, and also the, the estimated complexity is going up.You know, this is productivity, right? ‘Cause y- presumably there's more stuff going into the code base and more, more features getting worked on. I'm curious about the backlog, right? Like the, the, the-- I actually don't mind a pro-level model taking an hour or two hours to review my PR, because I've dealt with humans who take a week to review my PR, right?And I keep pinging them on Slack, “Hey, hey, review my PR.” So, you know, I think there's some trade-off here where, like, it still doesn't make sense.[00:13:18] Mikhail Parakhin: Exactly. That, that's exactly m-my point. Uh, that on one hand, you can tolerate longer latencies at, uh, PR. On the other hand, like right now, the real problem is not in spending time waiting for PR.It's real problem is since there's so much more code than- Yeah ... uh, probability of at least some tests failing going up, and then you, like, keep de-failing, then you have to find the offending PR, evict it, retest it without that PR, and so deployment cycle becomes much longer. Uh, so it actually, in terms of the overall time to deploy, it's total time savings if you spend more time on a longer model, like thinking for an hour, because then, then you, you don't have to spend all that time during testing and rolling, you know, rolling back the deployment.[00:14:03] swyx: Yeah, totally. That's still worth it. You know, you don't look at the individual, look at the aggregate, and look at the, the, the change in the aggregate system.[00:14:11] Mikhail Parakhin: Exactly.[00:14:11] swyx: I'm kind of curious if, like, there's this PR mentality and, like, c-- the, the, the CICD paradigm will be changed eventually. Some people are like, obviously a lot of people want new GitHub, but I even wonder if, like, Git is the problem, right?Like, is that the bottleneck? Is the concept of a PR a bottleneck? Do you guys use stack diffs? I don't know if, uh, that's a, like, a merge queue stack diff type of thing.[00:14:34] Mikhail Parakhin: We, we use, we use Stacks, we u- we use Graphite. We worked with, uh, Graphite a lot. Uh, so we use Stack, uh, PRs. I think, uh, like that's clearly the overall CICD in general, and the interaction with the code repository right now is the, clearly the sort of the, the main issue and the bottleneck for us, uh, and highest top of mind.I would say we probably need a different metaphor or different whole design of how to process it in new agentic world. I haven't seen anything dramatically better yet. I, I think everybody right now is just trying to keep their head above the water ‘cause, ‘cause there, there's so many PRs and then everybody's CICD pipelines start creaking, the, the times are increasing, the number of bugs slipping by increasing, and you have to, have to clap on down.And so we are a little bit in this situation when we need to first stabilize that story and then start thinking, hey, what, what it could be a completely different and new world, which I haven't... I know some people working on it. I haven't seen something, like anything super compelling yet, but clearly the old thing were designed for humans will need to be morphed into something new.[00:15:53] swyx: One of the thing that I, I think about is kind of like the merge conflict is basically a global mutex on the whole system, right? And in, in hu- in human organizations, we do have something like that. It's the company standup. But like, other than that, it's like it's actually fitting for us to be somewhat decentralized, somewhat plugged into one stream of information source, but somewhat lossy.Like it's okay, you know, that, that not every delivery is like atomic consistency. Like we're not dealing with a database sometimes.[00:16:27] Mikhail Parakhin: This is a very good point, uh, because since humans don't write code too fast, you know that global mutex is not too bad. Once you-[00:16:36] swyx: Yes ...[00:16:37] Mikhail Parakhin: start writing code at the speed of machine, it becomes the, you know, the bottleneck.Then what do you do? Maybe, and I can't believe I'm saying this because I, I'm long-- lifelong opponent of, uh, microservices, and I always thought that was, like, a really bad idea. And now that you're saying it, like, maybe in new guys like microservices will make a comeback, you know, because then you, you can ship things independently in tiny things and, and the managing all that complexity automatically will be much easier.I don't know. Like, we'll s-- we'll have to see.[00:17:10] swyx: Yeah. I mean, I don't know what the Microsoft or, or Shopify thing is, but I, I read this paper from Google where they have a monorepo that deploys into microservices, right? And then, uh, the other concept that I think about a lot is the Chaos Monkey concept from, from Netflix.Being able to create, like, this robust system where, um, uh, you know, you, you have the service discovery, you have the, uh, the independent, independent microservices discovery and, and, uh, you know, probably going to be a fair amount of duplication. That's how an organic system sort of scales, uh, that, that you have that...I don't know how you call it. Slack? Robustness? Depend-- uh, d-duplication. I, I, I forget the-- I, I'm-- And this-- those-- these are not exactly the terms- Hmm ... I'm looking for, but I c-can't really think of the words. Okay. I was gonna go into Tangent and Tangle. Uh, so, uh, we, we sort of discussed the overall stats that, uh, Shopify has.Uh, but, you know, I, I think some, some pretty cool stuff that you guys are working on is your ML experimentation, uh, and your, your sort of auto tr-research training pipeline. Presumably you're much closer to this one because it's, it's a sort of personal hobby of yours. How, how would you explain them in, together?I thought we have a slide that, like, uh, has the s- the system diagram.[00:18:24] Mikhail Parakhin: Yeah. Tangle first and then Tangent as a-[00:18:27] swyx: Yeah ...[00:18:28] Mikhail Parakhin: as a thing on top of Tangle. And, uh, Tangle is the third generation, I claim, of, uh, systems of, uh, running any data processing, but a bit with a skew for ML experiments, but not necessarily. Any sort of data processing tasks where you need to iterate, share, and you have scale so that you want maximum efficiency.You know how, like, normally you would work, you would-- Imagine you're a data scientist or an ML practitioner, you would get Jupiter notebooks or, or maybe you would get, uh, you know, Pyth- your Python scripts, and you would manage the data, and you produce those TSV files, and you put them in some JFS or something.Then you would notice that, oh, it has this, uh, weird missing values. You go and write another script that, uh, goes and replaces them with, uh-[00:19:20] swyx: Ah ...[00:19:21] Mikhail Parakhin: dash S. And then, then you, then you run some, some, uh, “Oh, I need to filter bots.” And so you run some light GBM model that, uh, removes the bots. And then, then you like-- And then you, you kind of like get into shape, and then you start experimenting, and you run multiple experiments, and then you're like, “Oh my God,” like, “this experiment is worse.”You undo, and you cannot get to previous result. And like, “Ah, what did I do?” Like that. Again, then, then you finally like get everything working. Then you like start throwing it over the fence to production. You, you replicate it, those things don't work, and then sometimes you like don't notice that you forgot some feature naming and the, the features don't match.But then, like imagine you, you did everything, and then six months later you're like, have to repeat it because now there's more data, or you wanted to do another pass, and you're like, “What, what did I do?” Or like, or like, “This script crashes now,” or the, “the path has changed.” And then, then you're trying to, like you spend another month just doing ar- digital archeology on your own, you know, history, right?Now multiply that by many, many teams. Now imagine you got an intern that you wanna ramp up. Now you have to show that intern, “Oh, you know, look, here's the folder, there's the scripts, you know, ask your cloud agent to do, and then, uh, to, to figure it out.” And then cloud agent does something, and then you're, “Ah, yeah, right, right, it was the wrong folder.I forgot to tell you, I actually have this other thing I forgot myself.” And, and that's, that's the, like, the daily life we all, uh, all know it, uh, if, if you're a data scientist, machine practitioner, ma- machine learning practitioner or, uh, or even like any data managing, uh, person.[00:21:00] swyx: Yeah. So I, I used to do this, uh, f- uh, on the quant finance side, uh, in, in my hedge fund.So we did this before Airflow, and then, uh, obviously Airflow came along and, uh, then more recently Dagster, uh, I would say is like, in my mind, what I would use for that shape of problem, uh, where you had to materialize assets and create a pipeline.[00:21:19] Mikhail Parakhin: And that's, that's very good segue because... So Airflow is great, but Airflow is more about you, you have something and you wanna repeatedly run it in production on schedule.It's less about you as a team developing things and being able to share, and you grabbing the standard pipeline and saying, “Hey, I wanna change this tiny little component in the huge sea of data processing, and I don't wanna-- I wanna run ten experiments on this, and I wanna do hyperparameter optimization.”All that is very hard to do with Airflow. It's very easy to do with Tango. Tango is m- more about, it's everything about group of people Running experiments, it might be agents too nowadays. Uh, running experiments cheaply, collaborating, sharing results. Uh, you don't need to understand fully. You, you grab-- you clone somebody else's experiment or somebody else's pipeline, uh, run, uh, change small piece, run it, be, like, get it to production state, and then ship in one click.So then the... You don't have to port it into any other system to, to run in production. You can just run the same experiment. It's, it's fully production ready. And, and it's, uh, it has lots of... Again, as I said, it's third generation system. The original one was, I would claim there was Ether and then, uh, at least in my career, Ether was the first, first, uh, that pioneered this type of approach.And then there was, uh, Nirvana, which, uh, uh, at Yandex, which did kind of sec-second take on this. And now this one aggregates the, the learnings from all of those and, and Airflow as well to, to get to the state where you try it, it, it feels kind of magical. Uh, ‘cause now everything is based on content, uh, hashes.So even if the version changed, but if the output didn't change, nothing is being rerun. It's very efficient. If you... Multiple people start experiment that needs the same sort of data preprocessing, it's not repeated multiple times. It's automatically done only once. If you start ten experiments that all require, you know, some, some data preparation first as the first step, and you don't have to coordinate for that.Like, you don't have to know that other people are starting it. You now, it's very easy compos-, uh, composability, any language you can u- uh, you wanna use, and it's very visual. So you can see immediately, you can edit it easily, you can assemble small things with just even mouse clicks if you want to, and, uh, share, clone.And everybody knows also it's fully kind of static in the sense that we rerun it second time, it will exactly have the same results. Like, you will never have to do digital archeology. So full versioning and everything is also there.[00:24:06] swyx: Uh, so, so people can, uh... It's open source. Go to the GitHub repo and, and, uh, check it out.Uh, and it is also a really good, uh, blog post about it. I think all these is, like, really appealing. The, the, the, the thing that I think sells me the most about it is that, um, sort of development to production transition, right? Which I think, um, a lot of people haven't really solved that, uh, strictly, right?Like, we develop really, really well in, in Python notebooks, but then, you know, that's obviously not a sort of production ready process. I think that, like, any way in which that is solved, I think is, is very appealing. Then the other thing that you mentioned, which also raised my eyebrows, was content-based caching, which you mentioned is, is, um, you know, is ve-very much, uh, um, a sort of efficiency measure about, uh, you know, just like recalculation only on, on sort of content addressing Which I think makes sense.Uh, it surprised me that the savings could be this much, but maybe I just haven't worked at your scale where there's so much duplication, uh, that people just rerun because they change a single ID upstream.[00:25:10] Mikhail Parakhin: It does, yeah. But it's not only you rerun. The, the main savings are coming from the fact that you ran it, you got your job done, and you moved on.Then- Yeah ... somebody else in some department you don't know existed runs the same task, but on a newer version.[00:25:27] swyx: Yeah.[00:25:27] Mikhail Parakhin: Like right now, you can't, in, in most of the organizations, you can't even find out about it so that you can't even measure that you're spending that time twice, right? Here- Yeah ... if everybody's on Tango, that's detected automatically and detected that the output is the same.And then for that person, all it looks like is like experiment just suddenly moved, jumped forward, right? Uh, uh- Yeah ... so that's because, because the, there's network effect of multiple people helping each other.[00:25:51] swyx: Yeah. This is one of those things where it's designed to be a platform from the beginning rather than an individual developer's tool from the beginning, right?And, and everything's gonna streams down from there. That is the sort of Tango, uh, orchestrator, and it's, it manages jobs. We've seen a few versions of this, and this is obviously, uh, uh, the sort of, uh, unique approaches that you guys have, have, uh, figured out. And then there's Tangent.[00:26:14] Mikhail Parakhin: Yeah. And Tangent is basically an automatic auto research loop that can help and kind of do your work for you.Uh- ... you know, uh, effectively, effectively, Andrej Karpathy recently popularized it with auto research. Yes. Remember he said like he was, uh, speed running this, uh... Yeah, uh, you know the story. The, here we're basically bringing the same capability into Tango so that, uh, the, uh, Tangent can analyze it. It's just an agent that can run multiple experiments, figure out what can be changed, and keep on rerunning it, keep on modifying until, uh, maximizing some goal, some loss function, whatever you need to, to achieve.And in general, I would say if you're not using auto research-like approach in whatever you do, like literally whatever you do, then you're missing out. We saw at Shopify that taking like a wildfire, anything where you can put measurements can be done dramatically better. Our-[00:27:19] swyx: Mm-hmm ...[00:27:20] Mikhail Parakhin: uh, speed of, uh, templatization HTML, uh, completely new UX tem- uh, templatization of, uh, reducing latency for liquid themes.Uh, we-- Our, uh, search, uh, recently we moved from It's hard even, uh, quote from eight hundred QPS to forty-two hundred QPS with the same quality just by pure optimizations and not a research loop that kept running and changing code in our index serve on the same number of machines, just increasing the throughput.We, we managed to improve the quality of gisting and machine learning process. Uh, you know, gisting is the prompt compression technique that[00:27:59] swyx: allows for[00:28:00] Mikhail Parakhin: lower latency and, and lower and, uh, actually higher quality slightly. So like literally whatever different walks of life, and it doesn't have to be AI related.Uh, we, we had a reduction in, uh, storage because the agents would go and find data sets that clearly are derivative, uh, and then you don't need to store things twice. You know, we, we, we found somewhat embarrassingly that it was one of the largest tables was hashing random IDs into another random ID, and we literally- Oofput only one. So it was translating, yeah, two random IDs hashed[00:28:36] swyx: into[00:28:37] Mikhail Parakhin: each. So, so[00:28:37] swyx: it has access to the code as well, so it can, it can check the, like what, what the hell is it doing?[00:28:42] Mikhail Parakhin: So there, there cou- it could be run in two levels. You, uh, you know, at the superficial level, it could just use ex-existing components and, uh, reshuffle them.Uh, you know, like you can grab- Yeah ... uh, XGBoost, and you can grab some, some Py- PyTorch module, and then can grab some, you know, grab another tools and, and combine them. At a deeper level, since Tangle is all sort of CLI based underneath you, every, every component is a wrapped really CLI, uh, call and a YAML file, it can analyze code and create new components and, and, uh, keep on iterating as well.So, so you can, you can both have quick modifications of existing t- uh, pipelines with the, with components that are already there pre-baked, or you can create new components, uh, and-[00:29:29] swyx: Yeah ...[00:29:29] Mikhail Parakhin: keep iterating on those. So auto research is, again, this is probably the, the thing I was excited the most in the last two months happening, and we see it taking like, like totally like a wildfire.Just, uh, everybody, every day, every... well, every day, every minute, I would, uh, have somebody Slack message saying, “Oh, look how much better I made it.” And, uh, it's all throughout the research.[00:29:53] swyx: Is this democratized in some way in, in the sense that like is it your ML, uh, engineers and researchers doing this, or is it your regular PMs and software engineers also have the ability to auto-- to use Tangent?[00:30:07] Mikhail Parakhin: This is an awesome question. Like, Tango in general and Tangent in particular are extremely democratizing. Like they- Yeah ... they are the main tools for- ‘Cause I don't[00:30:15] swyx: need the details.[00:30:16] Mikhail Parakhin: Yeah. Exactly. Initially used by ML and AI engineers, but then literally, as you said, PMs are like the highest user right now is one of PMs on our org, uh, Sartak and he was, he was number one by, by usage of, of this ‘cause they're just, uh, energetic and knowledgeable, and now it, it unlocks a lot of capability where you don't have to co-change code manually.[00:30:39] swyx: I mean, I mean, because it kind of cuts out the ML, ML engineer from the process because the, the, the PMs have the domain knowledge and the ability to think about, uh, from first principles about, okay, what, what results do I want? And they can-- they even have the access to the data that, that needs to go in.So it's like in some ways, like this is the magic black box that we've always wanted for, for training and, and for, uh, I guess, uh, uh, hill climbing, whatever.[00:31:04] Mikhail Parakhin: It's basically cloud code for your AI development- ... uh, situation, right? Like now, now you don't have to know exactly how algorithms work. You can just, uh, bring your domain knowledge and expertise and product knowledge and iterate within Tangent until you've gotten the results that you need.[00:31:21] swyx: In my previous roles, every time that someone has pitched AutoML, you know, I've always been like, “Uh, this is not, this is not gonna work. It's, you know, it's, it's always gonna be a flop.” Somehow it's working now. I mean, presumably the answer is now we have LLMs and it's good enough, right? It's, it's an emergent property that we can do auto research, but like, it doesn't feel that satisfying that how come we didn't do this before, right?Like we just did like parameter search and like, I don't know. That's maybe that's it.[00:31:48] Mikhail Parakhin: Yeah. Bayesian optimization and hyperparameter optimization was, was the one that, or facet of AutoML that was used very actively, which incidentally also built into, uh, Tango. But, you know, I know Patrice Simard very well, and, uh, he was such a, uh, such a proponent of AutoML, and he put, like literally spent careers trying to democratize it.Without LLMs, it just turned out to be very hard. Like it, you, you would have flexibility within certain narrow domain, but it was hard to wider scale, and now with LLMs suddenly it's like magic wand, and so suddenly everybody- ... is an AutoML expert.[00:32:28] swyx: Yeah, I, I think it's multiple things, right? Like I'm, I'm just gonna bring up the, the, the chart again, right?Like LLMs can do the monitoring very well. That is the very potentially unbounded, super unstructured. It can do the analysis very well, it can do the... Uh, and basically it is much more intelligence poured into every single step. Uh, there's maybe nothing structurally changed about AutoML, but this is just m-more intelligent and more unstructured.[00:32:53] Mikhail Parakhin: Exactly.[00:32:54] swyx: Any flaws that you've run into? Like everyone is like drinking the Kool-Aid, oh my God, time savings, uh, you know, performance improvements. Like what, what, uh, issues have you have, uh, come up?[00:33:06] Mikhail Parakhin: This is really cool. It's not a solution to all the world's problems for sure. The limitations are usually the ones I-- And this is where we get into a bit of a subjective territory.Uh, I can only share what I've, I've seen so far, and I'm sure the situation, uh, is changing, and, you know, maybe after I say it, like many people will reach out and say, “Hey, what about this?” And you don't know that, and then, then we'll be probably right. But what I've seen is auto research is very good at doing kind of obvious things that you don't have bandwidth to do or you didn't notice or maybe you're not aware of like the-- some standard practices.It is not good at doing something completely out of distribution, something that, you know, you have to think for, for multiple days, uh, and, and do something like none of this. So, so it's, uh, I, uh, set an experiment once, uh, on, on my sort of, uh, hobby thing, and I let it run for, uh, ended up, uh, several weeks run, uh, you know, it's like full production kind of scale, so it, you know, slow runs and, and it ex-- it performed in the end, uh, over four hundred experiments, and only one was successful.I'm like, “Okay, that's, that's good.” But-[00:34:18] swyx: But it saved time.[00:34:19] Mikhail Parakhin: Yeah, I saved time. Like it, it was the, that thing. Yeah, if I, if I were doing four hundred experiments myself, my betting average, as I said, would have been much higher, I'm sure. But also, first of all, it would take me like three years to do four hundred experiments.And, uh, I didn't have to do them. Like the machines were just, uh, the price of electricity did that. So, and I got one improvement, uh, that in, uh, my, my-- Honestly, when I was starting that experiment, my thinking was to go and show that, “Hey, Andre, maybe you just don't know how to optimize.” And I was super smart because in, in my pro-problem, it was optimized for many years, and it was like fully improved.Uh, and I didn't expect it, you know, auto research to find anything at all. Yet it did. So instead of making fun of Andre, I ended up, uh, a big, big supporter. Yeah, that's exactly the tweet. Yes.[00:35:10] swyx: You and Toby really, really go back and forth on-online a lot, which is really funny. Uh, think of it as, as an eval for the optimalness of the code it's running on.Uh, it's almost like it reminds me of like a Kolmogorov complexity thing, but, uh, I guess it's-- there's some optimal thing that you're trying to sort of reduce down to, I guess. Um, and so, so you, you, you know, you should congratulate yourself that you had, uh, you know, uh, ninety-nine percent, uh, optimality.[00:35:36] Mikhail Parakhin: Exactly, yeah. I think Andre really deserves a lot of credit for popularizing this approach. This is, uh, this is incredibly, I think, powerful and cool and You know, the, uh, even him, him just mentioning it led to a lot of gains in a lot of places in the industry, so we should be thankful.[00:35:56] swyx: Yeah. I think he also has a just...I don't know what it is. Like, um, you know, it, it is a simple self-contained project that people can take and apply to other things, which is, is, is one thing, but also just the name. Just like somehow no one, no one managed to call their thing auto research. It's just naming things is very important. I think that that is mostly, uh, our coverage of Tango and, and, uh, Tangents.I think obviously, you know, there's a lot of, uh, ML infra at, at Shopify that people can, uh, dive into. We're about to go into SimGym, but before I do that, any, any other sort of broader comments around this whole effort? Like where is it, where is it leading to?[00:36:36] Mikhail Parakhin: As a segue to SimGym, like all those things start composing strongly.And, uh, you could see a huge unlock when you can look at each one of the tools and, and you see, oh, they're extremely useful. Uh, Tango is useful by itself. Auto Research is useful by itself. SimGym is useful by itself. If you combine all three, you create like synergetic effect. I think that's why we wanted to even, uh, cover them today is because this is something that if you go back even, you know, five years ago, would've been unthinkable.Uh, replicating that, uh, would, would be either incredibly costly or impossible, right? With probably thousands of people are required.[00:37:20] swyx: Well, we have serverless human, uh, serverless intelligence, right? Like, uh, so yes, you do have thousands of hu-- of, of intelligences, not just, not humans. And that's, that's close enough, right?Even if they're not AGI, they're, they're close enough to do the, the task that you need them to do. And, and, you know, that's, there's plenty for, for a lot of routine work, knowledge work. Okay, let's get into SimGym. Um, this is one of those things I, I was surprised to see actually it's apparently your, uh, one of your most popular launches, and I think something that, uh, I think Sim AI, I think Yunjun Park, who did the Smallville thing, there's a very small cottage industry of people trying to do like the simulate customer thing.I think a lot of people maybe don't super trust this yet because they're like, well, obviously they would just do what you prompt them to do, right? But maybe just think, uh, tell us about the sort of inspiration or origin story.[00:38:10] Mikhail Parakhin: That's exactly actually the thing I wanted to cover, because if you don't have the historical data, all you can do is prompt a-agents in a vacuum, and they will do exactly what you prompt them to do.In fact, when I first proposed it, and this is a bit of, um, my brainchild initially, if I, I can boast, even Toby said like, “But wouldn't they, they just repeat what, what you tell them?” And, uh, but I'm like, “Yes, except Shopify has decades of history of how people made changes and what there is, uh, there, what it resulted in terms of sales.”So now what we can do is we can-- we have this... It's not, it's a noisy data. There's a small, usually websites, uh, you know, like things, things are never in isolation. It's almost never AB experiment. It's always AA experiment when there's has two meanings, but basically, you know, in different time you run two different things.But if you aggregate in general, uh, like everything together, and you apply, uh, denoising and collaborative filtering like approach, you can extract a very clear signal. And then you can optimize your agents. And that's why it took so long. It took almost a year of that optimization of just us sitting and fiddling, and, and we had this internal goals of correlation of hitting-- internal goal was to hit zero point seven correlation with, uh, add to cart events, for example.Like that, that if we run real AB test experiment, that it should, it should go and, and rep-uh, replicate, uh, same sort of success that, that humans had or lack thereof. And it, it took forever, and I don't think that's easily replicatable because, uh, like who else would have that data? You have to have this historic, you know, decades, uh, worth of data.And now, now the, like the other thing you need is in-infrastructure and the scale, right? Because, uh, w- again, what we found, uh, stat sig results, you need to run a lot of simulations, a lot of agents, and, and it's-- Those are expensive things. Like you're, you're making actions in the browser because you want a real friction.You want to, to be able to get the image like of what humans will see because you wanna, uh, detect effects like, “Hey, if I make my images larger, will I have more sales or l- uh, fewer sales?” And like usually people's intuition here, by the way, is that I increase my images, I will have more because they look nicer.You know, designers all look sparse and big images. Like usually your sales tank, right? But, but, uh, you know, from HTML, all the characters look the same only the, the size tag looks different, right? So it's very hard. So you have to take visual information, you have to run this in simulated browser environment on the big farm and, and of course, you have to have, uh, like very, very expensive model, good model with multi-model model.So all this it's-- is what's taken so long and, uh, to share my personal fail a little bit there, Sean, is like, you know, we always had this bias to-- for like large company bias. You know, we always, uh, whenever you-- we do, we're like, “Hey, we'll run an experiment,” right? We make, make a change, and we will run an experiment and then, uh, see, uh, see which one's better or like, “No, this is worse,” and most of them are worse, so you discard it and keep iterating, hill climbing.And we're like, “Oh, like smaller merchants, they cannot get stat sig results. They cannot really run experiments simply because, you know, in a week there would be not enough data for them.” So we thought from this perspective. What we didn't realize is that most people don't have A and B, they just have one thing, and they need suggestions of What A and B should be.So, uh, we first build this, hey, we run simulation on two separate teams and, and, uh, say, “Hey, which one is better?” We then morphed it into, and very recently just released it, when you have just your site, your theme, we run over it and we say, “Hey, here's what predicted values of, of, uh, uh, conversions are, and here's how we think you should modify it to increase your conversions.”And then circling back to what you started with, the proof is in the pudding. Like, if we are not correlating with reality, like, people will not be using it. And, uh, thankfully, we see literally every day more users than the previous day. So, so right now, uh, right now- It's working. Yeah. I'm-- Right now my problem is how to pay for it all because the so our major thing is how to optimize the LLMs, do distillation, how to run the headless browsers, uh, and handful browsers, uh, uh, cheaper so that we can accommodate the increase in traffic.[00:42:47] swyx: Yeah. I, I understand that you, uh, you published a lot of technical detail at GTC, so I was just gonna bring it up a little bit. I think s- was this in, in con-conjunction with some kind of GTC presentation? Or something like that, right?[00:42:59] Mikhail Parakhin: Well, we, yeah, we, we did it in several place, but yeah, we had the engineering- Yeahblog, uh, as well. Yeah.[00:43:05] swyx: Yeah. So you're running, uh, GPT OSS. Uh,[00:43:08] Mikhail Parakhin: the, this is an older version. You know, now we run multimodal model. But yeah- Yeah ... GPT OSS, we still run GPT OSS as well for[00:43:15] swyx: And then you have the VMs, and you also have browser-based. I really like this one where it you said, “It violates almost every assumption that standard LLM serving is designed for.”And then you had like, basically orders of magnitude differences between everything.[00:43:29] Mikhail Parakhin: Exactly. Which is, which, uh, which was, you know, a bit of a challenge to implement, like when, like even simple things. Uh, be- since it violates all the assumptions, for example, multi-instance GPUs, like MIGs don't work as well.But we needed, uh, to get MIG to work because, ‘cause otherwise it's way too expensive. And so we had to deal with the, yeah, with, uh, lots of infrastructure and, and, uh, work with, uh, uh, Fireworks and CentML, uh, you know, to help with optimizations and browser-based, as you mentioned. Yeah, like, takes a village.[00:44:04] swyx: Okay. So there's a lot of like, I guess, experimentation in the infrastructure so far, and you've published more or less what you have here. I guess I'm, I'm less familiar with CentML. I, I don't do, uh, that much work in this, this part of the stack. But why was it the sort of preferred instance platform?[00:44:22] Mikhail Parakhin: There are really three probably top companies. There used to be, uh, uh- Three top companies, uh, at least I was aware of that did, uh, LM optimization. You know, together Fireworks and Santa ML, not necessarily in that order. Santa ML recently got acquired by NVIDIA. Uh, what they did is if you have a model and you want to optimize it to a specific prof-- uh, profile of usage, uh, they would go and do it.And, uh, we work with, with those companies, uh, this was work particularly in with Santa ML and NVIDIA to get them the best possible results out of it. And, and sometimes you, you have to retune depending on, like sometimes you want the maximum throughput, sometimes you want minimal latency, sometimes you want like the cheapest, right?And, yeah, or some combination. And so yeah, these are people who would come and help you.[00:45:14] swyx: I see. I see. Yeah, yeah. I'm familiar with these people for the LLM, you know, autoregressive stack. But the other interesting category of these optimizers is also the diffusion people, whereas like Fel and, you know, uh, Pruna recently has come up a lot as well, which I think is like really underappreciated, uh, at least by myself, because I, I thought, oh, all the workload would be LLMs, but actually there's a lot of diffusion as well.[00:45:38] Mikhail Parakhin: Exactly.[00:45:38] swyx: There's a lot here, so I, I, I... it's, it's, uh, it's, it's, it's hard to cover. But I, I do think like people underappreciate the importance of customer simulation, basically. I think this is something that I'm candidly still getting to terms with. Uh, you know, uh, you also-- your team also like prepared this, like, really nice diagram.Uh, I, I assume this is AI generated.[00:46:00] Mikhail Parakhin: Yeah, it looks-[00:46:01] swyx: Maybe it's not.[00:46:01] Mikhail Parakhin: Yeah, it looks, uh, Gemini-ish. Yeah, but, uh, uh, honestly, I, I don't know where, where the hell they generated. It looks, look, uh, looks like it's, uh, Google. But the interesting part, John, that, that, uh, we haven't covered, but I, I wanted to mention is if your store had previous customers, rather than it's a new store, you're like new merchant just launching things, it helps tremendously in just correlation and forecast.Yeah, we take your previous, uh, customer's behavior, and we create agents that replicate those specific distribution of, of customers that you get, and then we a- we apply those to your changes, and then that, that raised raw, you know, the re-- uh, just correlation with the add to cart events or to-- with conversion or whatever it, it, it may be, uh, quite dramatically.So, uh, replicating humans in general seems like an interesting, cool challenge.[00:46:58] swyx: As a shareholder, I think this is the-- like if people are Shopify shareholders, they should really deeply understand this because this is basically the moat. The, the more you use Shopify, the more it will just automatically improve, right?Like you're, you're doing the job for them.[00:47:13] Mikhail Parakhin: Yeah, that's what we started with. Like, uh- ... uh, otherwise, if you're just a startup, I wouldn't do it if, uh, you know, if it was my startup because Without the data, it, yeah, as, as you said, it's, it's exactly the case that, uh, whatever you say in prompt, that's, that's what the agents will be doing.[00:47:30] swyx: The statistician in me wants to like really satisfy the sort of, um, statistical intuition, I guess. Um, to me it's kind of, uh, the, the word that comes to mind is, um, ergodicity. Uh, so let's say a, a customer takes this path, customer takes this path, customer takes this path, right? Um, the... In my mind, the way I explain it is like, okay, here, here's the ninety-five percentile, here's the five percentile, and here's the median, right?Um, but to me, what SimGym is potentially doing is that it can, uh, modify... It can sort of model the sort of in-between sort of journeys as well, that, that maybe are dependent on the previous states. This may be like a very RL-type conclusion where like basically the summary statistics, if you only did naive AB testing, you only have the, the statistics at, at, at a certain point, and you only judge based on the sort of overall summary statistics.But here you can actually model trajectories. Does that make sense? Or-[00:48:31] Mikhail Parakhin: That makes total sense because like, well, that, that makes even more sense that maybe even you realize bec- because-[00:48:38] swyx: Okay. Please,[00:48:38] Mikhail Parakhin: please. Yes ... we do-- Yeah. The, so internally, uh, we have this system, we talked about it briefly once at NeurIPS.We have a huge HSTU-based system that models the whole companies, uh, and their possible paths. And like- Yeah ... what you are, what you are showing, like actually at any point of time, you can either model the user's behavior or you mo- can also think about, uh, the whole merchant as a company, as the entity that acts in the world.You can model that as well. And then you can do, can do counterfactuals. In your graph, like in your blue graph, uh, if you're... Imagine in the center there, uh, somewhere in the middle, you would have an intervention. I give that person a coupon, or I don't know, I send a personal thank you card, or give a discount in some- somewhere.And then you can, uh, then you can do forward rollouts from that counterfactual. So what would have happened with that intervention or without the intervention? And you can even ch- change where that intervention, uh, in time can happen, right? Like some- where, where in this journey. So we, we do this at the Shopify scale for our merchants, and then if we notice that something that they can be fixing, like there's a strong counterfactual, like we have Shopify policy, they basically get a notification like, “Hey, we think your...something is wrong with your-” I don't know, Canadian sales. Like, uh, it looks like it's misconfigured. Here's what you need to do. Or do you think like, uh, you have to set up this campaign with these parameters? And we do that at the buyer level to literally offer discounts or cashback or, or things to buyers.So this is-- I'm getting very excited. Like this is my sort of area of, uh, interest, I guess, and, and hobby. But being able to m-model something complex as human beings or companies and model counterfactuals on it, where you can have interventions in the future and optimize when to make intervention, what kind inter-- uh, what kind of intervention to make.It's such an unlock that previously was completely impossible. Like the-- it was, it was always dreamed of, but never... Like how would you even simulate it without LLMs or HTUs? I think very, very exciting times.[00:50:59] swyx: I just wanted to, uh, to maybe illustrate this. I, I'm not the best illustrator, but I, I am a conceptual statistics guy.And y-you know, you cannot just do this. Like this is a dimensionality AB test doesn't do, right? Like, uh, because it doesn't have the, the, the change over time, uh, stochastic nature, uh, and it doesn't have the sort of contextual like... Here's all the context to this point. Um, okay, cool. Um, that's SimGym.You're, you're gonna burn a lot of tokens on this thing. But you're, you're one of the, the only scale platforms in the world that can, uh, that can do this across a huge variety of workloads, right? I'm even curious on a sort of human, uh, research level of like, well, do, does retail behave d-differently from like clothing sales?D-does that behave differently from electronic sales? I, I don't know. I don't know what else you guys... The Kardashian shoppers, do they differ from like people who buy, uh, I don't know, cars and, uh, whatever.[00:51:55] Mikhail Parakhin: Well, very different, and different sensitivities and different modes of, uh, shopping and, and different levels of what's important.Now, to-totally, you can do aggregations at, uh, at a store level. You can do aggregations at a different, uh, category level. I don't know if, uh, you know, for our statisticians among us, I couldn't believe, but we-- recently we're looking at it, and we had to bring back, uh, CRPs, you know, Chinese restaurant process.It's a, like, way of aggregating and, like, naturally grow clustering. So across... Specifically to answer questions that, uh, like you were just posing on how, how if, if buyers behave different categories. And I'm like, “I haven't seen CRP since two thousand and one.” It's[00:52:37] swyx: so What? It's so- What is... No, I haven't, I haven't seen this.No. This is not in my training. Uh,[00:52:44] Mikhail Parakhin: but, but yeah, it, uh, uh, it actually, like the, the-- there was a very popular kind of theory, popular neurips HTML circles in early two thousands, uh, kind of nice. And now, now it has practical applications, uh- Yeah ... that we were resurrecting.[00:53:03] swyx: Yeah, amazing. Uh, I, I can see, I can see how this is like a, uh, a fun job for you where you get to apply all these things.Um, yeah, yeah, so super cool. Super cool. So, okay, so, so anyone who, who knows what CRPs are and has always wanted to use them at work, uh, they should, they should definitely join Shopify. Okay, so w-we have a lot and but I, I'm, I'm being mindful of the time. I, I do wanted to, to sort of cover some other things.Um, I-I'll give you a choice, UCP or Liquid?[00:53:30] Mikhail Parakhin: Liquid. I think, I think on UCP, you know, like UCP is very important for us and, and it just we are-- UCP, we have a structured, uh, discussions, and you can read about them, and we have, uh, blog posts, and we have a big release this week, in fact, like with our catalog.Oh,[00:53:46] swyx: okay.[00:53:46] Mikhail Parakhin: Uh, yeah,[00:53:46] swyx: but- Le-I mean, we, we can, we can discuss the, the, the release briefly because we'll release this after the-- after it's already announced so whatever. There's a catalog that you guys are doing?[00:53:55] Mikhail Parakhin: Yeah. So we are, we are- Okay ... we are bringing in capabilities of a whole, uh, Shopify catalog.Basically, you now you can search for products, you can do lookups by specific ID, you can do bulk lookups when you need to bring m-multiple products. You don't need to know in ad-in advance what you're trying to show or to sell or check out. Like, you can now, you can now have this decided at, at runtime, and this big area for investment for us for both non-personalized and personalized searches, trying to provide basically a win-window into whole universe of products that are being sold everywhere in the world.And Shopify is really not exactly, but almost like a super set of any-anything being sold. Now we are bringing it into UCP and, uh, and, uh, identity linking is another big thing for us, uh, so that you, you can use, uh, like Google or whatever, whatever identity you have, uh, they're minimizing friction.[00:54:56] swyx: Yeah. So[00:54:57] Mikhail Parakhin: yeah, big release for us.But Liquid AI of course we never talk about, and the problem might be more, more aligned with what we d-discussed previously on this chat.[00:55:07] swyx: Sure. The main thing that everyone understands about Liquid is that it is inspired by Worm, and I still don't know why. I'm curious on your explanation. I think you, you, uh, you can make things very approachable.And also I think like what is the potential of like the, the level of efficiency that you get out of Liquid?[00:55:23] Mikhail Parakhin: You- we all familiar with transformer architectures. And, uh, for the longest time, there was a competing architecture, it's called the state space models. So, so Sams, uh, you know, Chris, Chris Reyes, one of the pioneers and, and lots of startups, uh, trying to make those realities.They have, uh, significant benefits being main being, uh, being much faster and, uh, lower footprint and not quadratic in length, you know, sort of, uh, linear in, in, uh, in your context length. But with state space models- They never quite made it. Like they're used-- They have, uh, certain niches when they thrive, their hybrid architectures are useful, but they never quite made it.And liquid neural networks are, you can think of them as a next step, like, uh, sort of, uh, state-space model square. It's non-transformer architecture that's more complicated than sta-state space and really difficult to code if you-- if I'm being honest. But it's, um, very efficient. It's, uh, subline-- sub, uh, quadratic in, in length of your context.Uh, it's very compact way to represent things, and that's a liquid AI company. They... Their goal is to productize it, and very often you have this need, uh, when you need to have long context and small model, and you want to have low latency. Like in general, it's basically on par with transformers, and if you do hybrids with transformers, it's, it's even better.That's why we at Shopify, when we tried multiple and we constantly try multiple models, multiple companies, we found that for small, particularly with low latency applications, when you have low latency and/or if you need longer context lengths, liquid was the best. And so we still use the whole zoo and always like obviously test and use everything, uh, every open source model and, you know, it feels l

Will Brocker
Mountains Will Move - Russell Lorfing at GTC 4-12-2026

Will Brocker

Play Episode Listen Later Apr 12, 2026 60:14


Mountains Will Move - Russell Lorfing at GTC 4-12-2026 by Will Brocker

gtc mountains will move
Acting Up with GTC
AUGTC S3 E4: Don't Fall Asleep! Inside The Drowsy Chaperone

Acting Up with GTC

Play Episode Listen Later Apr 9, 2026 52:22


Step behind the curtain with us on this episode of Acting Up with GTC!

The Lunar Society
Michael Nielsen – How science actually progresses

The Lunar Society

Play Episode Listen Later Apr 7, 2026 123:03


Really enjoyed chatting with Michael Nielsen about how we recognize scientific progress.It's especially relevant for closing the RL verification loop for scientific discovery.But it's also a surprisingly mysterious and elusive question when you look at the history of human science.We approach this question stories like Einstein (who claimed that he hadn't even heard of the famous Michelson-Morley experiment, which is supposed to have motivated special relativity, until after he had come up with the theory), Darwin (why did it take till 1859 to lay out an idea whose essence every farmer since antiquity must have observed?), Prout (how do you recognize that isotopes exist if you cannot chemically separate them?), and many others.The verification loop on scientific ideas is often extremely long and weirdly hostile. Ancient Athenians dismissed Aristarchus's heliocentrism in the 3rd century BC because it would imply that the stars should shift in the sky as the Earth orbits the sun. The first successful measurement of stellar parallax was in 1838. That's a 2,000-year verification loop.But clearly human science is able to make progress faster than raw experimental falsification/verification would imply, and in cases where experiments are very ambiguous. How?Michael has some very deep and provocative hypotheses about the nature of progress. One I found especially thought-provoking is that aliens will likely have a VERY different science + tech stack than us. Which contradicts the common sense picture of a linear tech tree that I was assuming. And has some interesting implications about how future civilizations might trade and cooperate with each other.Watch on Youtube; read the transcript.Sponsors* Labelbox researchers built a new safety benchmark. Why? Well, current safety benchmarks claim that attacks on top models are successful only a few percent of the time, but the prompts in those benchmarks don't reflect how real bad actors actually write. You can read Labelbox's research here. If this could be useful for your work, reach out at labelbox.com/dwarkesh* Mercury has an MCP that lets you give an LLM access to your full transaction history, including things like attached receipts and internal notes. I just used it to categorize my 2025 transactions, and it worked shockingly well. Modern functionality like this is exactly why I use Mercury. Learn more at mercury.com* Jane Street's ML engineers presented some of their GPU optimization workflows at GTC, showing how they use CUDA graphs, streams, and custom kernels to shave real time off their training runs. You can watch the full talk here. And they open-sourced all the relevant code here. If this kind of stuff excites you, Jane Street is hiring — learn more at janestreet.com/dwarkeshTimestamps(00:00:00) – How scientific progress outpaces its verification loops(00:17:51) – Newton was the last of the magicians(00:23:26) – Why wasn't natural selection obvious much earlier?(00:29:52) – Could gradient descent have discovered general relativity?(00:50:54) – Why aliens will have a different tech stack than us(01:15:26) – Are there infinitely many deep scientific principles left to discover?(01:26:25) – What drew Michael to quantum computing so early?(01:35:29) – Does science need a new way to assign credit?(01:43:57) – Prolificness versus depth(01:49:17) – What it takes to actually internalize what you learn Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe

Audio-Podcast – OrionX.net: Deep Insight, Market Execution, Customer Engagement
Analyst Roundtable: GTC, AI, Social Media in Court, War-time Supply Chain, BTC Bubbles – OXD36

Audio-Podcast – OrionX.net: Deep Insight, Market Execution, Customer Engagement

Play Episode Listen Later Apr 6, 2026


Analyst roundtable covering the big ideas in technology that are changing the world, with Adrian Cockcroft, Stephen Perrenod, Chris Kruell, and Shahin Khan. In this episode: - GTC postview - Nemotron vs Nematron - OpenClaw, Instar.sh - Social Media in Court - Modeling BTC Bubbles - Scientific Bitcoin Institute (BSI) - War-time Supply Chain - Memory shortage - ASML Backlog, SK Hynix order [audio mp3="https://orionx.net/wp-content/uploads/2026/04/OXD036_ART-12_20260405.mp3"][/audio] The post Analyst Roundtable: GTC, AI, Social Media in Court, War-time Supply Chain, BTC Bubbles – OXD36 appeared first on OrionX.net.

The Cloud Pod
349: Gmail Finally Lets You Ditch xXDragonSlayer2004Xx

The Cloud Pod

Play Episode Listen Later Mar 31, 2026 64:27


Welcome to episode 349 of The Cloud Pod, where the weather is always cloudy! Justin and Jonathan managed to make it into the studio this week, and they brought a guest! Dave Garaway jas joined us, and brought some on-the-ground knowledge from GTC, plus a slew of supply chain attacks, Gmail username changes and Claude's code debacle. We've got all this and more – so let's get started!  Titles we almost went with this week AWS Console Gets a Makeover Nobody Asked For From Eight Hours to 22 Seconds, Hackers Got Fast AWS Spring Cleaning Hits Nine Services Hard Trivy Pursuit Turns Into a 500K Credential Heist Skip the Consultant, AWS Security Now Hacks Itself AWS Pen Testing Agent Pokes Your Cloud Around the Clock Your Cringey Gmail Address Gets a Second Chance Stop Babysitting Servers, Let Google Handle MCP AI Agent Untangles Your Kubernetes Networking Spaghetti One Bad Actor Poisons a Hundred Million Downloads Lambda Finally Hits the Gym with 32 GB From GPU Hype to Production Inference Without the Hyperscaler Headache Follow Up 01:28 Hegseth, Trump had no authority to order Anthropic to be blacklisted, judge says A US District Judge granted Anthropic a preliminary injunction blocking the Department of War’s blacklisting, ruling the designation was First Amendment retaliation rather than a legitimate national security action. The court found officials lacked authority to blacklist Anthropic without considering less restrictive alternatives or providing evidence of an urgent security risk, noting the designation was triggered by Anthropic’s “hostile manner through the press.” The practical business impact was already substantial before the ruling, with three trade deals cancelled and other potential partners delaying negotiations, representing potentially billions in lost contracts over five years. Anthropic continues to balance the legal fight with maintaining its government relationships, publicly emphasizing alignment with the Department of War’s mission around safe AI deployment even while litigating against it. For cloud and AI vendors, this case establishes a notable precedent around government procurement decisions and First Amendment protections, with implications for how companies publicly challenge federal contracting positions. 02:35 Jonathan – “I'm guessing Anthropic is super busy with all the people coming to them for deals right now, because it seems to me that Anthropic is getting all the business customers and OpenAI are getting the personal customers.”   04:08 Delve Announces Changes and New Customer Support Measures  Delve has

This Week in XR Podcast
Why Social Media Lost in Court and AI Agents Demand Total Surveillance – Shelley Palmer's 5th Visit

This Week in XR Podcast

Play Episode Listen Later Mar 31, 2026 53:47


Shelley Palmer,media technologist, advisor, and author with over 700,000 daily newsletter subscribers, returns to the show. He's one of the sharpest thinkers writing about AI today, and this conversation covers the full arc: from social media liability to the trust collapse coming for all of us, and into the real productivity gains and surveillance trade-offs of living inside an AI-first workflow.The episode opens with the Google and Meta lawsuit verdict and quickly moves past the legal question. Shelley's position is precise: you can't legislate parenting, but you can legislate transparency, and the tech industry has failed on that front entirely. The $6 million judgment against Meta and Google is a rounding error — not a deterrent. What matters is what platforms actually engineered: engagement above all else, backed by neuroscience, probabilistic math, and dopamine feedback loops optimized for shareholders, not users.AI XR News You Should Know: OpenAI is ending Sora and pivoting hard to Codex and enterprise. Ben Affleck secured $900 million from Netflix for a custom AI filmmaking tool. Epic Games cut 1,000 jobs as Fortnite loses audience. NVIDIA's Jensen Huang introduced Nemo Claw and Open Shell at GTC — a corporatized framework for personal AI agents.Key Moments[00:01:15] – Charlie opens noting the show missed one episode in nearly 300 — his daughter's wedding[00:01:55] – OpenAI kills Sora; the Critters director goes dark before the episode[00:04:45] – Google and Meta lose their social media addiction lawsuit; Meta also loses in New Mexico[00:08:07] – Shelley on what can actually be legislated: not parenting, but transparency[00:11:42] – Shelley on Zuckerberg: he genuinely believed connection would be net positive; ask him today[00:13:31] – "Planetarily net negative. No matter what good it does, it does more harm."[00:18:16] – Rony on dopamine engineering: neuroscientists studying pixel size, color, sound to refine addiction[00:19:40] – Shelley reframes it: engagement maximization for shareholders, no more insidious than that[00:23:19] – The physiological change argument: humans evolved to default to trust; AI-generated everything breaks that[00:31:50] – Rony's counterpoint: trust will reset local; the software ecosystem will follow[00:36:53] – Shelley: "Our business increased last year. Everyone on my staff is doing 400 times the work."[00:44:42] – AI-first means automating every workflow you can honestly automate — and knowing what isn't ready[00:45:06] – Jensen's Nemo Claw and Open Shell: the safer path to personal AI agents, and what it actually costs[00:49:42] – The surveillance trade-off: an effective AI agent requires more personal data exposure than anything before it[00:51:24] – Apple's Secure Enclave play: why Tim Cook may win the AI trust war in the endThe productivity gains are real, but so is the privacy exposure, and the systems that earn trust — at every level — are the ones that will survive.This episode is brought to you by Zappar, the company behind Mattercraft — the leading visual development environment for building immersive 3D web experiences across mobile, headsets, and desktop. Mattercraft now features an AI assistant that helps you design, code, and debug in real time, right in your browser. Start building at mattercraft.io. Subscribe to the AI XR Podcast wherever you listen.Watch the full episode for the full breakdown. Available where podcasts are. Full videos available on YouTube. https://youtu.be/S_AECjELYyoSee Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0
Mistral: Voxtral TTS, Forge, Leanstral, & what's next for Mistral 4 — w/ Pavan Kumar Reddy & Guillaume Lample

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Mar 30, 2026 48:48


Mistral has been on an absolute tear - with frequent successful model launches it is easy to forget that they raised the largest European AI round in history last year. We were long overdue for a Mistral episode, and we were very fortunate to work with Sophia and Howard to catch up with Pavan (Voxtral lead) and Guillaume (Chief Scientist, Co-founder) on the occasion of this week's Voxtral TTS launch:Mistral can't directly say it, but the benchmarks do imply, that this is basically an open-weights ElevenLabs-level TTS model (Technically, it is a 4B Ministral based multilingual low-latency TTS open weights model that has a 68.4% win rate vs ElevenLabs Flash v2.5). The contributions are not just in the open weights but also in open research: We also spend a decent amount of the pod talking about their architecture that combines auto-regressive generation of semantic speech tokens with flow-matching for acoustic tokens (typically only applied in the Image Generation space, as seen in the Flow Matching NeurIPS workshop from the principal authors that we reference in the pod).You can catch up on the paper here and the full episode is live on youtube!Timestamps00:00 Welcome and Guests00:22 Announcing Voxtral TTS01:41 Architecture and Codec02:53 Understanding vs Generation05:39 Flow Matching for Audio07:27 Real Time Voice Agents13:40 Efficiency and Model Strategy14:53 Voice Agents Vision17:56 Enterprise Deployment and Privacy23:39 Fine Tuning and Personalization25:22 Enterprise Voice Personalization26:09 Long-Form Speech Models26:58 Real-Time Encoder Advances27:45 Scaling Context for TTS28:53 What Makes Small Models30:37 Merging Modalities Tradeoffs33:05 Open Source Mission35:51 Lean and Formal Proofs38:40 Reasoning Transfer and Agents40:25 Next Frontiers in Training42:20 Hiring and AI for Science44:19 Forward Deployed Engineering46:22 Customer Feedback Loop48:29 Wrap Up and ThanksTranscriptswyx: Okay, welcome to Latent Space. We're here in the studio with our gues co-host Vibh u. Welcome. Thanks. Excited for this one as well as Guillaume and Pavan from Mistral. Welcome. Excited to be here.Guillaume: Thank you.swyx: Pavan, you are leading audio research at Mistral and Guillaume, you're Chief Scientist,Announcing Voxtral TTSswyxHost(00:05) Okay. (00:05) Welcome to Lean Space. (00:06) We're here in the studio with trustee co-hosts, Vibhu. (00:09) Welcome.VibhuHost(00:11) Very excited for this one.swyxHost(00:12) As well as Guillaume and Pavan from Mistral. (00:15) Welcome. (00:16) Excited to be here. (00:17) Thank you for having us.(00:18) Pavan, you are leading audio research at Mistral and Guillaume, you're a chief scientist. (00:23) What are we announcing today where we're coordinating this release with you guys?GuillaumeGuest(00:26) Yeah, so we are releasing Voxtral TTS. So it's our first audio model that generates speech. It's not our first audio model. We had a couple of releases before.(00:35) We had one in the summer that was Voxtral, our first audio model, but it was like a transcription model, ASR. Like a few months later, we released some update on top of this, supporting more languages. Also a lot of table stack features for our customers, context biasing, precision, timestamping and transcription. We also have some real-time model that can transcribe not just at the end of the level.(00:56) You don't need to fill your entire audio file, but that can also come in real-time. And here, this is a natural extension in the audio, so basically speech generation. So yeah, so we support nine languages, and this is a pretty small model, 3D model, so very fast, and also state of the art. Performed at the same level as the base model, but it's much more efficient in terms of cost, and also much, in terms of cost, it's also much cheaper, only a fraction of the cost of our competitors.(01:22) And we are also releasing the work that this model is running.swyx What's the decision factor?Guillaume It's a good question.swyxThere will be more. Yeah, Pavan, any sort of research notes to add on?Architecture and CodecPavan: But it's a novel architecture that we develop inhouse.We traded on several internal architectures and ended up with a auto aggressive flow matching architecture. And also have a new in-house neural audio codec. Which, converts this audio into all point by herds latent [00:02:00] tokens, semantic and acoustic tokens. And yeah, that's that's their new part about this model and we're pretty excited that it's, it came out with such good quality and Jim was mentioning. Yeah, it's a three B model. It's based off of the TAL model that we actually released just a few months back and insert trunk and mainly meant for like the TTS stuff, but they need text capabilities are also there. Yeah.swyx: So there's a lot to cover.I always I love any, anything to do with novel encodings and all those things because I think that's obviously I creates a lot of efficiency, but also maybe bugs that sometimes happen. You were previously a Gemini and you worked on post training for language models, and maybe a lot of people will have less experience with audio models just in general compared to pure language.What did you find that you have to revisit from scratch as you joined this trial and started doing this? At leastUnderstanding vs GenerationPavan: when it comes to, for, I think the, there are two buckets, I guess the audio understanding and audio [00:03:00] generation. The audio understanding, like the walkthrough models that Kim was mentioning that we released earlier.The walkthrough chat that we released I think July last year, and the follow up transcription only, models family that we released in January, that would be one bucket, and the generation is another bucket. I think. You can also treat them as a unified set of models, but currently the approaches are a little different between these two.To your question on how audio is fed to the model? In the understanding model, it's very similar to actually Pixar models that we also released,swyx: yes.Pavan: That'sswyx: amazing.Pavan: It was pretty, I, that was the first project I worked on after joined Misra. It was pretty, pretty nice. And Wtu was very similar in spirit.I guess So we feed audio through an audio encoder similar to images through a vision encoder, and it produces continuous embeddings and which are fed as tokens to the main transformer decoded transformer model. Yeah. On the model output is just text. So on the output side, there is nothing that needs to be done in these kinds of mode.I [00:04:00] guess the interesting part of what the generation stuff is, the output now has to produce audio and. The approach that we have is this neural audio codec, which converts audio into these latent tokens. There is a lot of existing attrition and a lot of models which are based off of this kind of approach.And we took a slightly. A different, design decisions around this. But at the end of the day, the neural audio product converts audio into a 12.5 herdz set of latents. And each latent is, has a semantic token and a set of acoustic tokens. And the idea is that you take these discrete tokens and then feed it on the input side.There's several ways to use this at each frame, but we just sum the embedding. So it's like having key different vocabularies. Combine all of them because they all correspond to one audio frame on the input side. The output side is the interesting part on the output side, the, it's not the, I don't know if it's the most popular, but one.Popular technique is to have a depth transformer [00:05:00] because you have K tokens at each time step, like with a text, you just have one token at each time step. So you just do predict the token from the vocabulary with, yeah, with just, you get probabilityswyx: This's a very straightforward text. VeryPavan: straightforward.swyx: Yeah.Pavan: But if you have K tokens, then the name thing would be to predict all of them in paddle. That doesn't work. At least that doesn't work that well because audio has more entropy. And the, one of the techniques people use is this depth transformer where you you almost have a small transformer, or it can be L-S-T-M-R in as well, but people use transformers and you predict the K tokens in auto aggressive fashion in that.So you have two auto reive things going on.Flow Matching for AudioPavan: So the thing we did differently is in, instead of having this auto aggressive K step prediction, we have a flow matching model. Instead of modeling this as a discrete token set we trained the codec to be both discrete and continuous to have this flexibility.So we did try the discrete stuff too, and which it works well, but the continuous stuff works just better. So yeah, we took this flow matching, so the, it's a flow [00:06:00] matching head, which takes the latent from the main transformer and like kind in fusion, it's denoising, but in this flow matching itself, velocity estimate.So you go from this noise t all the way to there. Audio latent, which corresponds to the 80 millisecond audio and then, which is sent through the work order to get back the 80 millisecond audio frame.swyx: Yeah. Is this the first application of flow matching in audio? Because usually I come across this in the image.Pavan: Yeah. Actually, in some sense there are models flow matching models in audio, but I think this specific combination I could be wrong. There could be somewhat. No. I haven't seen. I haven't seen much work in this, so I think it's novel and a lot of it's just a way bigger community, so they, I think they pioneer a lot of these diffusion flow matching work, and it's interesting to adopt some of the ideas there into audio and,swyx: yeah.Pavan: Yeah, I'm, personally that's the think part which is trying out about. One of more meta point is unlike text, even in vision, I think this is true, but in [00:07:00] audio step literature that there is no.Winner model, yet there is no, okay, this is the way you do things. It's it's still by, I think people are still iterating and figuring out like what's the best overall recipe. I guess the idea. Pretty sure there are models which are also completely end-to-end, like NATO audio. NATO audio, but it's still not come to a convergence point where this, the right way to think that.That also makes. A space pretty exciting to explore.Real Time Voice AgentsVibhu: What are some of the ways to look at it?Vibhu: There are ways where you can do diffusion for audio generation, but if you want like real time generation, that's a big thing with the approach I'm assuming that you took. Yeah. And also like how do you go about evaluating different axes of what you care about, yeah,Pavan: good point. I think we so you can do just flow matching diffusion for the whole audio. We didn't even go down that path because one of the main applications is voice agents and we want real time streaming, and that's the use case. That's not the only use case, but that's one of the primary use cases we want to get to.So we [00:08:00] picked the auto aggressive approach for that. And within the auto aggressive space, again, you can do chunk by chunk or you can do so we picked the. I think at least personally prefer the operations, which are the simplest, and so we try to see, can we just add audio as just another head to our regular transformer decode model because that kind of makes it easier for eventual end-to-end modeling of audio text native modeling.Yeah. And it works pretty well. So I guess we went with that and we tried a little bit, but the flow matching head itself, like we had a discreet. Diffusion kind of approach, which also works well, but the flow matching work better.swyx: I was just curious about how you also think about this overall direction of research.Do you basically, when you work with the audio team, do you set some high level parameters and then let them explore whatever, or how does it work between you guys?Guillaume: No I think the way it works is that we are the, we are prioritizing together, I think, what are the most important features because there are many things we can do [00:09:00] in audio.Yeah, I think we try to. These are like how we should do things, for instance. Ultimately what we want to do is to build this through duplex model, but we are not going to start this start there directly, I think is. Some of the project people are doing, butswyx: just to confirm, full effects means it can speak while I'm speaking or,Guillaume: yeah.Okay. Audio. Yeah. Yeah. So intimately we're going to get there, but for us it was, we decided to take it like a step by step. So we start with whatever is the most important. I think support customers, which is the transcription is the most popular use case. Then the speech generation, Soviet time, just a bit before that.And then actually to be like more, but try combining everything all together. But but yeah, we thought it was also important to like separate things and optimize each capability one by one before weswyx: measure of that together. And the super omni model. ButGuillaume: very interesting because as Par said, it's when you work on some other domains of this airline and everything, there are many areas where I think it's not as interesting.For instance. Many places, it's essentially just around data or like creating new environments on a lot of kind [00:10:00] of easy things. But things were, I think the research is maybe not as interesting. Were in audio. There are so many ways to actually build this model. So many ways to go around it. That's the sense I think is really interesting.And what we also tried for speed generation is that we tried multiple approaches. What was interesting that even though they were extremely different, they under the big know the particles but the for matching turned out to be quite more natural. So we are happy with this.swyx: Is there intuition why it maybe like flow matching is just models speech better in some natural fundamental, latent dimension?Pavan: No, I think the main thing is e even at a particular time step, there is a distribution of things.swyx: Yes.Pavan: To be predicted like the way you inflate. So you already know the word that you're speaking and Yeah. The intake space, let's say the word maps register a single token for simplicity.In most cases it does. So there is not a lot of so you just pick the word, but with within audio, even the same word could, even with your own voice, could be inflicted in so many different ways. And I think [00:11:00] any approach which like models this distribution and. And flow matching is one, one of the take.It's not the only one at all, but it's a one which works pretty reasonably well. I think that's better. So you have to pick across several different, the intuition I have is it's, there are some, several different clusters each corresponding to some specific way you would inflict, pronounce that thing.And you can't predict the mean of it because that corresponds to some blurred out speech or something like that. But you have to pick one. And then like sharpswyx: conditional inference.Pavan: Yeah, exactly.swyx: Is that all covered under disfluencies, which is I think the normal term of art. Pauses intonations. By the way, I have to thank Sophia for setting all this up, including like some of these really good notes becausePavan: Yeah.swyx: I'm less familiar with the audios for me.Pavan: No. I think dis dismisses are definitely one such Eno defenses is more likeswyx: which is arms are.Pavan: Yeah, arms. And also repeat like you like,swyx: yeah.Pavan: You do this full of words, your thinking, so you repeat the word.swyx: Okay. Whereas intonation is like a diff, it's up up [00:12:00] speak and all this.Okay.Pavan: Yeah. So I think there is a lot of like entropy. And modeling it as a distribution. And a, any technique which helps with it and the depth transformer is a conditional way of modeling this. And Transformers actually really good at it, even though that's a mini transformers. So I think that worked pretty well too for us too.It's just that the main concentration is when you have a depth transformer. If you have K tokens, you need to do K auto steps, right? Even though it's a small thing, it's K steps, which is very vacant, say heavy, but flow matching. We were able to cut it down significantly. So we are able to do the inference in quad steps or 16 steps and it works pretty well.And there are more normal techniques to bring it down even further to like, in extreme case, one step like we're not doing it yet, but it at least the framework, LEDs itself to more efficient and Yes.swyx: And the image guys have done.Pavan: Yeah.swyx: Incredible work guys. Yeah.Pavan: It now you just. Send a prompt and you get an image.swyx: Yeah. Surprisingly not enough. I think image model labs use those techniques in production. I think it's, I feel like it's a lot of research demos, but [00:13:00] nothing I can use on my phone today.Guillaume: The thing, there's a thing that would be interesting here is that since, indeed I've been so much sure that has been done in the vision community compared to radio dys, stomach, I think there are so many long infra Yeah.And there are so many things we can do to actually improve this further. So it's our first version, but we have so many ways to exist, much better and much more efficient, cost efficient, soswyx: yeah.Guillaume: So really it's not a new field at all, of course, but there are still so many things that can be done.Perfect. It'sswyx: nice. I should also mention for those who are newer to flow matching, I think the creator, this guy's name is Alex, he's done I think in Europe's maybe two Europes as ago. There was, there's a very good workshop. There's one hour on like this matching is I would recommend people look that up.That's the other thing, right?Efficiency and Model Strategyswyx: The efficiency wise, like I, I imagine like the reason is open weights the reason you pick 3.6 B backbone it you are 3.4 B you are, try to fit to some kinda hardware constraints. You kinda fits some kinda basic constraints. What are they?Guillaume: Not necessarily, I think something we care about in our model that they're efficient.So we have a [00:14:00] lot of separate model, for instance. So we have this that is very small, very efficient. We also have a small OCR model that is available. Good, highly efficient as well. And I think on a project maybe there, I think companies are going to take is to have a coverage general model that will do a bit of everything.But that is also going to be expensive. On here. What want say is if you care about this specific use case, if you can actually use this model, it just does that. It's extremely good at it. Survey, very efficient. That's why we can actually add. We do, but also OCR that are like really good at that.And that would be much more cost effective factors and the general model that will contain a lot of capabilities you don't really need. So yeah. So we're doing like general model, but also like more customized model. This,Open Weights and BenchmarksVibhu: how does it compare to other TTS models? It's, we are going follow open wave.We're just dropping it. I think it's pretty good.Pavan: Yeah, I think it's pretty good. Like it, it's definitely one of the best. For sure. It's probably I would say it's the best open source model, butVibhu: decipher themselves.swyx: Yeah.Voice Agents VisionVibhu: Why now? How does it fit into broader ral vision? How do you see voice agents?How do you see voice? I think every year I've heard, okay, you're a [00:15:00] voice. You're a voice. There's a lot of architectural stuff. There's a lot of end time that see it, your solving, but where do you see voice setting?Guillaume: We had so many customers asking for voice. That's also why we wanted to build it.What's interesting in this domain is that. In a sense, if you take something simple like transcription it doesn't seem like something that should be very hard to do for a model. It's essentially, it's pattern recognition. It's classification on this. Models are very good at classifying, right?Or nonetheless, when you talk to them it's not there yet, right? It's not, you don't talk to them the same way you talk to a person. On something, maybe people don't realize it. It's in English it's still much better than in any user language, even compared to French instance. If you talk to this million in French, when you see people talking to this they'll talk very slow.They'll articulate as much as they can. So it's not natural, right? We're not yet to this. And I think, yeah, maybe the next generation will not know this, but yeah, I think people that. But our edge will actually always keep this bias speaking very slowly when they talk to this model. Even if maybe, probably in a couple of years, maybe next year it'll not be necessary anymore.But yeah. But what's interesting is to see that yeah, even for like languages [00:16:00] like yeah, French and Spanish Germans that are not no, no resource on religion. You have a lot of audios there on still it's not as good. And I think a consequence. Because then for this, I suppose just is not as much energy, as much effort that has been put done in some other mod that for some vision or like coding.But but yeah, there's still a lot of progress to be done. I think it's just a question of doing the work and it's clear path I think to get there.Pavan: It's a little fascinating because I worked on Google Assistant I think while back at this point, but it's, I think it's, it like when you take a step back, it's fascinating.It's not that long ago. It was like four years ago or five years ago, and it's now it's completely audio in, audio out and the function calling and the whole thing happens completely end to end. And in a very natural,swyx: yeah,Pavan: natural way and still ways to go. Kim was telling, even despite all the previous, it's not like you're speaking to a person.When you talk to any of these agents, bots, or voice mode kind of situation, it's still like a gap. I think that's the great part and I feel like with even the existing [00:17:00] stack, we should be able to get to this very natural speech conversational abilities soon enough I guess.And we'll also hope. I get thatGuillaume: on this kind of the next step, right? Because when you talk to these agents, like usually people are just writing to them and sometimes they'll this very clear, for instance, you are, you want to write code, but you are, you have a very clear idea of how you want the model to implement what you in mind.But so here you are able to spend a lot of time writing. So it's not really efficient on audio is really like a natural interface that is just not there yet, but I think it's just gonna be the place.Vibhu: How's it like building, serving, inferencing, like we see a lot about, it's very easy to take LMS off the shelf, serve them.Fine tuning, deploying. I know you guys have a whole you have Ford, you have a whole stack of customizing, deploying. Is there a lag in getting that. Like distribution channel. Are you helping? There is. So like prompting, lms, you can have them be concise, verbose, all that.They're built on LM backbones, these models. How do you see all that?Enterprise Deployment and PrivacyGuillaume: Yeah, I think this is a lot of what we're doing with our own customers. Very [00:18:00] often they come to us, so it's for different reasons. I think one reason is sometimes they have this lot of privacy concerns.They have this data that it's very sensitive. They don't want data to leave. The companies, they wanted to stay. Inside the company. So we have them deploy model in-house. So either on a, either on premise or on private cloud. So they're not worried that it's given to a third party on the there some leakage.Sometimes they have this kind of many companies have this different, sensitivity of data they have like sometimes channel chat can send it to the cloud has to stay there. So then it creates some kind of heterogeneous workflows where it's annoying. You cannot send some data to the cloud.This one you can, so here, when we actually deploy the model for them, they don't have this consideration. They are like not worried that, this is going to leak. Everything is much easier. So we help them basically do this on the, so it's one of the very proposition. But but the other is very often, when customers use this off the shelf close model, but very sad is that they are not leveraging, these data that have been collecting for four years or something for decades.So much data. Sometimes it's trillions of tokens of [00:19:00] data in a very specific domain. Their domain, which is data that you'll not find in the public, on the public internet. So data on which, like close model, we actually not have access to one, which that's going to be really good. So if they're using like closed source models are basically not benefiting from all these insights.All these data they have collected three years, they can always give it into the context that in France, but is never as good as if you actually train the modern analysis. So yes, that's basically what we help them to do. We actually provide them some purchase, basically what we announced at GTC this week.So we provide them with this, it's basically like a platform with a lot of tools to actually help them process data. Trained on that. Yeah, it's actually the same thing that we're using in the science team. So it's actually very better tested infrastructure, like a lot of efficient training cut base.For a quality pre-training like a fine tuning, even doing S-F-T-I-L. So we help them do this using the same tools as what our science team is building is using. So since it's tools that we've been using for two years now, it's really better tested. It's really sophisticated.So it's the same thing. We are giving to them, giving the company the same thing [00:20:00] that what are same still using internally actually build their own ai and it makes a really big difference. I think sometimes customers. And many in general don't realize how much better the model becomes when you fine tune it on your own data.And you can have a, your model is here. You start from there. You have a cross source model, which is sort here, but if you actually fine tune it can actually really go much further than this. And then you have a very big advantage. The model is trained on your entire company knowledge, so it knows everything.You don't have to feed like 10 K tokens of contact at every query. So it's it's much easier. It's a bit, I think using a closed source model is really sad because it basically puts. You are not leveraging all this data and you are going to be using the same model as all your old competitors when you're actually using, everything you have been collected for years, which is really valuable.So yeah. So we help basically customers do this. We have a lot of solution I mean deployed for engineers that go in the company that basically look at the problem customers are facing to look at what they're struggling to do what we should do to solve it. So we help them solve them together.So it's I think our approach is a bit different, but here. [00:21:00] Some of their companies and competitors, it's, we don't just release an endpoint on sale, do some stuff on top of that, or we don't just give a checkpoint. We really look very closely with customers. We look at the issues they have, we had them solve them.We really make some tailored solution for the client are facing. Some example are also going to be, sometime we have some customers. They really wanted to have a really good model, really performance on some, like Asian languages on the, if you take some of the shelf models, they can speak it, they can write in this language, but it's not amazing.This language would be like maybe zero 1% of the mixture. So it has been included during training, but very little. So what we did here is upgrade. We trained a new model for them, but so this language was 50% of the mix, so it's much, much stronger. It knows of the dialects, it knows the, so it's yeah.So it's some example of things we can do and it's really arbitrary, custom. I think you had some of their customers, for instance, they wanted some. They wanted some 3D model that can do audio with a very good function cable. So something you wanted to put in the car in particular, they wanted this to be offline because in a car you don't necessarily have access to internet.So [00:22:00] yeah. So here we can actually build the solutions. There is no like model out of the box on this. In the internet you have this very, you have this very general model generalist, like he's strong model. But for things like this, they always want at specific solutions and on some other reasons.Sometimes they come to us is because, like they, they experiment with some closed source model. They get some prototype. They're happy with what they build. They, it works well. They're happy with the performance, and then they want to go to production and then they analyze. But it's extremely expensive.You cannot push this. It's so then they come back to us on this. They can help us build the same thing as this, but using something much cheaper on here. And here we can sometime be something 10 x cheaper by just functioning a model and it'll be better OnPrem on their old server and also much cheaper as well.So yeah,swyx: that's the drop pitch right there. Take all themoney.Vibhu: And outside of that you do, we do put open wave models so people can do this themselves. I feel like not enough people go outta their way.swyx: They're not going to, they're gonna ask them to do it as the expert. IGuillaume: think initially we didn't know, [00:23:00] we wanted completely short at the beginning of the company because, I think our study was not exactly the same as what it is today, but what we underestimated initially is the complexity of deploying this model and connecting them to everything to be sure it has access to the company knowledge on the, and it was, yeah, on, we were seeing customers struggling with this, but it was even, that was three years ago and no, things are much more complicated because now you don't just have, text on SFT on a simple instruction following.You have reasoning like your agents, you have like tools. You have a multimodal audio, so it's much more complicated than before. And even back then it was hard for customers. So they really need, have some support and this is why actually providing like always some four D position as well. The processFine Tuning and Personalizationswyx: I'm curious is there also voice fine tuning that people do?Pavan: So in this forge we also have a say unified framework. And the hope is like the er speech to text that we released earlier this year. And even the ER chart that we released last year. And I think a big people, I think there's a big, rich ecosystem [00:24:00] of people fine tuning whisper, and people want the same thing with w so it's much stronger than Whisper.And yeah, the the platform offers that kind of fine tuning yeah, which could be any kind of fine tuning. Like for instance, even sometimes people want to support new languages to this, which are tail languages, which we hope to cover. Certain natively, but if there is a language where you data and you want to frank you, I think this is a good use case.Or the other use cases, you, it's the same language, like even English but it's in a very domain specific way.swyx: Yeah. Terminology, jargon, medical stuff.Pavan: Exactly. And also there's specific acoustic conditions like there's a lot of noise or the, and. The model will do decently in most conditions, but you can always make it better.And that those are some of the use cases where you can improve it e even further. And that's one good use case for this and for text to speech. We're just releasing it so we'll have support for that soon too. I think it's similar use case.Voice Personalization Pavan: It's little different the kind of things that you want to extend a [00:25:00] text to speech model to, which could be like voice personalization, voice adaptation for enterprises.Many enterprises need very specific kind of tone, very specific kind of like personality for this kind of voice. And all of those are like good use cases for fine tuning.swyx: This one I was gonna ask you, we never talked about cloning voice clothing here. How important is it, right?Like I can clone a famous person's voice. Okay. ButPavan: the main use case would be like for enterprise personalization, like enterprises need like a lot of customization. You don't want the same. Voice for all the enterprises. Each enterprise want a customized, specialized something which is representative both their brand and also their, I guess safety considerations and the use case I think the kind of thing that you would deploy as a empathetic assistant in the context of a healthcare domain would be very different from the kind of thing that would be in a customer support bot and would be different from like more conversational aspects.I think those are the. [00:26:00] Customizations you would expect from enterprise. And that's the main use case, at least from our side.Vibhu: My, my basic example is you don't want to call to customer services and have the same exact voice. It's just, it's gonna be weird.Long-Form Speech ModelsLong-Form Speech ModelsVibhu: But also on the technical side of this, so there's like a few things in TRO that I thought were pretty interesting.He's a big fan of this paper. Oh, he said very good paper. He said this is the best SR paper he's ever read. Yeah. I've hyped up this voice paper enough. We covered it. Somewhere, but a big thing. So Whisper is known for 32nd generation a 32nd processing. You extended this to 40 minutes. There was a lot of good detail in the paper about how this was done.Even little niches of how the padding is. So it's very much needed. You need to have that padding in there, the synthetic data generation around this. I'm wondering if you can share the same about the new speech to text, right? Text to speech. So how do you. How do you generate long form, coherent?How do you generate, how do you do that? And then any gems? Is there gonna be a paper?Pavan: Yeah. Yeah. They would be a technical report. Okay. Yeah. I think I could have a lot of details.Real-Time Encoder AdvancesPavan: But me I think the [00:27:00] summary of it, actually, some of the considerations in this paper were, because we started with the wipa encoder as the starting point, and now we have in-house encoders, like the bigger time model, for instance, which we released in January.Also release a technical report for that real time model as well, which is this dual stream architecture. It's an interesting architecture. You should check it out. And there we have a causal encoder and I don't think there's any strong, multilingual causal encoder out in the community. So we thought it's a good contribution.So that's one nice encoder there. Other people want to adapt. That's a good end code. And we train it from scratch. I think her. Post stack is now mature enough that we are able to train super strong ENC codes. And some of these considerations, like spatting and stuff, is a function of the Whisper ENC code.And now that we train encoders, inhouse the design concentrations are different.Scaling Context for TTSPavan: And for the question on text to speech, I think that's also leans onto the original auto aggressive decoder backbone. I think, it says very, almost identical considerations. I think the long context in it's not even long con, [00:28:00] so the model processes audio at 12.5 herds, so one second maps to like 12.5 tokens.So I think one minute is like 7.8 tokens. You can get like up to 10 minutes in eight K context window and get half an hour and 30 K context window. So that's and 30 2K context is something that's we are very comfortable training on. We can extend it even much longer. 1 48 K. Okay. You can naturally see how it can extend to even our long generations.Yeah. We need the. Like data recipe and the whole algorithm to work coherently enough through such long context. But the techniques are some way very similar to the text, long context modeling. And the key differences, it's just doing flow matching order regressively instead of a text open prediction.swyx: Okay. I think that was most, most of the sort of voice questions that we had. ButWhat Makes a Model SmallVibhu: I have a big question on Mr. Al, Mr. Small. So what is small? How do we define [00:29:00] small? What is this? What is this? I remember the days of Misal seven B on my laptop. The snuff fitting on my laptop. I could run it on the big laptop, butGuillaume: it's just additional.Question of terminology, like here what we did, baseball is north active parameters, but it's true. Really not give it another name, but yeah, we could have called it medium, but only, I,I suppose it's a model that we released mixture of experts. It's a model that combines different model before which we were doing the same, is that we had one model, general model for Israel. Doing instruction following, were like a separate model that was Devrel trial. So qu coding specify specific to code with another model for Reason Maal.So this were separate artifacts built by different team at trial on what we're doing is basically merging all of this. It was, you had pixel trial was the first vision model. We was like a separate model on the way we do things internally is that we have one team focus on one capability, build one model.On the means mature, mature enough, we decide to merge this into the [00:30:00] matrix. But here it was the first time we basically match all of this into one. But there are some other things we did at first time to merge time, for instance, like more capabilities or function coding I think would be, are, it's going to be much, much better in this trial, small platform.But but yeah, so it's our latest model on the working is,Vibhu: and yeah, key things is it's very sparse. Six, be active pretty efficient to serve. 2 56 K context. Yeah,Merging Capabilities vs Specialistsswyx: I think what's interesting is just this general theory of developing individual capabilities in different teams and then merging them.Where is this going gonna end up?Vibhu: Like we've seen the five things put together in this. Yeah. What are the next five teams?swyx: I think actually OpenAI has gone away from the original four Oh. Vision of the Omni model. This was what they were selling. All modalities and all modalities out.But I feel like you might do it.Guillaume: I think there's some mod where it's not competitive use, for instance for audio. For audio here, if you want to do transcription, I think it makes no sense to use a model. If you just want to trans tech it, it'll be very inefficient. If you want to do audio, you probably just want to be the [00:31:00] one VR 3D model performance essentiallyswyx: the same.It's going to be incredibly cheaper. So here, that's why we wantGuillaume: to have a separate but just does this. Yeah, I think the question is just, yeah. If you are to, to your model. By speech and you asking like a very complex questions on how you do this on the, just to cascade things. Do you want to put a d in a model that has like a one key around it?It's like a, not a competitive discussion, I think unaware if you doing into the direction, but that's possible. Of course. But yeah. But I think for us, the next capabilities we want to try to integrate into these models when we are going to be yes, like marketing or no reasoning better, I think more capabilities that people don't talk too much about, but at high bottom, I think for our customers in our, on different industries, for instance, things are around like a legal computer.I design all these things that is this males out of the box are to put at that. Because people, if you don't prioritize this, there is not like too benchmark on that. Butswyx: this done how toGuillaume: make this good and this just start to do the work. Extracting some that processing it [00:32:00] expression. So yeah.But we are offering the imagine to this.swyx: I think for voice. Yeah. The key thing I think over maybe like the last year or so with VO and gr Imagine and all these things is joining voice with video, right? Which people don't understand spatial audio because like most TTS is just oh, I'm speaking to a microphone in perfect studio quality.But when you have video, like the voice moves around.Pavan: That's true. The constitution was a little different in the sense that there it's like a a standalone artifact where you get the whole thing and you consume it. But in a conversational setting, it's a, you need the extreme low latency.swyx: Yeah,Pavan: streaming would be one of the primary concentrations.swyx: You can build a giant company just doing that, right? So you don't need to do the voice, but I was just know on the theme of merging modalities, that is something I, I am like, wow. Like I didn't, everyone up till, let's say mid last year was just doing these like pipelines of okay, we'll stitch a TTS model with a voice thing and a lip sync [00:33:00] thing and what have you.Nope. Just giant model. Yeah.Open Source MissionVibhu: I have a two part question. So one is, it's still open. It seems like open source is still very core to what you guys do and I just have to plug your paper. Jan 2024. This is the one trial of experts like. Very fundamental research on how to do good.Moes paper comes out very good paper for anyone. That's just side tangent. No.swyx: This thing caused, we bring back, eight by 22 was like the nuclear bomb for open source. I think it takes Shouldn be more seven B more. Yeah. Yeah. But this is a bigger opposite than me.Yeah. Yeah I don't remember this. I remember, I don't think it was January, right? It was like new reps it was, it dropped during new reps and everyone in Europes was December of 25th, I think. Yeah. The model was did as well.Vibhu: It's just a little update probably.swyx: Yeah. No, but you have a point to make.Vibhu: No, you gotta check that. But then, I just want to hear more broadly on open source for you guys, and when you had asked earlier [00:34:00] about what's next, what are the other, side tapes working on you. You put out Lean straw. This,swyx: it's not necessarily surprise. I was like, I don't, this doesn't fit my mental model or Misra.Guillaume: Yeah. First for open source in general, I think it's really something which looks to the January of the company. I think we started it per once, is we so we have open sourcing with, since the beginning and even before this. So before this, so me and Tim were at Meta, we released LA and I think what was really nice.To see that before this, for most researchers like universities, it was impossible to work on elements. There was no alien outside. And if you look at many of the techniques that were developed after, for instance, was open source all this post-training approaches like even DPOD, like preference optimization, all of this were done by people that had access to this portal.And it'll have been impossible to do without this. So it's really making sense, move faster. So we really want to contribute to this ecosystem. I think like the deep and also like very lot of impact. All these papers that are I think in the open source community are really helping the science community as a whole to move faster.So [00:35:00] we want contribute to this ecosystem. That's why we're releasing very detailed technical reports. So ma trial and our first reason model, and ation, lot of results, things that work, things that did not work as well. Think helpful on the, yeah, so for the audio model also to share a lot of details, share of them for real time model.And the, yeah, so we really want to continue this, basically belong to this community of people who share science. I think we really don't want to be, leading in a world where the smartest model, the best models are only behind, close doors. Only accessible to a shoe companies that we, as a power to decide we can use them on it.I think it's a scary future. We don't want to live in, we really want this model to be accessible to anyone that want. Intelligence to be used unaccessible by anyone who can use it. So yeah, so that's why we are pushing this mission and source model. Yeah. So not, so yeah, no strategy. So it's open source, not the first model, so not the best on the Yeah.Lean and Formal ProofsGuillaume: LIN trial I think is also one step into this direction. So it's yeah, a bit different than what we are usually releasing. But we have a small team internally [00:36:00] working on them. Formal proofing, formal math. So I think a subject we care about in general and we were working on reasoning. I think we started too early before doing reasoning without LMD is very hard, especially when you work with formal systems because the amount of data you have is negligible.It's addressable community of people writing like formal proofs. But the reason why we like it is because I think there is if you look at what people are doing with reasoning, is there, the problems that you can use. Are usually going to be problems where you can verify the output. So for instance, all this ai ME problem where the solution is a number between 100, like a thousand.So you can verify, compare this with a reference or it's an expression. You can actually compare the output expression generic with the reference. But there are many, most of them have problem and most of the reason problem. There is no like way to easily verify the solution. If the question is show that F is continuous, cannot compare in the reference, right?If it's a probe that this is true or probes is properties, there is no way to. You cannot act, simply verify the correctness of your proof. So it's hard to apply the, there is no referable reward here. So [00:37:00] what you could provide is of course, like a judge and judge that will look at your proof. But it's very hard and it's very, you could do certain, some reward hacking happening there.So it's difficult. You could provide like a reference proof, but then there are also many ways to prove the same thing. So if the model says give negative reward because it's a different poop, maybe it was still digit proof, just different. So it's not going to work well. What's nice with lean and with formal probing is that you don't have to worry about this whatsoever.We just,swyx: they're all function is largely compiles in lean is functionally the same. Exactly.Guillaume: It's like a problem if it compiles it's correct. It's very easy. And you can apply this and then you can,swyx: it's just way too small. So no human will actually go and do it.Guillaume: Yeah, that's exactly.It's the only people can do it. It's like a very small committee of people doing a PhD on that. So it's super small. And it's sad because it's actually very useful on not just mat, but also in software verification. So for instance, software verification today. So tiny market. Very few industries work on this and we need that.It's usually going to be like companies like building airplanes, air robotics,swyx: likeGuillaume: things [00:38:00] where they absolutely want to be sure. Life depend on this, but it's very rare that people formally verify the correctness of their software. But I think one of the reasons for this is simply that it's just hard to do.swyx: Are you think of TLA plus? It's the language that some people do for software verification? No. That people use in a ference, but but yeah, it's the reason I think why people don't use it more and why this industry is not as big as could be is because it's very hard. But now with cutting edges that are there, it's going to be very different.Guillaume: We're going to see much more of this. So I think yes, industry there is going to be much larger in the future that we, these models. So yeah. Here also anticipating this a little bit, we wanted to work on that because it's proving like a math theory and like a, essentially the same tools.swyx: Yeah.Reasoning Transfer and Agentsswyx: One of my theories is that because the proofs takes so long, it's actually just a proxy for long horizon reasoning and coherence and planning. Maybe a lot of people will say okay, it's for people who like math. It's for being okay. It's like a niche math language. Who cares? But actually, and you use this as part of your data mixture for [00:39:00] post-training and reasoning, actually, it might spike everywhere else.Yeah. And I think that's un under explored or no one's like really put out a definitive paper on how this generalizes.Guillaume: Yeah, absolutely. AndPavan: I think evenGuillaume: that's what we're seeing already. For instance, you should do some reasoning on math as then the American should do reason even.Yeah. In the early stage. So we, the, there is some transfer, some sort of emergence that happens. And I think some, it's also interesting, it's not just I think the topic in general, but it's, there is a lot of connection with this on including agents because. Sometimes the model can see like a three that it has to prove it's very complex, but then it can take the initiative to say, I'm going to prove this three lr.I'm going to suggest three Rs, and I'm going to in parallel prove each R. So three of them in parallel with sub agents, but I'm also going to prove them in theory and the three tool so you can do this also. Pretty interesting. You can, even if you fail to put one of the LeMar, you can actually, maybe you succeed to put the normal lema too, so you get some possible reward here.So it's a bit less Spartan issue, just get to zero one for the entire thing. [00:40:00] So it's pretty interesting. I think we can actually,Vibhu: yeah, it's also an interesting case just for specialized models in general, right? Like the cost thing you show is pretty interesting yeah, similar score wise, you are, thirty, seventy, a hundred fifty, three hundred bucks.Smaller.swyx: I think cost is a bit unfair, right? ‘cause this one is at like inference cost. It's always there on top with their margins on top of it. But, we don't know anything else, so we gotta figure it out.Vibhu: Okay.Next Frontiers in TrainingVibhu: I did wanna actually push on that more. Not on cost, but you mentioned about, okay, it's a great way to have verifiable long context reasoning.What are other frontiers that, I'm sure you guys are working on internally, there's a lot of push of people pushing back on pre-training. Scaling, RL pushing, compute towards having more than half of your training budget. All on rl. Where are you guys seeing the frontier of research in that?Guillaume: You mean theVibhu: just in foundation model training in the next, one thing that you guys do actually is you do fundamental research from the ground up, right? So you probably have a really good look at where you can [00:41:00] forecast this out.Guillaume: Yeah. I think for us we're still working a lot on the pre-training side.I think we are very far from situational, the pre-training. I think ML four preprinting will be like big step compared to everything we have done before. So we are pretty excited about this. And I think on the other side, I think now we have more and more to think about this algorithm that will actually support this very long trajectories.I think when it was, for instance, GRPO for it doesn't really work this any bit of policy. Which was okay initially because you are solving math problem that can be solved in like a few thousand tokens. So the model can alize them pretty quickly. So when you do your update, the model is never too far off.It's never too far off. But now when you are moving towards this kind of problems where certain takes hours, like six hours to get a reward, then your model is co pick places. So you have bi new infrastructure that supports this, but also new A, so now everything we're doing internally, we're trying to. Build some infra that we actually anticipate is what we have in six months, one now, which is this extremely no scenarios on the, I think when we started Missal, part of me and [00:42:00] we wanted to, is very nice under element where people are there, they can do research, they like with a lot of resources.So it was nice. I think things changed a lot when I think when J Pity came out. I think after that I think was. This one is same again. But but yeah, but it was nice. And I think we also want to work part of this descrip beforeswyx: coming to the end.Hiring and Team Footprintswyx: We're just, obviously, I think you guys are doing incredible work.You've, they are a very impressive vision for open source and for voice. What are you hiring for? What's the what are you looking for that you are trying to join the company?Guillaume: Yeah, so we are hiring a lot of people in our sense team. We're hiring, in all our offices. So we have a, our H two is in France in Paris.We have a small team in London. We like a team in Pato as well. Co we open some offices in in SAU, in Poland. So one in Zurich. We also like some presence in New York as well on Sooner one in San Francisco. So we all bit either way also like hiring remotely. So we're going the team trying to hire like very strong people.I think we want to stay, so the team is not. Instead of fairly small team. [00:43:00] But I think we want to keep it that way. ‘Cause we we find it quite efficient. So like a small team they agile so yeah.swyx: Okay.AI for Science Partnershipsswyx: Let's focus on science and the forward deployed. We actually are strong believers in science.We started the our new science pod that focuses specifically on the air for science. What areas do you think are the most promis.Guillaume: What we're pretty excited about right now, and something we have already started doing or that we'd probably be able to share more about this in a couple of months, is that we are exploring AI for science.And there are a lot of areas where we think that you could get some extremely promising buzz. If you were to apply AI in these domains. There are a lot of long inputs. You just have to find these domains where actually AI has not been yet applied, and it's usually hard to do because the people working in those domains don't necessarily know the capability of these models.They don't know. How I would just have to pair them with Yeah, exactly. Your researcher slashing, which is actually hard to do. But this matching, we're doing it naturally with our customers. So we have some company we are very closely with. So for instance, ISM Andreesen are one of our partners, so we're doing some research with them on their other, like tons of extremely interesting problems.Columns in physics, in [00:44:00] science matter science that they're essentially the only ones to work on. ‘cause they're doing something No, no one else is doing on the, yeah. So there are many domains where AI can actually revolutionize things. Just you have to think about it on you familiar with what can do or to apply it.So yeah, it's something where more modeling with our partners, with our customers sort AI for s, but.swyx: Yeah. Okay.Forward Deployed Skillsswyx: And then for deployed what it makes a good four deployed engineer, what do they need? Where do people fail?Guillaume: I think it's usually you need people that are very familiar with the tech and not necessarily with a lot of research expertise, but that are actually pretty good at using this model that can actually like that know how to do functioning, that know how to like, start some error pipeline.And it's it's not easy. It's something that mucus. Majority of companies will not be able to do this on their own. So here I think we need people that are, that like to solve problems that are accept solving some complex, very concrete problem. It's applied science basically.And yeah, so I think it's not too different. I think from the case you need in research because it's essentially you are trying to find solutions to problems that in [00:45:00] customers have not yet. So sometimes it's easy. Sometimes you're here to do the work. You have to like create synthetic data.Find some edge case. So it can be, yeah. Depends on the problem. But but yeah, you have to, I think it also a bit of patience on the be creative. I think very similar skill is Asian,Pavan: the diversity of the work they do. It always surprises me. It's it's, it goes all the way from the kind of stuff they encounter in industries.It's just very interesting. I think.swyx: Any fun like success anecdotes.Guillaume: Yeah, it can be actually training this small model on edge that just we do one specific thing can be like training some very large model without some specific languages as well. Making models really good at some tube use, like for instance, computer ID design, these kind of things.Is that pairing with vision as well? Yeah,Pavan: and the fact detection for chips or like in, in factories identifying things like it, the. Diversity could be anything where you can deploy these foundation models. So yeah the work to make it work in that specific setting, basically whatever it takes to make it like add value in that, by the way, workflow.Vibhu: Yeah. [00:46:00] And it goes across the stack, right? Like even just pulling up the website like.swyx: It's so broad on compute. It is so broad.Vibhu: We didn't even touch on if you have a coding CLI tool. One thing you guys were actually like, I think the first tool was agents, ral agents. You had the agent builder, you can serve it via API and all that.And I'm guessing forward deploy people.Guillaume: Yeah.Vibhu: Help build that out and stuff.Customer Feedback LoopGuillaume: It is also why we are, so we're doing many things, but I think that's also part of the value proposition that sometime know customers. They're always very. Extremely careful about their data and they don't want to, they don't like, trusting so many partners, trusting one partner for code, giving the data to another third party for like audios and another one.So they don't like this here. What they really like with our approach that we can help them on anything so they don't have to send the data to so many clouds. So yeah,swyx: I think that there can be many orders of magnitude more. F Ds then research scientists and they don't need your full experience, but they're still super variable to customersGuillaume: in practice.These two teams [00:47:00] are still quite intertwine, very often. Yeah. So first of all, they're using the same tools, the same data pipeline and everything on the, it's it's very helpful for the science team to get the feedback and the solution team ‘cause they can. Look at these customers are trying to do this.This is not working. It can really be show in the next version. Yeah. But this is basically a real world eval. Yeah, it's real world eval and it's not something, for instance, if you're just working in the lab, it's just ships model. But you don't do this work of for customers. You have no idea for whether your model is good at this H case.For instance, you even in year found this, right? So yeah, there is a very gap, big gap between the public benchmarks that are very like academic. OnPavan: the rare cases are just very diverse and in the specific concept of a customer, you can fine tune and make it like first evaluate, create a solid eval, benchmark, and then measure in the context of their, the kind of audio.Like for instance, one use case is literally just, there's the word for kids and they have to just say it out. It's a very specific thing. You're just saying one word and then you have to you, you'll grade the kid whether they did it right or not. It's [00:48:00] like R for, but so there're very diverse use cases and the idea is that they, the.Applied scientist engineer will go and make it better. And then from the learnings we incorporate it into the base model itself. So it's it's just better out of the box.Vibhu: Yeah. It's a good full circle system. Like the foundation model evals are all just proxies of what you really, you're never gonna have one that says it, it doesn't make sense for there to be, a one word transcription like that.It's not something you wanna fit on. Perfect.Wrap Up and Thanksswyx: Everyone should go check out everything that Michelle has to offer and try the TTS model, which will link in the show notes. But thank you so much for coming tha thanks. Such a stretch. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.latent.space/subscribe

Hustle And Flowchart - Tactical Marketing Podcast
The Next Wave - AI Tool Better Than OpenClaw? + NVIDIA'S $1T Prediction & AI Image Wars

Hustle And Flowchart - Tactical Marketing Podcast

Play Episode Listen Later Mar 26, 2026 82:00


This episode is from The Next Wave Podcast. Check out more episodes here: https://www.thenextwave.showIn this engaging and forward-thinking episode, Joe Fier and Matt Wolfe dive deep into the current and future landscape of AI tools, the staggering impact of NVIDIA on the tech world, and the fierce competition in AI image generation. The conversation covers exclusive insights from NVIDIA's GTC conference, the evolution of agentic AI, industry disruptions, and recent advancements in robotics and job automation. Whether you're an entrepreneur, tech enthusiast, or curious about where AI is leading us, this episode delivers valuable perspectives and hands-on tests of cutting-edge tools.Links Mentions:The Next Wave Podcast: https://www.thenextwave.showMatt Wolfe: https://www.youtube.com/@mreflow NVIDIA: https://www.nvidia.com/en-us/Jensen Huang: https://www.linkedin.com/in/jenhsunhuang/OpenClaw: https://openclaw.ai/NemoClaw: https://www.nvidia.com/en-us/ai/nemoclaw/Future Tools: https://futuretools.io/

聽天下:天下雜誌Podcast
【阿榕伯胡說科技Ep.72】3月科技大事,阿榕伯講給你聽:黃仁勳GTC 說了什麼?旺宏會衝300?伊朗能源危機怎麼看?

聽天下:天下雜誌Podcast

Play Episode Listen Later Mar 26, 2026 49:31


這集節目是3/20胡說科技Youtube直播內容,想看影像版歡迎點擊連結:https://pse.is/8vdf6v 3月科技大事,你最關心哪件事? 一、伊朗能源危機 二、輝達 GTC 三、記憶體大跌又大漲 【阿榕伯直播講給你聽】 蘋果發布新產品,MacBook Neo不到2萬,輝達GTC大會,黃仁勳又出來講了什麼? 三月國內外科技產業有什麼大事,代表什麼意義?透露什麼趨勢?

What's Next|科技早知道
GTC 2026:AI的下一个战场不是模型,而是「推理系统」| S10E04

What's Next|科技早知道

Play Episode Listen Later Mar 25, 2026 58:37


2026 年 3 月,英伟达年度开发者大会 GTC 在美国 San Jose 开幕。这一年的 GTC 气氛与往年明显不同——黄仁勋不再需要向市场证明 AI 的价值,因为 Agent 爆发和开源模型崛起已经让算力需求成为行业共识,Token 消耗量正在以百倍速度增长。 本期节目,Diane 在 GTC 现场为大家带来了第一手的观察,也专访了推理优化初创公司 Eigen AI 的联合创始人。Eigen AI 由三位 MIT 背景的创始人于 2024 年中创立,主攻开源大模型的推理加速与企业定制化部署。这次 GTC,他们的推理速度跑分直接登上了黄仁勋 Keynote 的大屏幕,是当前推理速度最快的团队之一。 节目里我们深入聊了为什么推理层正在成为 AI 行业最重要的竞争战场、GPU 和 LPU 各自在推理过程中扮演什么角色、英伟达斥资约 200 亿美元收购 Groq 背后的战略逻辑,以及当前 AI 应用的商业模式为何正在面临系统性挑战。 本期人物 丁教 Diane,「声动活泼」联合创始人、「科技早知道」主播 Di Jin,Co-founder at Eigen AI 主要话题 [00:11] 今年 GTC 最大的不同是什么? 黄仁勋状态明显更放松,不再需要向市场"推销" AI 的价值 Agent 爆发让 Token 消耗量指数级增长,算力需求已成行业共识 开源模型崛起打开了推理层的商业空间,这一层开始变得关键 [09:13] Eigen 是一家什么样的公司,在做什么? 三位 MIT 背景创始人,专攻模型压缩与推理加速 Post Training 帮企业定制模型,Inference 加速让模型跑得更快更便宜 GTC 开幕前两天完成技术突破,推理速度登上黄仁勋 Keynote 大屏幕 [13:24] 过去一年 AI 行业最大的结构性变化是什么? 模型训练层高度集中,GPU 成本比人才成本贵 10 到 100 倍,中小公司已基本出局 Reasoning(推理时扩展)成为新的性能提升路径,让固定模型通过多花算力输出更好结果 Agent 工作流让 Token 消耗量远超对话场景,推理层的优化价值随之暴增 [23:34] 英伟达为什么要花约 200 亿美元收购 Groq?GPU 和 LPU 各自擅长什么? AI 生成回答分两阶段:读懂问题(Prefill)适合 GPU 并行处理,逐字生成答案(Decoding)适合 LPU 串行提速 当前最快模型约每秒 1000 个 Token,Agent 场景未来可能需要每秒 10000 个,GPU 单独难以跨越这道坎 GPU 负责前段、LPU 接手后段,两者组合是目前长序列推理的最优解 [34:04] 推理优化的技术路径有哪些,分几个层次? 底层是 CUDA 算子优化,针对不同模型的矩阵计算特点做精细调整 中间层包括量化(降低数字精度)、剪枝(删除冗余专家模块)、投机解码(小模型预测 + 大模型验证) 最上层是调度与路由,核心是把请求打到存有对应 KV Cache 的 GPU 上,避免重复计算 [44:05] 推理优化怎么在速度、精度和成本之间做取舍? 完全不掉精度、少量掉精度、需要后训练恢复精度,三类方案对应不同客户需求 对话场景最看重 TTFT(第一个字的响应时间),Agent 场景更看重整体任务完成时间 语音交互场景存在天花板:模型再快也超不过人能听懂的速度,快到一定程度就没有意义了 [47:28] AI 应用的商业模式为什么正在出现系统性问题? SaaS 订阅制是历史遗留:以前软件边际成本接近零,现在每用一次 AI 都在真实烧钱 重度用户轻松"用穿"月度套餐,公司不得不限流,引发用户强烈反弹 更合理的方向是按任务完成量收费,但用户心理锚点还没有完成迁移,行业仍在震荡期 [53:52] 开源模型能追上闭源模型吗?推理层未来最大的机会在哪? 行业最大的非共识:开源模型到底能不能真正追上闭源,以及 AGI 算不算已经到来 推理层几乎只能服务开源模型,开源能力的拐点直接决定这个赛道的天花板 一旦开源模型达到拐点,Token 将像电力一样渗透各行各业,推理层的市场规模将彻底打开 黄仁勋的「AI 五层蛋糕」模型 名词解释 LPU(Language Processing Unit) Groq 公司研发的专用芯片,专为大语言模型的文字生成(Decoding)环节优化,通过把高带宽内存直接集成在芯片上,大幅提升了逐字生成的速度,但牺牲了通用性。 TPU(Tensor Processing Unit) 谷歌专为自身 AI 需求定制的芯片,性能强劲且价格相对便宜,但目前仅面向 OpenAI、Anthropic 等少数大型客户供货,缺乏开放的开发者生态。 Quantization(量化) 降低模型内部数字精度以节省存储和计算量的技术。好比把精确到小数点后 10 位的数字改写成精确到 2 位——计算量大幅下降,但对最终输出影响有限。精度从高到低依次为 FP32、BF16、INT8、INT4,越低效率越高,但掉点风险也越大。 Pruning(剪枝) 识别并删除模型中冗余参数或模块的技术。以 MoE 架构为例,模型内部有大量从未被有效训练的"伪专家",将其删除后模型精度几乎不受影响,但推理速度和效率显著提升。 Speculative Decoding(投机解码) 先用小模型快速"草拟"若干 Token,再让大模型批量验证并决定是否采纳的加速技术。当草稿被采纳的概率足够高时,整体推理速度可提升 50% 以上。 KV Cache(键值缓存) AI 在生成回答过程中,将对前文的"理解结果"缓存起来,避免每次都重新读取和计算全部历史内容。合理调度 KV Cache 是 Agent 场景下降低延迟和成本的关键技术之一。 MoE(Mixture of Experts,专家混合架构) 模型内部由多个"专家"子模块组成,每次推理只激活其中最匹配当前任务的少数几个。DeepSeek、Qwen 等主流开源模型均采用此架构,可在维持大参数量的同时显著降低实际计算开销。 SLA(Service Level Agreement,服务水平协议) 对服务质量的量化约定,例如"首字响应时间不超过 300 毫秒"或"每秒至少输出多少个 Token"。推理层的大多数技术决策,都是围绕在成本约束下满足客户 SLA 要求来展开的。 TTFT(Time to First Token,首字时延) 从用户发出请求到收到第一个输出字符的时间间隔。对话类产品中这一指标最为关键,直接影响用户对系统响应速度的主观感受。 「Knock Knock 世界」 上周「Knock Knock 世界」更新了「数字收藏」话题:一段视频、一个表情为什么也能成为博物馆的收藏品?点击这里收听节目

The Next Wave - Your Chief A.I. Officer
AI Tool Better Than OpenClaw? + NVIDIA'S $1T Prediction & AI Image Wars

The Next Wave - Your Chief A.I. Officer

Play Episode Listen Later Mar 25, 2026 43:56


Get Matt's AI playbook: https://clickhubspot.com/kfcr Episode 102: Is NVIDIA really the “sun” at the center of the AI universe? Host Matt Wolfe (https://x.com/mreflow) and Joe Fier (https://www.youtube.com/@joefier) break down everything you need to know from NVIDIA's recent GTC conference, the hottest new AI tools for business and marketing, and the changing landscape of AI data, agents, and robotics. This episode dives deep into the explosive potential of NVIDIA's AI roadmap, why Jensen Huang thinks chip sales will hit $1 trillion, and how accessible agent tools like OpenClaw and NemoClaw could change everything for everyday users and enterprises. Plus, Matt Wolfe and Joe Fier explore the rise of data-for-hire side hustles like DoorDash Tasks, where humans help train AI in the real world, and the jaw-dropping athletic skills of the newest generation of robotics. Whether you're wondering where the money and innovation are flowing next—or concerned about the privacy, data, and future job market in the age of AI—it's all here in this packed, must-hear “special” episode. Check out The Next Wave YouTube Channel if you want to see Matt and Nathan on screen: https://lnk.to/thenextwavepd — Show Notes: (00:00) NVIDIA's Future and Growth (06:31) OpenClaw: AI Accessible to All (08:56) AI Compute Shifting to Inference (12:39) Accelerating AI Thinking Time (15:20) Nemo Claw: AI Assistant Revolution (19:00) Stock Buybacks Boost Value (24:11) NVIDIA's Expansive Industry Influence (28:34) Future Economy: UBI or Data Payment? (29:38) Data Privacy and AI Advertising (32:45) CAPTCHA Origins and Duolingo (38:18) Robots: Cool, But Not Smart (40:18) Farewell and Thank You — Mentions: Joe Fier: https://www.youtube.com/@joefier NVIDIA: https://www.nvidia.com/en-us/ Jensen Huang: https://www.linkedin.com/in/jenhsunhuang/ OpenClaw: https://openclaw.ai/ NemoClaw: https://www.nvidia.com/en-us/ai/nemoclaw/ Future Tools: https://futuretools.io/ Get the guide to build your own Custom GPT: https://clickhubspot.com/tnw — Check Out Matt's Stuff: • Future Tools - https://futuretools.beehiiv.com/ • Blog - https://www.mattwolfe.com/ • YouTube- https://www.youtube.com/@mreflow — Check Out Nathan's Stuff: Newsletter: https://news.lore.com/ Blog - https://lore.com/ The Next Wave is a HubSpot Original Podcast // Brought to you by Hubspot Media // Production by Darren Clarke // Editing by Ezra Bakker Trupiano

The AI with Maribel Lopez (AI with ML)
NemoClaw, OpenClaw, and the Real Reason Enterprises Haven't Deployed AI Agents Yet

The AI with Maribel Lopez (AI with ML)

Play Episode Listen Later Mar 25, 2026 15:48


NVIDIA's NemoClaw adds enterprise security to OpenClaw. What it does, what it doesn't, and what CIOs should do before deploying.FULL SHOW NOTESOpenClaw became the fastest-growing open-source project in history. Enterprise buyers watched from the sidelines — not because the technology wasn't useful, but because an autonomous agent with access to corporate file systems, credentials, and external communication channels is a governance and security problem that no one had solved at the enterprise level.At NVIDIA's GTC 2026 conference, Jensen Huang announced NemoClaw: a reference stack that adds enterprise security controls to OpenClaw. In this solo episode, Maribel Lopez breaks down what NemoClaw actually does, why the SaaS partner ecosystem matters as much as the technology itself, and where the hype is running ahead of the reality.WHAT WE COVER•       Why OpenClaw created a shadow IT problem before NemoClaw existed•       What OpenShell, the Privacy Router, and Nemotron models actually do for enterprise buyers•       Why Salesforce, ServiceNow, SAP, Cisco, and CrowdStrike being in the ecosystem matters•       The hardware dependency NVIDIA's marketing glosses over•       Why “working with NVIDIA” and “ready to deploy” are not the same thing•       The three questions every CIO should answer before touching any of this TIMESTAMPS00:00  —  Why enterprise IT teams were watching OpenClaw from the sidelines01:45  —  What OpenClaw is and why it created an enterprise security problem04:00  —  What NemoClaw actually does: OpenShell, Privacy Router, Nemotron06:30  —  The SaaS ecosystem: Salesforce, ServiceNow, SAP, Cisco, CrowdStrike08:30  —  Where the hype is ahead of the reality10:15  —  Three questions CIOs should answer before deployingRESOURCES MENTIONED•       NemoClaw announcement and NVIDIA Agent Toolkit: build.nvidia.com•       Full written analysis: NemoClaw Brings Enterprise-Grade Security Controls to OpenClaw — lopezresearch.com•       NVIDIA GTC 2026 Jensen Huang keynoteABOUT THIS PODCASTAI with Maribel Lopez covers enterprise AI adoption, agentic systems, AI governance, and AI-driven customer experience. Maribel Lopez is founder and principal analyst at Lopez Research, a technology research and strategy firm.Subscribe on Apple Podcasts, Spotify, or your platform of choice.KEYWORDSenterprise AI agents, agentic AI security, NemoClaw NVIDIA, OpenClaw enterprise deployment, AI agent governance, enterprise AI strategy, AI governance enterprise, agentic AI risks

Options Boot Camp
Options Boot Camp 383: Weekend Risk and Dangerous Donkeys

Options Boot Camp

Play Episode Listen Later Mar 25, 2026 23:27


Is the "Wheel Strategy" making you nervous in a high-volatility market? In this episode, Mark Longo and Dan Passarelli field listener questions about navigating black swan fears, the "Greeks" you should watch when the market moves fast, and whether the AI narrative in Nvidia still has legs. Dan also prepares for his upcoming trip to Greece by getting some questionable advice from Mark about riding donkeys in Santorini and ordering "Euros" at the local taverna. Key topics covered in this episode: The Wheel Strategy vs. Credit Spreads: Is it time to cap your downside when the VIX hits 22? Vega and Gamma Focus: How your attention should shift between Greeks in a fast-moving market. Nvidia (NVDA) Outlook: Dealing with underwater bull call spreads after the GTC conference. 0DTE Expansion: Will we see more zero-day contracts added to the roster this year? UVIX vs. SVIX: A deep dive into leveraged and inverse VIX products and how to use them to hedge "weekend risk."

The Options Insider Radio Network
Options Boot Camp 383: Weekend Risk and Dangerous Donkeys

The Options Insider Radio Network

Play Episode Listen Later Mar 25, 2026 23:27


Is the "Wheel Strategy" making you nervous in a high-volatility market? In this episode, Mark Longo and Dan Passarelli field listener questions about navigating black swan fears, the "Greeks" you should watch when the market moves fast, and whether the AI narrative in Nvidia still has legs. Dan also prepares for his upcoming trip to Greece by getting some questionable advice from Mark about riding donkeys in Santorini and ordering "Euros" at the local taverna. Key topics covered in this episode: The Wheel Strategy vs. Credit Spreads: Is it time to cap your downside when the VIX hits 22? Vega and Gamma Focus: How your attention should shift between Greeks in a fast-moving market. Nvidia (NVDA) Outlook: Dealing with underwater bull call spreads after the GTC conference. 0DTE Expansion: Will we see more zero-day contracts added to the roster this year? UVIX vs. SVIX: A deep dive into leveraged and inverse VIX products and how to use them to hedge "weekend risk."

PC Perspective Podcast
Podcast #860 - DLSS 5 Reaction, Arrow Lake Refresh, New Ryzen Rumor, MSI's DDR4 Mobo + MORE!

PC Perspective Podcast

Play Episode Listen Later Mar 23, 2026 76:10


Back in the podcast booth this week!  Enjoy the DLSS takes, noticing that SSD vendors are having a great year, there are some actual good news on Copilot, and fake ram kits are a thing, AMD has CPU refresh rumors, and a Windows "degraded" security state is coming soon!0:00 Intro00:37 Patreon01:41 Food with Josh03:01 DLSS 517:18 Intel Arrow Lake refresh20:36 AMD Ryzen refresh rumor21:31 NVIDIA dGPU market share dominance23:52 GTC in spaaaaace29:03 SSD vendors had an amazing quarter for some reason32:02 Some good news for Copilot 365 victims36:09 Antec and Noctua collaborate on a case37:46 Impress your friends with TWO sticks of DDR539:06 MSI plans more DDR4 motherboards40:33 Podcast sponsor - Zapier42:12 (In)Security Corner55:42 Gaming Quick Hits1:03:32 Picks of the Week1:13:03 Outro ★ Support this podcast on Patreon ★

FView Friday
AI 时代,最应该做的反而是归纳与总结

FView Friday

Play Episode Listen Later Mar 21, 2026 166:24


本期嘉宾:彭林、十天、森森、蓝白本期节目的主要内容有:· 00:01:03 -- 苹果无预警发布 AirPods Max 2· 00:09:30 -- 等四年终于用上,国行 Apple Watch 房颤历史正式上线· 00:14:22 -- 苹果游戏展示会· 00:37:11 -- 21.99 万元起,新一代小米 SU7 正式发布· 00:53:41 -- 9999 元起,OPPO Find N6 正式发布· 01:08:35 -- Nothing Phone (4a) 系列手机国内发售· 01:21:39 -- 小米深夜发布三款模型· 01:37:03 -- 小米笔记本 Pro 14 体验· 01:46:03 -- 索尼发布首款 LOFIC 传感器 IMX908· 01:52:09 -- 英伟达 GTC 主题演讲· 02:10:53 -- 「3·15 晚会」曝光AI大模型投毒· 02:15:25 -- 任天堂 Switch 2 升级新增「官方超频」模式· 02:17:25 -- 追觅芯际穿越「瑶台」算力基站发射成功· 02:17:55 -- 面壁智能发布「龙虾盒」EdgeClaw Box:给「龙虾」装上安全大脑· 02:20:00 -- 微软电脑管家上线「一键卸载龙虾」功能· 02:22:22 -- 乐天「日本最强 AI」塌房:扒开代码全是 DeepSeek,还删了开源协议· 02:24:53 -- 闲聊环节(标题在最后)每周五晚 8 点,爱否直播间,我们一起开心聊天

Tech News Weekly (MP3)
TNW 429: DarkSword Puts Hundreds of Millions at Risk - The Internet's Reaction to NVIDIA's DLSS 5

Tech News Weekly (MP3)

Play Episode Listen Later Mar 19, 2026 68:24


Jennifer Pattison Tuohy of The Verge joins Mikah Sargent this week on Tech News Weekly! IKEA's smart home products are not quite there yet. NVIDIA unveiled DLSS 5 at GTC 2026 and faced backlash from within the gaming community. A powerful iPhone-hacking technique has been discovered to take over devices running iOS 18. And some updates on the probes into Tesla's and hands-free driving systems. Jennifer shares her experience using IKEA's newest smart home products utilizing the Matter-over-Thread protocol, expressing her concerns and frustrations with trying to connect them to any smart home platform. Mikah talks about Nvidia's DLSS 5 that was unveiled at GTC 2026, and the criticism & memes it has faced since its unveiling. Andy Greenberg of WIRED joins the show to talk about a tool that has been discovered, called DarkSword, that can hack into millions of iPhones running iOS 18. And Mikah chats about hands-free driving technology in vehicles and NHTSA's probe into Tesla and Ford's hands-free driving systems. Hosts: Mikah Sargent and Jennifer Pattison Tuohy Guest: Andy Greenberg Download or subscribe to Tech News Weekly at https://twit.tv/shows/tech-news-weekly. Join Club TWiT for Ad-Free Podcasts! Support what you love and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Join today: https://twit.tv/clubtwit Sponsors: outsystems.com/twit hipebl.ai

The Future of Work With Jacob Morgan
NVIDIA CEO Says Leaders Lack Imagination, Cognizant's $4.5T Warning, & The Case Against the AI Apocalypse

The Future of Work With Jacob Morgan

Play Episode Listen Later Mar 19, 2026 44:09


March 19, 2026: Jensen Huang had one of the biggest weeks in tech at Nvidia's GTC — but his sharpest line wasn't about chips. When asked why companies are laying off workers, he said simply: because they're out of imagination. We unpack what that means, plus his surprise take on compensation from the All-In podcast. Then Cognizant drops a bombshell update to its 2023 workforce study: 93% of jobs impacted by AI, $4.5 trillion in labor shifting to machines, six years ahead of schedule. Their own words: "We underestimated the technology." But two CEOs are pushing back on the doom narrative — Uber co-founder Travis Kalanick says humans will be "super fine" until AGI arrives, and Tech Mahindra CEO Mohit Joshi argues the demand for human labor isn't going anywhere, and has the data to back it up. We close with JPMorgan Chase's 2026 tech trends report and the concept quietly reshaping what leaders actually do: context engineering. Watch the full episode on YouTube ---------- Start your day with the world's top leaders by joining thousands of others at Great Leadership on Substack. Just enter your email: ⁠⁠https://greatleadership.substack.com/ Stop patching problems and start designing an intentional workplace. The 8 Laws of Employee Experience gives you the how. Order your copy: 8EXlaws.com

Tech News Weekly (Video HI)
TNW 429: DarkSword Puts Hundreds of Millions at Risk - The Internet's Reaction to NVIDIA's DLSS 5

Tech News Weekly (Video HI)

Play Episode Listen Later Mar 19, 2026 68:24


Jennifer Pattison Tuohy of The Verge joins Mikah Sargent this week on Tech News Weekly! IKEA's smart home products are not quite there yet. NVIDIA unveiled DLSS 5 at GTC 2026 and faced backlash from within the gaming community. A powerful iPhone-hacking technique has been discovered to take over devices running iOS 18. And some updates on the probes into Tesla's and hands-free driving systems. Jennifer shares her experience using IKEA's newest smart home products utilizing the Matter-over-Thread protocol, expressing her concerns and frustrations with trying to connect them to any smart home platform. Mikah talks about Nvidia's DLSS 5 that was unveiled at GTC 2026, and the criticism & memes it has faced since its unveiling. Andy Greenberg of WIRED joins the show to talk about a tool that has been discovered, called DarkSword, that can hack into millions of iPhones running iOS 18. And Mikah chats about hands-free driving technology in vehicles and NHTSA's probe into Tesla and Ford's hands-free driving systems. Hosts: Mikah Sargent and Jennifer Pattison Tuohy Guest: Andy Greenberg Download or subscribe to Tech News Weekly at https://twit.tv/shows/tech-news-weekly. Join Club TWiT for Ad-Free Podcasts! Support what you love and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Join today: https://twit.tv/clubtwit Sponsors: outsystems.com/twit hipebl.ai

All TWiT.tv Shows (MP3)
Tech News Weekly 429: DarkSword Puts Hundreds of Millions at Risk

All TWiT.tv Shows (MP3)

Play Episode Listen Later Mar 19, 2026 68:24 Transcription Available


Jennifer Pattison Tuohy of The Verge joins Mikah Sargent this week on Tech News Weekly! IKEA's smart home products are not quite there yet. NVIDIA unveiled DLSS 5 at GTC 2026 and faced backlash from within the gaming community. A powerful iPhone-hacking technique has been discovered to take over devices running iOS 18. And some updates on the probes into Tesla's and hands-free driving systems. Jennifer shares her experience using IKEA's newest smart home products utilizing the Matter-over-Thread protocol, expressing her concerns and frustrations with trying to connect them to any smart home platform. Mikah talks about Nvidia's DLSS 5 that was unveiled at GTC 2026, and the criticism & memes it has faced since its unveiling. Andy Greenberg of WIRED joins the show to talk about a tool that has been discovered, called DarkSword, that can hack into millions of iPhones running iOS 18. And Mikah chats about hands-free driving technology in vehicles and NHTSA's probe into Tesla and Ford's hands-free driving systems. Hosts: Mikah Sargent and Jennifer Pattison Tuohy Guest: Andy Greenberg Download or subscribe to Tech News Weekly at https://twit.tv/shows/tech-news-weekly. Join Club TWiT for Ad-Free Podcasts! Support what you love and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Join today: https://twit.tv/clubtwit Sponsors: outsystems.com/twit hipebl.ai

M觀點 | 科技X商業X投資
EP287. 輝達 GTC 發表重點、OpenAI 砍掉支線、Agent 外殼大戰 | M觀點

M觀點 | 科技X商業X投資

Play Episode Listen Later Mar 19, 2026 86:29


《AI 百用百科|生產力系列》 專為職場工作者設計,學完直接用在工作上,成為真正的 AI 高手 不只是自己會用,更能為團隊建立一套「可持續運行」的工作系統。 — 課程特色 — ✓ 情境導向|直接對應工作場景,教你用 AI 解決真實工作任務 ✓ 手把手教學|完整示範操作流程,從思路到實際成果一步一步帶著你走 ✓ 下課即用|提供完整自動化流程模板,回到工作崗位就能馬上應用 生產力系列共 8 堂課,從工作任務出發,打造全方位的應用指南,收錄 8 大職場常見情境,讓你把時間從重複性工作中省下來,回到真正重要的事。 會議記錄統整|名單篩選寄發 大量資料整理|專業簡報製作 行銷週報生成|信件語意分流 自動排班規劃|合約風險辨識 ⏰ 預購優惠到 4/1 截止,現在入手最划算!

Tech News Weekly (Video LO)
TNW 429: DarkSword Puts Hundreds of Millions at Risk - The Internet's Reaction to NVIDIA's DLSS 5

Tech News Weekly (Video LO)

Play Episode Listen Later Mar 19, 2026 68:24


Jennifer Pattison Tuohy of The Verge joins Mikah Sargent this week on Tech News Weekly! IKEA's smart home products are not quite there yet. NVIDIA unveiled DLSS 5 at GTC 2026 and faced backlash from within the gaming community. A powerful iPhone-hacking technique has been discovered to take over devices running iOS 18. And some updates on the probes into Tesla's and hands-free driving systems. Jennifer shares her experience using IKEA's newest smart home products utilizing the Matter-over-Thread protocol, expressing her concerns and frustrations with trying to connect them to any smart home platform. Mikah talks about Nvidia's DLSS 5 that was unveiled at GTC 2026, and the criticism & memes it has faced since its unveiling. Andy Greenberg of WIRED joins the show to talk about a tool that has been discovered, called DarkSword, that can hack into millions of iPhones running iOS 18. And Mikah chats about hands-free driving technology in vehicles and NHTSA's probe into Tesla and Ford's hands-free driving systems. Hosts: Mikah Sargent and Jennifer Pattison Tuohy Guest: Andy Greenberg Download or subscribe to Tech News Weekly at https://twit.tv/shows/tech-news-weekly. Join Club TWiT for Ad-Free Podcasts! Support what you love and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Join today: https://twit.tv/clubtwit Sponsors: outsystems.com/twit hipebl.ai

Wall Street Unplugged - What's Really Moving These Markets
What the hot PPI means for the market

Wall Street Unplugged - What's Really Moving These Markets

Play Episode Listen Later Mar 18, 2026 61:44


The PPI came in hot: What it means for interest rates… Why the market NEEDS a rate cut… Key takeaways from Nvidia's (NVDA) GTC conference… Is NVDA a buy at current levels? … And the latest massive tech layoffs. In this episode: Happy late St. Patrick's Day! (And my Irish car bomb story) [0:15] The PPI came in hot: What it means for interest rates [5:30] Why the market NEEDS a rate cut [7:57] Key takeaways from Nvidia's GTC conference [22:02] Is NVDA a buy at current levels? [24:47] Is AI to blame for the massive tech layoffs? [38:14] My pick for the March Madness champion [48:56] Did you like this episode? Get more Wall Street Unplugged FREE each week in your inbox. Sign up here: https://curzio.me/syn_wsu Find Wall Street Unplugged podcast… --Curzio Research App: https://curzio.me/syn_app --iTunes: https://curzio.me/syn_wsu_i --Stitcher: https://curzio.me/syn_wsu_s --Website: https://curzio.me/syn_wsu_cat Follow Frank… X: https://curzio.me/syn_twt Facebook: https://curzio.me/syn_fb LinkedIn: https://curzio.me/syn_li

Market Mondays
MM #301: The Next Market Crash? Nvidia's Future, Oil To $100? & Best Assets To Buy Now

Market Mondays

Play Episode Listen Later Mar 17, 2026 107:59


Time Stamps ⏰00:07—Investing Fact Of The Week19:00—Nvidia's GTC Outlook28:00—Time To Sell Nvidia?36:00—Is The Crash Coming?46:00—Top Assets For The Next 12 Months?52:00—Commodities54:00—Robinhood Stock Outlook59:00—Invest Fest Sneak Peek1:05:00—4 Stocks You Hate1:16:00—Target Boycott Fiasco1:30:00—Will Oil Go Above $100?1:38:00—FICO1:40:00—2026 Market CycleIn this episode of Market Mondays, we break down the biggest stories shaping the market right now. From Nvidia's GTC conference and the future of AI, to the possibility of a market crash, oil potentially hitting $100, and the assets investors should be watching over the next 12 months.We also discuss the outlook for Robinhood, the growing conversation around commodities, and four stocks we are not fans of right now. Plus, we address the Target boycott controversy, take a look at the upcoming 2026 market cycle, and give a sneak peek at what's coming for Invest Fest.If you want to stay ahead of the market and understand where the opportunities are in the current economic environment, this episode is packed with insights for investors and entrepreneurs alike.#MarketMondays #StockMarket #Investing #Nvidia #AIStocks #StockMarketNews #FinancialEducation #InvestingTips #Robinhood #Commodities #OilPrices #InvestFest #EarnYourLeisure #WealthBuildingSupport this podcast at — https://redcircle.com/marketmondays/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

Squawk on the Street
SOTS 2nd Hour: Cramer Interviews Nvidia CEO, Apollo's Chief Economist, & Iran's Fed Impact 3/17/26

Squawk on the Street

Play Episode Listen Later Mar 17, 2026 43:48


Carl Quintanilla, Sara Eisen, and David Faber kicked off the show with fresh data, tensions within the Fed's dual mandate ahead of tomorrow's rate decision, and more on Iran's market impact before a wide-ranging interview you don't want to miss - Nvidia CEO Jensen Huang, alongside Jim Cramer at the company's GTC conference out in California. Hear the man himself break down Nvidia's staggering $1T forecast, AI demand, and more.  Elsewhere in the hour: what to expect out of the Fed tomorrow - according to CNBC's exclusive Fed Survey results, and Apollo Global's Chief Economist Torsten Slok (who says there'll be no cuts tomorrow - or this year).  Squawk on the Street Disclaimer Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

AI For Humans
NVIDIA's Jensen Huang Wants It All (GTC 2026)

AI For Humans

Play Episode Listen Later Mar 17, 2026 22:23


Jensen Huang just stood on stage and said $1 trillion. He wasn't joking. NVIDIA's GTC 2026 keynote was a masterclass in flexing, and we're breaking down every layer of the cake. We walk through Jensen Huang's massive GTC 2026 keynote, from NVIDIA's $1 trillion business projection to the inference inflection point that's reshaping the entire AI industry. We dig into DLSS 5 and why AI-powered neural rendering is about to change gaming forever (sorry, gamers), NVIDIA's deep integration with OpenClaw and the launch of NemoClaw for enterprise agents, chips in space, and what it all means when every company becomes an agentic-as-a-service company. Plus the Dwarkesh podcast with Dylan Patel on the real bottlenecks in compute that nobody's talking about. JENSEN HUANG SAID ONE TRILLION DOLLARS AND DIDN'T BLINK. WE BLINKED. PS, we're now coming to you TWICE a week (both a little shorter). Come to our Discord: https://discord.gg/muD2TYgC8f Join our Patreon: https://www.patreon.com/AIForHumansShow AI For Humans Newsletter: https://aiforhumans.beehiiv.com/ Follow us for more on X @AIForHumansShow Join our TikTok @aiforhumansshow To book us for speaking, please visit our website: https://www.aiforhumans.show/   // Show Links // NVIDIA GTC 2026 Full Keynote with Jensen Huang https://www.youtube.com/live/jw_o0xr8MWU?si=VZAIG3E7vuUCwz6N DLSS 5: Breakthrough in Visual Fidelity for Games https://www.nvidia.com/en-us/geforce/news/dlss5-breakthrough-in-visual-fidelityfor-games/ DLSS 5 Official Trailer https://youtu.be/dJACkKbN-Eo?si=fIJvsV52---bOyTr Digital Foundry Deep Dive on DLSS 5 https://youtu.be/4ZlwTtgbgVA?si=g8TMgNlOWknKnqHo Good Til' Cancelled: The GTC Game https://x.com/SAlexashenko/status/2033585849586331985?s=20 Dwarkesh Podcast: Dylan Patel on Compute Bottlenecks and Chips https://youtu.be/mDG_Hx3BSUE?si=YnLEIVhsaCpdVQgi  

CNBC's
Nvidia's GTC Conference Underway… And When To Buy The Dip 3/16/26

CNBC's "Fast Money"

Play Episode Listen Later Mar 16, 2026 43:26


All eyes on Nvidia, as the chip giant kicks off its GTC conference. The next-gen GPU's investors are waiting to hear about, and what it means for the stock's next move. Plus Stocks rallying to start the week, as oil pulls back on the latest developments out of Iran. What a top market strategist needs to see before buying the dip, and the areas of the market where he'd put new money to work. Fast Money Disclaimer Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The AI Breakdown: Daily Artificial Intelligence News and Discussions

The AI discourse is absolutely frenetic right now — everything from Karpathy's misinterpreted jobs visualization to a viral dog cancer cure story that's both less and more than it seems. NLW's argument: we're in AI's Second Moment, the agentic equivalent of the original ChatGPT shock, but with bigger capabilities, billions more people in the conversation, higher economic stakes, and an industry that's had three years to get worse at explaining itself. In the headlines: a preview of NVIDIA's GTC, SEC filings quietly listing AI agents as a material risk, and ByteDance shelving its video model over copyright disputes.Learn more about AGENT MADNESS: Our 64-Bracket tournament to find the coolest Agent of 2026 ⁠⁠⁠⁠https://www.agentmadness.ai/⁠⁠⁠⁠Brought to you by:KPMG – Agentic AI is powering a potential $3 trillion productivity shift, and KPMG's new paper, Agentic AI Untangled, gives leaders a clear framework to decide whether to build, buy, or borrow—download it at ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.kpmg.us/Navigate⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Mercury - Modern banking for business and now personal accounts. Learn more at ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://mercury.com/personal-banking⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠AIUC-1 - Get your agents certified to communicate trust to enterprise buyers - ⁠⁠⁠⁠⁠https://www.aiuc-1.com/⁠⁠⁠⁠⁠Blitzy - Want to accelerate enterprise software development velocity by 5x? ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://blitzy.com/⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠AssemblyAI - The best way to build Voice AI apps - ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://www.assemblyai.com/brief⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Robots & Pencils - Cloud-native AI solutions that power results ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://robotsandpencils.com/⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠The Agent Readiness Audit from Superintelligent - Go to ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://besuper.ai/ ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠to request your company's agent readiness score.The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://pod.link/1680633614⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Our Newsletter is BACK: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://aidailybrief.beehiiv.com/⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Interested in sponsoring the show? sponsors@aidailybrief.ai

Halftime Report
Navigating the Energy Market Turmoil 3/16/26

Halftime Report

Play Episode Listen Later Mar 16, 2026 45:32


David Faber and the Investment Committee debate how to trade oil and the market as turmoil in the energy sector grows. CNBC's Brian Sullivan joins us with the latest comments from Treasury Secretary Scott Bessent. Plus, CNBC's Kristina Partsinevelos joins us to discuss the latest news out of San Jose, California, where Nvidia is set to kick off its annual GTC event. The Committee debate how to trade the company ahead of the Jensen Huang's keynote speech. And later, the desk debate retail investors abandoning private credit and what it means for the sector.  Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Rob Black and Your Money - Radio
Market Bounces To Start The Week

Rob Black and Your Money - Radio

Play Episode Listen Later Mar 16, 2026 20:17


Stocks rose while oil prices pulled back, Meta reports that it is planning to lay off more than 20 percent of its workforce and Nvidia shares rose ahead of its GTC conference, Next Rob Black event is Pints and Portfolios in Sunnyvale on Saturday April 18th 11:30am to 1:30pm sign up for exact location

CNBC's
Federal Judge Blocks Subpoena In Fed Reserve Probe… And Counting Down to Nvidia's GTC 3/13/26

CNBC's "Fast Money"

Play Episode Listen Later Mar 13, 2026 43:35


A fresh wave of semi catalysts lining up for next — from Nvidia's GTC event to Micron earnings and an AWS–Cerebras tie-up. What Fast Money Friend Gene Munster is watching, and what to expect from Nvidia's CEO Jensen Huang when he takes the stage. Plus Jefferies' David Zervos joins us with a simple message for investors: “Don't panic,” as traders weigh inflation risks, Meta's reported AI delay, and Boeing's push to fix wiring issues. Fast Money Disclaimer Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.