Podcasts about Codex

  • 1,560 PODCASTS
  • 5,054 EPISODES
  • 1h 12m AVG DURATION
  • 2 DAILY NEW EPISODES
  • Jun 17, 2026 LATEST
Codex

POPULARITY (chart, 2019 to 2026)


Latest podcast episodes about Codex

Adeptus Ridiculous
URIEL VENTRIS: Least Depressed Named Ultramarine | Warhammer 40k Lore

Adeptus Ridiculous

Jun 17, 2026 78:46


https://www.patreon.com/AdeptusRidiculous https://www.adeptusridiculous.com/ https://twitter.com/AdRidiculous https://shop.orchideight.com/collections/adeptus-ridiculous Uriel Ventris is the young captain of the 4th Company of the Ultramarines Chapter of Space Marines. Captain Uriel Ventris was born in the subterranean cities of the Imperial Civilised World of Calth in the Realm of Ultramar and chose to become an Ultramarines aspirant when he came of age to participate in the Chapter's trials. He succeeded in his quest and became an Ultramarines neophyte and then earned his way into the ranks of the Ultramarines' officer corps through his bravery and devotion to the ideals of the Ultramarines primarch, Roboute Guilliman. However, some of his battle-brothers, like Sergeant Learchus, questioned Uriel's commitment to the Codex Astartes because his friend, mentor, and predecessor Captain Idaeus, though a hero of the Chapter, was known to break the Codex's teachings regularly. 00:00 Lengthy Intro, Book club, Merch 11:00 URIEL VENTRIS Lore Support the show

Marketing Against The Grain
Automate Boring Tasks With Codex & Claude Code in X Minutes

Marketing Against The Grain

Jun 17, 2026 27:31


Workflow for building skills with Claude Code & Codex: https://clickhubspot.com/kcta Ep. 431 Should you just use Claude Code and Codex for your main workflows? Kipp, Kieran, and guest Peter Yang (led products and teams at Roblox, Reddit, Amazon (Twitch), and Meta) dive into how marketers can transform their productivity with AI-driven systems, building reusable automations, and evaluating AI output for real impact. Learn more on identifying and documenting your workflows, building and refining AI "skills," and harnessing powerful evaluation methods (evals) to ensure your automations actually deliver results. Mentions Peter Yang https://www.youtube.com/@peteryangyt Codex https://openai.com/codex/ Claude Code https://claude.com/product/claude-code Get our guide to build your own Custom GPT: https://clickhubspot.com/customgpt Resource [Free] Steal our favorite AI Prompts featured on the show! Grab them here: https://clickhubspot.com/aip We're on Social Media! Follow us for everyday marketing wisdom straight to your feed YouTube: https://www.youtube.com/channel/UCGtXqPiNV8YC0GMUzY-EUFg Twitter: https://twitter.com/matgpod TikTok: https://www.tiktok.com/@matgpod Thank you for tuning into Marketing Against The Grain! Don't forget to hit subscribe and follow us on Apple Podcasts (so you never miss an episode)! https://podcasts.apple.com/us/podcast/marketing-against-the-grain/id1616700934 If you love this show, please leave us a 5-Star Review https://link.chtbl.com/h9_sjBKH and share your favorite episodes with friends. We really appreciate your support. Host Links: Kipp Bodnar, https://twitter.com/kippbodnar Kieran Flanagan, https://twitter.com/searchbrat 'Marketing Against The Grain' is a HubSpot Original Podcast // Brought to you by HubSpot Media // Produced by Darren Clarke.

History of North America
CODEX 8.4 The American Crisis by Thomas Paine

History of North America

Jun 16, 2026 10:23


The American Crisis, or simply The Crisis, is a series of 16 influential political pamphlets published between 1776 and 1783 during the American Revolutionary War (1775-83) by eighteenth-century Enlightenment philosopher and author Thomas Paine, an Englishman living in the colonies who signed his essays pseudonymously as "Common Sense," the title of his earlier influential work. Each essay bolstered the morale of the American colonists to fight hard for their independence, appealed to the English to support the colonists' cause, clarified the issues at stake, and denounced any type of negotiated peace. The essays were gathered into one volume in 1882, showcasing the iconic opening line: "These are the times that try men's souls. The summer soldier and the sunshine patriot will, in this crisis, shrink from the service of their country; but he that stands it now, deserves the love and thanks of man and woman." The American Crisis by Thomas Paine at https://amzn.to/4dKKClU Common Sense by Thomas Paine (book) available at https://amzn.to/3MKX77b Writings of Thomas Paine available at https://amzn.to/3MCaFC2 Books about Thomas Paine available at https://amzn.to/4s3qxOg ENJOY Ad-Free content, Bonus episodes, and Extra materials when joining our growing community on https://patreon.com/markvinet SUPPORT this channel by purchasing any product on Amazon using this FREE entry LINK https://amzn.to/3POlrUD (Amazon gives us credit at NO extra charge to you). Mark Vinet's HISTORICAL JESUS podcast at https://parthenonpodcast.com/historical-jesus Mark's TIMELINE video channel: https://youtube.com/c/TIMELINE_MarkVinet Website: https://markvinet.com/podcast Facebook: https://www.facebook.com/mark.vinet.9 X (twitter): https://twitter.com/MarkVinet_HNA Instagram: https://www.instagram.com/denarynovels Mark's books: https://amzn.to/3k8qrGM Audio credits: The American Crisis by Thomas Paine (a LibriVox production read by volunteers and coordinated by Michele Fry, 2014).
See omnystudio.com/listener for privacy information.

History of North America
Codex 1.13 Ben Franklin's Autobiography

History of North America

Jun 16, 2026 15:02


The Autobiography of Benjamin Franklin (1706-1790) was written in the form of an extended letter to his son, William Franklin (1730-1813). Ben kept good records of his life and travels, and although he was never President, he still played a crucial part in American history. Enjoy this ENCORE Presentation! The Autobiography of Benjamin Franklin at https://amzn.to/43cp6CV Benjamin Franklin Books available at https://amzn.to/41fUkGD ENJOY Ad-Free content, Bonus episodes, and Extra materials when joining our growing community on https://patreon.com/markvinet SUPPORT this channel by purchasing any product on Amazon using this FREE entry LINK https://amzn.to/3POlrUD (Amazon gives us credit at NO extra charge to you). Mark Vinet's HISTORICAL JESUS podcast at https://parthenonpodcast.com/historical-jesus Mark's TIMELINE video channel: https://youtube.com/c/TIMELINE_MarkVinet Website: https://markvinet.com/podcast Facebook: https://www.facebook.com/mark.vinet.9 X (Twitter): https://twitter.com/MarkVinet_HNA Instagram: https://www.instagram.com/denarynovels Mark's books: https://amzn.to/3k8qrGM Audio credits: The Autobiography of Benjamin Franklin (Librivox, read by T. Hersant). See omnystudio.com/listener for privacy information.

Codex History of Video Games with Mike Coletta and Tyler Ostby - Podaholics
Episode 365.5 - Codex Remastered: Episode 52 - PSP Games

Codex History of Video Games with Mike Coletta and Tyler Ostby - Podaholics

Jun 15, 2026 62:25


Mike and Tyler had some life stuff come up this week, so enjoy this old episode on notable PSP games! If we missed your favorite game, email us at codexhistorypodcast@gmail.com or go to codexpodcast.net. The theme music is by RoccoW. The logo was created by Dani Dodge.

Supra Insider
#114: Why I quit my high-paying PM job to go all in as a solopreneur builder | Peter Yang (ex-Roblox, Reddit, Twitter)

Supra Insider

Jun 15, 2026 74:56


What does it take to walk away from a decade in product, and a job most people would envy, to bet on yourself? In this episode of Supra Insider, Marc Baselga and Ben Erez sit down with Peter Yang, who just left his product lead role at Roblox to go full-time on his newsletter and podcast, Behind the Craft, and build his own projects. Peter talks through the trade-offs of solopreneur life, why his calendar is suddenly empty, and how he uses an AI personal advisor with three principles to decide what to say no to. They explore his day-to-day AI builder stack, from running Codex as a daily driver to using Hermes for his recurring scheduled tasks, his working definition of slop and why he guards against it, and what he's actually measuring as success now that nobody is handing him a promotion. If you're a PM weighing whether to leave a stable job to build on your own, a creator trying to scale output without sliding into slop, or anyone wiring AI agents into their daily work, this episode is for you. All episodes of the podcast are also available on Spotify, Apple and YouTube. New to the pod? Subscribe below to get the next episode in your inbox

Podcasts – Weird Things
Using AI To Find Waste And Hidden Costs In Your Business

Podcasts – Weird Things

Jun 14, 2026


Andrew Mayne and Brian Brushwood dig into one of the most immediately useful applications of AI agents: hunting down waste, friction, and forgotten costs in everyday business operations. Brian explains how connecting ChatGPT to his finances helped him uncover orphaned subscriptions, duplicate services, and even a long-forgotten annual GPS dog collar charge, while Andrew describes using Codex to audit AWS charges, recurring billing in Gmail, Apple Card statements, and an overpriced web host for the podcast. Along the way they make the case that Codex is different from a normal chatbot because it can persist on tasks, work through files and folders, use connected accounts, operate websites without APIs, and function more like a capable intern than a search box. They also talk through the learning curve, privacy concerns, trust-building in stages, using AI to generate business experiments and revenue ideas, and why speed of adaptation matters more than trying to pause technological change. The recurring theme is simple: use AI to find the stupid in your systems, save real money, and free up time for more creative work. Picks: Andrew Mayne: Riley Brown’s YouTube quick-start tutorials on Codex; Brian Brushwood: Just Evil Enough by Alistair Croll and Emily Ross

Podcasts – Weird Things
AI Filmmaking Tools, Robot Liability, and GLP-1 Ripple Effects

Podcasts – Weird Things

Jun 14, 2026


Andrew Mayne, Justin Robert Young, and Brian Brushwood explore how new AI video tools are changing filmmaking by making real footage more editable and steerable, letting creators keep human performances while using AI for sets, lighting, costumes, and polish. They compare that shift to earlier changes in digital editing and game engines, then turn to viral robot mishap clips to separate remote-controlled demos from true autonomy and to ask the bigger question of who carries legal and moral responsibility when future robots inevitably cause harm. From there they jump to a possible primordial black hole candidate as evidence related to dark matter, a promising one-time gene therapy approach for cholesterol, and the broader effects of GLP-1 drugs on appetite, addiction, gambling, alcohol use, and the business models built around those habits. They wrap by sharing how tools like Codex are already helping them build websites, automate repetitive tasks, migrate infrastructure, and dramatically cut costs, arguing that AI is most useful right now as a way to remove drudgery and free up more time for actual creative work. Picks: Brian Brushwood: Spider-Noir; Justin Robert Young: The Hulk Hogan documentary on Netflix; Justin Robert Young: Rocky Balboa

After Things Podcast
AI Filmmaking Tools, Robot Liability, and GLP-1 Ripple Effects

After Things Podcast

Jun 14, 2026


Andrew Mayne, Justin Robert Young, and Brian Brushwood explore how new AI video tools are changing filmmaking by making real footage more editable and steerable, letting creators keep human performances while using AI for sets, lighting, costumes, and polish. They compare that shift to earlier changes in digital editing and game engines, then turn to viral robot mishap clips to separate remote-controlled demos from true autonomy and to ask the bigger question of who carries legal and moral responsibility when future robots inevitably cause harm. From there they jump to a possible primordial black hole candidate as evidence related to dark matter, a promising one-time gene therapy approach for cholesterol, and the broader effects of GLP-1 drugs on appetite, addiction, gambling, alcohol use, and the business models built around those habits. They wrap by sharing how tools like Codex are already helping them build websites, automate repetitive tasks, migrate infrastructure, and dramatically cut costs, arguing that AI is most useful right now as a way to remove drudgery and free up more time for actual creative work. Picks: Brian Brushwood: Spider-Noir; Justin Robert Young: The Hulk Hogan documentary on Netflix; Justin Robert Young: Rocky Balboa

After Things Podcast
Using AI To Find Waste And Hidden Costs In Your Business

After Things Podcast

Jun 14, 2026


Andrew Mayne and Brian Brushwood dig into one of the most immediately useful applications of AI agents: hunting down waste, friction, and forgotten costs in everyday business operations. Brian explains how connecting ChatGPT to his finances helped him uncover orphaned subscriptions, duplicate services, and even a long-forgotten annual GPS dog collar charge, while Andrew describes using Codex to audit AWS charges, recurring billing in Gmail, Apple Card statements, and an overpriced web host for the podcast. Along the way they make the case that Codex is different from a normal chatbot because it can persist on tasks, work through files and folders, use connected accounts, operate websites without APIs, and function more like a capable intern than a search box. They also talk through the learning curve, privacy concerns, trust-building in stages, using AI to generate business experiments and revenue ideas, and why speed of adaptation matters more than trying to pause technological change. The recurring theme is simple: use AI to find the stupid in your systems, save real money, and free up time for more creative work. Picks: Andrew Mayne: Riley Brown’s YouTube quick-start tutorials on Codex; Brian Brushwood: Just Evil Enough by Alistair Croll and Emily Ross

DevTalles
260 - The War of the Code Editors

DevTalles

Jun 14, 2026 52:00


The war over the AI code editor: Cursor, Codex, ClaudeCode, Antigravity, and Copilot fighting to control how you program. The good, the ugly, and what it means for your day-to-day as a dev.

Everyday AI Podcast – An AI and ChatGPT Podcast
Ep 797: Claude's Mythos and Fable 5, Google's New Live AI, ChatGPT's New Powers and 7 Other AI Features You Can't Afford To Not Use

Everyday AI Podcast – An AI and ChatGPT Podcast

Jun 12, 2026 36:31


If you spent too much time prompting Claude's Fable 5 before it likely goes away for subscribers in 10 days, you might have missed some AI gems.

Security Conversations
Mythos, Fable, and Anthropic's Big Trust Problem

Security Conversations

Jun 12, 2026 119:10


(Presented by TLPBLACK: A cybersecurity intelligence platform focused on sharing curated, high-sensitivity threat insights and research with trusted security professionals.) Three Buddy Problem - Episode 101: We discuss Anthropic's Mythos 5 and Claude Fable 5 release and the bombshell that the company was silently downgrading paid users' results, sparking a heated debate over guardrails, gatekeeping, and whether elite AI reasoning is becoming a privilege for the few. Plus, AI-generated N-day exploits killing the patch window, a record-shattering Patch Tuesday, Meta's latest court filing against spyware maker NSO Group, the return of cyber paleontology, and a detour into the new government UFO drops. Cast: Juan Andres Guerrero-Saade, Ryan Naraine and Costin Raiu. Timestamps: 0:00 - Introductory banter 3:22 - The Mythos 5 / Claude Fable 5 release 14:42 - Anthropic's silent downgrade trust problem 26:18 - Anti-competitive behavior & the AV "stealing detection" parallel 32:29 - Distillation, China & the real motive 38:04 - "Too dangerous to release" & gatekeeping vs. guardrailing 45:53 - Is Mythos a threat to malware-analysis startups? 48:20 - Dario's AI regulation essay 56:48 - N-day exploits and death of the patch window 1:07:18 - Patch Tuesday and 10x vulnerability surge 1:10:34 - Meta catches NSO Group 1:14:45 - Cyber paleontology, Shadow Brokers leaks 1:28:29 - Moonlight Maze and learning from history 1:34:22 - UFOs, UAPs and Disclosure Day

Night Attack Audio Feed
Great Night #258: Tech Support with Neo from The Matrix

Night Attack Audio Feed

Jun 11, 2026


Brian may be haunted, dusty, gassy, or simply cursed by three beeps. Gmail becomes a virus-spewing kaiju, Codex may or may not be powered by an enchanted stone, and Justin waits to see if Apple AI is finally good enough to make him eat his pants. Get an extra episode every week only at https://www.patreon.com/greatnight!

Night Attack Video Feed
Great Night #258: Tech Support with Neo from The Matrix

Night Attack Video Feed

Jun 11, 2026


Brian may be haunted, dusty, gassy, or simply cursed by three beeps. Gmail becomes a virus-spewing kaiju, Codex may or may not be powered by an enchanted stone, and Justin waits to see if Apple AI is finally good enough to make him eat his pants. Get an extra episode every week only at https://www.patreon.com/greatnight!

Vidas en red Spreaker
Codex beats OpenClaw, and my complaints to El Corte Inglés

Vidas en red Spreaker

Jun 11, 2026 16:47 Transcription Available


El Corte Inglés finally responded. Thanks, AI! I also talk about how Codex is taking work off my hands. The Island's collection agency (Proyecto MEGA ISLA): Paypal: juliommd@hotmail.com Bizum: https://revolut.me/julioqdf Get your Simyo data SIM and support Vidas en red: simyo.es/amigos.html?codeMGM=056G0J69. Automatically, YOU GET YOUR REWARD! (valid until 07/07/2026) Telegram Isla broadcast: https://t.me/+M46yiWO_BJU2Nzky Cárcel Planetaria: https://www.youtube.com/@carcelplanetaria Subscribe to my podcast: https://www.spreaker.com/user/vidasenred

Everyday AI Podcast – An AI and ChatGPT Podcast
Ep 795: Codex Sites: The Lovable and Replit Killer? A hands-on Guide to Codex Sites

Everyday AI Podcast – An AI and ChatGPT Podcast

Jun 10, 2026 38:30


One of the biggest problems of vibe coding? Securely keeping the project up to date and sharing it with your team to make it actually useful. And there's a new solution that does just that, Codex Sites. With a few simple prompts, you can turn vibe coded throwaway apps into working pieces of software that your team can share. We put AI to work on Wednesday and show you how to get the most out of Codex Sites. Codex Sites: The Lovable and Replit Killer? A hands-on Guide to Codex Sites -- An Everyday AI Chat with Jordan Wilson
Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Today's Episode on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn
Topics Covered in This Episode:
Codex Sites vs Static File Sharing
Live Dashboards and Automated Workflows
Building Internal Apps With Codex Sites
Real-Time Data Integration in Codex
Agent Layer and Role-Based Access Control
Codex Sites vs Replit, Lovable, Bolt
Dynamic Business Insights and Collaboration
Codex Sites Secure Team Sharing Limitations
Automations and Custom Skills in Codex
Future of AI Native Business Tools
Timestamps:
00:00 The future of work automation
03:43 Free daily newsletter highlights
08:29 Managing audience momentum dashboard
12:04 Pulling stats and data access
14:48 Creating dynamic web tools
16:18 Editing video collaboration challenges
21:09 Comparing coding platforms like Replit
25:47 Future of Business Analytics Tools
27:11 Introducing the Start Here series
32:35 Updating old content ideas
34:53 Streamlining team efficiency with AI
37:02 Episode use cases overview
Keywords: Codex sites, OpenAI, AI dashboards, live software, file sharing, business automation, dynamic data, ChatGPT business, agentic system, Chrome integration, MCP servers, skills, plugins, Copilot Scout, internal dashboards, data analysis, role based access control, data governance, enterprise AI tools, site hosting, live app builder, prompt driven apps, automations, Replit alternative, Lovable competitor, full stack app builder, dynamic business context, annotation feature, nontechnical teams, BI dashboards, Kanban tracker, evergreen content, live indicators, audience momentum dashboard, sub agent, responsive design, visual design, parallax feature, actionable insights, version control, dynamic deliverables, artifact, demo over memo, knowledge work, IT security, internal URL sharing, AI native workflow, internal business tools, real time updates, start here series
Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)
Start Here ▶️ Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Circle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist.

Vidas en red Spreaker
Using #AI to file a complaint against El Corte Inglés

Vidas en red Spreaker

Jun 10, 2026 27:56 Transcription Available


Last February I went to El Corte Inglés to ask about certain insurance policies. A nightmare awaited me. Without my consent they activated the policies and proceeded to charge me for insurance I never signed, never took out, and using data I never authorized them to use. Helpless and without assistance, I called on AI, specifically Gemini and #Codex. So far it has worked: I automated tasks that would otherwise have taken me much, much longer.

History of North America
Codex 1.12 Ben Franklin's Autobiography

History of North America

Jun 9, 2026 10:00


The Autobiography of Benjamin Franklin (1706-1790) was written in the form of an extended letter to his son, William Franklin (1730-1813). Ben kept good records of his life and travels, and although he was never President, he still played a crucial part in American history. Enjoy this ENCORE Presentation! The Autobiography of Benjamin Franklin at https://amzn.to/43cp6CV Benjamin Franklin Books available at https://amzn.to/41fUkGD ENJOY Ad-Free content, Bonus episodes, and Extra materials when joining our growing community on https://patreon.com/markvinet SUPPORT this channel by purchasing any product on Amazon using this FREE entry LINK https://amzn.to/3POlrUD (Amazon gives us credit at NO extra charge to you). Mark Vinet's HISTORICAL JESUS podcast at https://parthenonpodcast.com/historical-jesus Mark's TIMELINE video channel: https://youtube.com/c/TIMELINE_MarkVinet Website: https://markvinet.com/podcast Facebook: https://www.facebook.com/mark.vinet.9 X (Twitter): https://twitter.com/MarkVinet_HNA Instagram: https://www.instagram.com/denarynovels Mark's books: https://amzn.to/3k8qrGM Audio credits: The Autobiography of Benjamin Franklin (Librivox, read by T. Hersant). See omnystudio.com/listener for privacy information.

History of North America
CODEX 8.3 The American Crisis by Thomas Paine

History of North America

Jun 9, 2026 10:06


The American Crisis, or simply The Crisis, is a series of 16 influential political pamphlets published between 1776 and 1783 during the American Revolutionary War (1775-83) by eighteenth-century Enlightenment philosopher and author Thomas Paine, an Englishman living in the colonies who signed his essays pseudonymously as "Common Sense," the title of his earlier influential work. Each essay bolstered the morale of the American colonists to fight hard for their independence, appealed to the English to support the colonists' cause, clarified the issues at stake, and denounced any type of negotiated peace. The essays were gathered into one volume in 1882, showcasing the iconic opening line: "These are the times that try men's souls. The summer soldier and the sunshine patriot will, in this crisis, shrink from the service of their country; but he that stands it now, deserves the love and thanks of man and woman." The American Crisis by Thomas Paine at https://amzn.to/4dKKClU Common Sense by Thomas Paine (book) available at https://amzn.to/3MKX77b Writings of Thomas Paine available at https://amzn.to/3MCaFC2 Books about Thomas Paine available at https://amzn.to/4s3qxOg ENJOY Ad-Free content, Bonus episodes, and Extra materials when joining our growing community on https://patreon.com/markvinet SUPPORT this channel by purchasing any product on Amazon using this FREE entry LINK https://amzn.to/3POlrUD (Amazon gives us credit at NO extra charge to you). Mark Vinet's HISTORICAL JESUS podcast at https://parthenonpodcast.com/historical-jesus Mark's TIMELINE video channel: https://youtube.com/c/TIMELINE_MarkVinet Website: https://markvinet.com/podcast Facebook: https://www.facebook.com/mark.vinet.9 X (twitter): https://twitter.com/MarkVinet_HNA Instagram: https://www.instagram.com/denarynovels Mark's books: https://amzn.to/3k8qrGM Audio credits: The American Crisis by Thomas Paine (a LibriVox production read by volunteers and coordinated by Michele Fry, 2014).
See omnystudio.com/listener for privacy information.

CE Pro Podcast
CE Pro Podcast #174: Sonance CEO Ari Supran and How the Custom Integration Channel is Using AI

CE Pro Podcast

Jun 9, 2026 58:45 Transcription Available


AI isn't just about asking ChatGPT questions anymore. In this episode of the CE Pro Podcast, Sonance CEO Ari Supran shares how he went from experimenting with AI for simple tasks to helping integrators and business leaders use advanced AI tools to build software, automate workflows, and reclaim valuable time. Supran explains why many businesses are still stuck in the "chat" phase of AI adoption and how newer agentic tools like Claude Code, Claude Cowork, and Codex are enabling users to delegate work instead of simply generating answers. He discusses practical examples of AI organizing files, analyzing documents, building dashboards, creating custom business applications, and even acting as a digital coworker. The conversation also explores the growing role of AI agents, the importance of open APIs and software integrations, and why Supran believes custom integration firms have a unique opportunity to build purpose-driven tools tailored to their own operations. Along the way, he shares lessons Sonance has learned from deploying AI across the organization and offers advice for integrators looking to move beyond basic AI use cases. Whether you're just getting started with AI or already experimenting with automation and custom tools, this discussion provides a practical look at where AI is headed and how integration businesses can prepare for what's next.

Side Project Spotlight
#113: WWDC26 — No, Seriously, Siri Works Now

Side Project Spotlight

Jun 9, 2026 45:55


Recorded thirty minutes after the WWDC26 State of the Union keynote ended, The Trio delivers a hot-take reaction to everything Apple just announced. Steve makes his boldest claim yet: 2026 is the year the "Universal UI" era begins, anchored by a Siri demo that appeared to actually work in real time. Xcode quietly Sherlocked the Codex app, SwiftUI got reorderable containers and (finally) AsyncImage caching, and Aaron spotted some very suspicious folding phone tea leaves in the new Simulator replacement.
## Chapters
00:08 Introductions
01:31 Reviewing The Trio's "Universal UI" Concept
02:39 Comparison of "AI" Apps: Siri, Claude, Codex, ChatGPT
05:43 Multimodal Prompts & Private Cloud Compute
07:26 Foundation Model Device Requirements
09:50 Dynamic Profiles and Custom Model Configurations
13:41 Xcode 27 Sherlocked the Codex App
15:42 Xcode and Developer Tool Evolution
21:47 SwiftUI Updates: Reordering and AsyncImage Cache
26:14 A Grab Bag of Random Stuff
28:23 App Actions and Siri Integration
35:04 No Apple Claw?
39:09 Swift Compiler Unable to Type Check Error
40:37 Final Impressions
42:02 Folding Phone Tea Leaves
43:07 Snow Leopard Speed Improvements
43:53 Wrap Up & One More Thing...
45:50 Tag
## Show Notes
- Steve declares 2026 the start of the "Universal UI era," with a live Siri demo that actually worked as his primary evidence.
- Aaron clocked the demo as mostly staring at a loading spinner; Steve argues Apple had to prove the on-device inference wasn't faked this time.
- The Foundation Models framework supports dynamic profiles: configurable system prompts, temperatures, and thinking budgets per scenario within a single app.
- Xcode 27 ships an agentic coding UI seemingly inspired by the Codex app, prompting Kotaro to ask point-blank: "Are you saying they Sherlocked Codex?"
- SwiftUI finally has a reorderable container, which The Trio immediately wants in Bento Fit after a previous attempt even an "AI" agent couldn't pull off.
- AsyncImage gets a built-in cache after years of third-party workarounds; Steve suspects some intern with an unlimited Claude Code budget finally got it done.
- App Actions now supports natural language invocation without requiring specific phrases or app name mentions, though exact limits remain fuzzy.
- Aaron flags resizable iOS windows (previously iPad-only) and an arbitrary-aspect-ratio Simulator replacement as very suspicious folding phone tea leaves.
- Kotaro closes on Snow Leopard-style speed wins across the board, including 80% faster AirDrop, because speed is still a feature worth shipping.
## Links
**One More Thing**
Cleo Family: https://www.cleofamily.app/track
**PhillyCocoa:** https://phillycocoa.org
Intro music: "When I Hit the Floor", © 2021 Lorne Behrman. Used with permission of the artist.

The Marketing AI Show
#218: Anthropic IPO, Trump AI Executive Order, Rising AI Costs & OpenAI Merges Codex Into ChatGPT

The Marketing AI Show

Jun 9, 2026 85:13


Anthropic filed for an IPO and published a paper warning that recursive self-improvement may arrive faster than anyone is ready for. Paul and Mike break down both, then cover Trump's AI executive order, government stakes in AI labs, and the corporate scramble to control AI token costs. Rapid fire: Apple WWDC previews, OpenAI's Codex-ChatGPT merger, Brockman's super PAC, AI rolling up the accounting industry, Stanford law professors losing to AI 75% of the time, and product updates from Google, Microsoft, Meta, and Anthropic. Show Notes: Access the show notes and show links here AI-Pulse Survey: Fill out this week's AI-Pulse Survey here. Timestamps: 00:00:00 — Intro 00:05:53 — Anthropic IPO & Talks Recursive Self-Improvement 00:25:52 — Trump's AI Executive Order & Government Stakes in AI Labs 00:37:34 — The Soaring Cost of Intelligence, Part 2 00:57:34 — Apple WWDC 01:01:36 — OpenAI Is Merging Codex and ChatGPT 01:06:19 — OpenAI Distances Itself from Brockman's Super PAC 01:08:55 — AI Roll-Up Targets the Accounting Industry 01:12:23 — AI in Higher Education 01:16:29 — AI Use Case Spotlight 01:20:29 — AI Product and Funding Updates This episode is brought to you by AI Academy by SmarterX. AI Academy is your gateway to personalized AI learning for professionals and teams. Discover our new on-demand courses, live classes, certifications, and a smarter way to master AI. Learn more here. Visit our website Receive our weekly newsletter Join our community: Slack Community LinkedIn Twitter Instagram Facebook YouTube Looking for content and resources? Register for a free webinar Come to our next Marketing AI Conference Enroll in our AI Academy 

Mac Geek Gab (Enhanced AAC)
Mic-graines and Infotainment!

Mac Geek Gab (Enhanced AAC)

Play Episode Listen Later Jun 8, 2026 82:06 Transcription Available


Your iPhone might be running hot and draining fast — and it’s not just you. Dave and Pilot Pete break down the battery chaos introduced by iOS 26.5, which brought overheating, accelerated drain, and even blocked wired charging on iPhone 17 and Air models. The fix that’s working for most people: disable iCloud Keychain first, run Reset All Settings, then carefully re-enable iCloud sync — otherwise you’ll nuke your Wi-Fi passwords across every device. iOS 26.5.1 is out and should help, but until you’ve updated, your electrons deserve better. You’ll also learn why Apple ID passkeys are locked to Apple’s own keychain with no known path to third-party managers like 1Password or Keeper, and why editing a contact on a modern Mac can somehow peg every CPU core — in 2026, no less. From there, Dave and Pete tackle the full listener mailbag: how to rescue missing contact names from Messages, the right way to boot a MacBook with a broken display into clamshell mode so it actually uses the external monitor, and a deep dive on 5K vs. 4K displays where Dave argues your eyes may not care as much as the pixel-per-inch math suggests. You’ll get smart ideas for repurposing a 2015 iPad Pro that can’t run modern apps — including Dave’s Claude Code-built weather dashboard running off a headless iMac as a web interface. A crashing 2021 MacBook Pro turns out to have been felled by a single bad SD card, and the lesson is golden: feed your crash reports to an LLM and let it do the digging. And Don’t Get Caught with outdated OpenAI macOS apps — update ChatGPT, Codex, Atlas, and Codex CLI before June 12th to stay ahead of a code-signing rotation triggered by a compromised open-source library. 00:00:00 Mac Geek Gab 1145 for Monday, June 8th, 2026 June 8th: National Best Friends Day MGG Monthly Giveaway – Win a license to SaneBox Quick Tips 00:00:01 Dan-QT-Multi-select on iPhone with a quick drag 00:04:31 Tim-QT-Have iOS 26.5 Battery Drain? Reset All Settings, but be careful! 
00:13:32 Kent-QT-1144-Collapse stacks by clicking the down-facing carat in the menu 00:14:15 Mark-QT-Match Frame Rate on your Apple TV for smoother experiences 00:17:58 What are the differences between refresh rates and frame rates and…why? 00:21:09 KiwiGraham-QT-Apple Account Passkeys vs. Third Party Password Apps Sponsors 00:23:09 SPONSOR: Keeper. Right now, Keeper is offering our listeners 60% off personal and family plans at https://Keepersecurity.com/MGG. This offer is only for podcast listeners! 00:24:50 SPONSOR: Helix Sleep makes premium mattresses and bedding that are customized to fit your personal needs, and conveniently shipped to your door. Go to https://helixsleep.com/MGG for 20% Off Sitewide. 00:26:23 SPONSOR: NordLayer Browser. The business browser built for how modern work actually happens — giving IT the visibility and control to secure SaaS, stop phishing, and prevent data leaks right at the source. Your Questions Answered and Tips Shared! 00:28:09 VaShaun-How can I restore lost Contacts on my Mac? 00:37:36 Si-What to do with an 11-year-old iPad? Claude Code 00:46:40 Michael-Why do we have to pull-to-refresh for updates? 00:50:04 Blake-1144-Damaged displays, external monitors, and MonitorControl 00:55:48 Joe & Michael-CSF-1144–RetinaDesk.com for reviews of 5K and 6K monitors BenQ MA270UP 27” 4K Display Reviews 01:02:50 Hog fan and Cowboy fan-MGG Review–Favorite Tech podcast Don't Get Caught 01:04:14 Father John-DGC-Investigate those crash reports before you replace your Mac 01:09:26 Update your ChatGPT Apps ChatGPT Desktop Codex App Codex CLI Atlas 01:11:06 Andy-DGC-When Troubleshooting, Don’t Get Caught asking the wrong questions or assuming the wrong facts 01:19:36 MGG 1145 Outtro MGG Monthly Giveaway Bandwidth Provided by CacheFly Pilot Pete's Aviation Podcast: So There I Was (for Aviation Enthusiasts) The Debut Film Podcast – Adam's new podcast! 
Dave's Business Brain (for Entrepreneurs) and Gig Gab (for Working Musicians) Podcasts MGG Merch is Available! Mac Geek Gab iOS app Mac Geek Gab YouTube Page Mac Geek Gab Live Calendar This Week's MGG Premium Contributors MGG Apple Podcasts Reviews feedback@macgeekgab.com 224-888-GEEK Active MGG Sponsors and Coupon Codes List BackBeat Media Podcast Network

Everyday AI Podcast – An AI and ChatGPT Podcast
Ep 793: Apple's WWDC AI plans, U.S. Gov wants equity in Big Tech, OpenAI's business moves and more

Everyday AI Podcast – An AI and ChatGPT Podcast

Play Episode Listen Later Jun 8, 2026 41:02


Cult of Conspiracy
#1090- Conversations With Codex The Wise

Cult of Conspiracy

Play Episode Listen Later Jun 8, 2026 156:58 Transcription Available


To Sign up for our Patreon go to-> Patreon.com/cultofconspiracypodcast
To Find The Cajun Knight Youtube Channel---> click here
To find the Meta Mysteries Podcast---> https://open.spotify.com/show/6IshwF6qc2iuqz3WTPz9Wv?si=3a32c8f730b34e79
https://flavorsforest.com/cult/
Become a supporter of this podcast: https://www.spreaker.com/podcast/cult-of-conspiracy--5700337/support.

AWS Morning Brief
OpenAI on Bedrock and Other Strange Bedfellows

AWS Morning Brief

Play Episode Listen Later Jun 8, 2026 7:25


AWS Morning Brief for the week of June 8th, with Corey Quinn.
Links:
AWS Interconnect - multicloud now offers a free 500 Mbps tier
Oracle Database@AWS is now available in twenty AWS Regions
Amazon Cognito now supports multi-Region replication
Amazon EKS and Amazon EKS Distro now supports Kubernetes version 1.36
Amazon SES now supports tenant-level suppression lists
AWS Compute Optimizer now supports 32-day lookback for EBS volume and ECS service rightsizing recommendations
AWS Cost and Usage Report 2.0 now supports Athena and Redshift integration
Amazon ElastiCache for Valkey now supports durability
Understanding how backups work in Amazon Aurora
OpenAI models and Codex on Amazon Bedrock are now generally available
How Bedrock Streaming optimizes its AWS costs
From Monolith to Multi-Account: Pinterest's AWS Organization Transformation Journey
Gain visibility into DDoS attacks with flow logs in AWS Shield Advanced
Identify unused AWS KMS keys and prevent accidental key deletions
CVE-2026-10591 - Kiro IDE Insufficient File Write Restrictions to Execution-Sensitive Paths
CVE-2026-10584 - HTTPS Fallback to HTTP in Graph Explorer

The AI Breakdown: Daily Artificial Intelligence News and Discussions
10+ Things You Should Build With AI Instead of Sending Files

The AI Breakdown: Daily Artificial Intelligence News and Discussions

Play Episode Listen Later Jun 7, 2026 22:18


AI is making it possible to build richer versions of the files knowledge workers send every day: decks, memos, spreadsheets, reports, proposals, training materials, and more. This has gotten even easier this week with the release of OpenAI's "Sites" feature in Codex. In this practical Operator's episode, NLW walks through 10+ examples of work outputs that are often better as living, shareable, updateable, interactive links than static documents.
Sign up for AI Executive Catchup: https://aiexecutivecatchup.com/
Brought to you by:
KPMG – Research from KPMG and the University of Texas at Austin shows the highest-impact AI users treat AI like a reasoning partner, and those skills can be taught at scale. Learn more at kpmg.com/us/Sophisticated
Bolt - Claim a free month of Bolt Pro - https://bolt.new/partner/aidb/
Outsystems - Stop wondering how AI will change your business and start building the agents that will lead it - http://outsystems.com/
Scrunch - The AI customer experience platform - https://scrunch.com/
Zenflow Work - Agents for knowledge work - https://zenflow.free/
Blitzy - Want to accelerate enterprise software development velocity by 5x? https://blitzy.com/
AssemblyAI - The best way to build Voice AI apps - https://www.assemblyai.com/brief
Robots & Pencils - Cloud-native AI solutions that power results https://robotsandpencils.com/
The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614
Our Newsletter is BACK: https://aidailybrief.beehiiv.com/
Interested in sponsoring the show? sponsors@aidailybrief.ai

Weirdly Magical with Jen and Lou - Astrology - Numerology - Weird Magic - Akashic Records
Weekly Astrology June 7-June 13 2026 | BE THE REBEL. DANCE LIKE YOUR HIPS MOVE MOUNTAINS

Weirdly Magical with Jen and Lou - Astrology - Numerology - Weird Magic - Akashic Records

Play Episode Listen Later Jun 7, 2026 40:33


The International Business Times article:- https://www.ibtimes.com/louise-edington-reclaiming-language-stars-through-intuition-embodiment-matrifocal-wisdom-3803596
The Wellness Journal article:- https://wellnessvoice.com/louise-edington-on-reclaiming-the-language-of-the-stars-through-intuition-embodiment-and-matrifocal-wisdom/
Louise Edington discusses the astrological forecast for June 7-13, highlighting key planetary movements and their implications. Venus returns in-bounds on June 7, while Mercury remains out-of-bounds until June 14. The moon transits through Pisces, Aries, and Taurus, influencing emotional and strategic decisions. Key aspects include Venus conjuncting Jupiter at 25 Cancer, signifying new beginnings. Louise emphasizes the importance of intuition, interconnectedness, and the need for a matrifocal approach in astrology. She also mentions her recent feature in online publications and her mission to change astrological language and practice.

Let's Talk AI
#247 - Opus 4.8, MAI, Anthropic IPO, Minimax-M3

Let's Talk AI

Play Episode Listen Later Jun 6, 2026 105:02


Our 247th episode with a summary and discussion of last week's big AI news!
Recorded on 06/03/2026
Hosted by Andrey Kurenkov and Jeremie Harris
Feel free to email us your questions and feedback at andreyvkurenkov@gmail.com and/or hello@gladstone.ai
Read our text newsletter and comment on the podcast at https://lastweekin.ai/
In this episode:
Anthropic released Claude Opus 4.8 with improved benchmark scores, discussed eval-awareness findings and welfare/corrigibility themes from its system card, and introduced Dynamic Workflows for long-running multi-agent tasks.
Microsoft unveiled the always-on Microsoft Scout assistant built on OpenClaw plus new in-house MAI models (including MAI Thinking 1) and “frontier tuning,” emphasizing enterprise security architecture and model-from-scratch capability.
Major business moves included Anthropic's $65B Series H at a $965B valuation alongside an IPO filing, a JPMorgan analysis arguing OpenAI needs major revenue growth to justify infrastructure spend, and Cognition raising $1B at a $25B valuation.
Policy and security highlights covered Trump's voluntary pre-release government testing framework for powerful AI, Meta AI support being exploited to hijack Instagram accounts, tightened US Nvidia export controls and China's travel approvals for AI experts, plus expanded Glasswing/Mythos-style cyber and biodefense initiatives.
Timestamps:
(00:00:10) Intro / Banter
(00:04:10) Sponsors
(00:07:10) News Preview
Tools & Apps
(00:07:54) Anthropic releases Opus 4.8 with new 'dynamic workflow' tool | TechCrunch
(00:22:37) Microsoft Scout is a new AI personal assistant built on OpenClaw | The Verge
(00:26:55) Microsoft launches new MAI family of AI models at Microsoft Build | Mashable
(00:37:43) Robinhood now lets your AI agents trade stocks | TechCrunch
(00:40:49) OpenAI launches new Codex tools for white-collar work | TechCrunch
(00:43:40) ElevenLabs' new music-generation model can switch genres mid-track | TechCrunch
Applications & Business
(00:44:35) Anthropic Hits $965 Billion Valuation, Surpassing OpenAI - WSJ
(00:45:32) Anthropic Files to Go Public, Setting Stage for Huge I.P.O. - The New York Times
(00:51:15) China's ByteDance Developing New AI Chips Like Those from Nvidia Partner Groq
(00:55:00) Anthropic expands Mythos to 150 additional organizations
(00:55:35) OpenAI needs a 26x revenue increase to justify its buildout
(00:58:46) AI coding startup Cognition raises $1B at $25B pre-money valuation | TechCrunch
Projects & Open Source
(01:00:50) MiniMax-M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark performance for just 5-10% of the cost | VentureBeat
Policy & Safety
(01:06:08) Trump Signs Executive Order Seeking Oversight of A.I. Models - The New York Times
(01:11:45) Hackers Simply Asked Meta AI to Give Them Access to High-Profile Instagram Accounts. It Worked
(01:13:058) Chinese AI experts in private firms now required to secure approval before international travel - Beijing enforces policy to secure top-tier talent, expands measures beyond government
(01:17:53) U.S. Tightens Controls on Nvidia AI Chip Exports | Let's Data Science
(01:21:47) OpenAI launches Rosalind Biodefense, offers federal agencies early access to its life-sciences model
(01:24:00) Using LLMs to secure source code
(01:26:19) Project Glasswing: An initial update
(01:29:30) White House Approves $9 Billion for Spy Agencies to Catch Up on A.I.
(01:32:11) US Law Enforcement Warns of ‘Anti-Tech Extremism' as AI Hatred Grows
Synthetic Media & Art
(01:35:38) YouTube will now automatically label AI videos | TechCrunch
Research & Advancements
(01:36:22) Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention
(01:41:26) From Simulation to Enaction: Post-trained language models recognize and react to their own generations
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Breakaway
SpaceX, DataCenters, AI, Markets

Breakaway

Play Episode Listen Later Jun 6, 2026 69:32


OpenGolf tourney tomorrow
Choking. Heimlich maneuver
US Bank Fees
$12.50 per $50. That is 25% instantly
So $1000, is 20 * $12.50 = $250. + interest.
Reinstate the SAT
More than 1,100 University of California math and science professors are urging UC regents to reinstate college-entrance exams, saying that unprepared students are lowering academic standards and draining teaching resources.
Today, more than 90% of schools don't mandate the exams, Feder said.
60 minutes
Welcome to real life Scott Pelley. New boss, new style. Work or walk.
Recommendations: Bill Ackman Sara Frier Finance folks should know Codex (previously Excel)
Panthalassa
Markets: Huge correction today. Tech down 5%+ and S&P500 2.6%. The losses intensified after a robust jobs report raised new worries that the Federal Reserve may need to raise interest rates later this year to fight inflation.
S&P 500 still up 27% and tech 40-60% YoY.
Huge IPOs coming: SpaceX, Anthropic, OpenAI
Cash. Think about your cash investments. Cash is nice. Owning your home is nice.
AI & Datacenters
Google to raise $85 billion
Anthropic IPO
In May, Anthropic raised $65 billion in new funding from investors including Greenoaks, Dragoneer, Altimeter Capital and Sequoia Capital, in a round that valued the company at $965 billion. At the same time, the company said its revenue run-rate had surpassed $47 billion, up from $9 billion at the end of 2025
LLM usage
Grok: no bueno. Grok and Spreadsheets. Oh my.
Gemini. Good. Claude: BEST.
BTW, OpenAI was suspiciously very negative on SpaceX.
SpaceX Going public ~June12. Next Friday!? $75b raise at $1.75T valuation. Float is ~4-5% of total shares $10-18b must be purchased by index funds. More coming out in next 6 months. Employee lockups. Cap table investors want liquidity.
Great detail here from Alexandra
IPO Education
Hire IB's. Allocate to VIPs and whales. 5% to retail.
Valuation Over-valued? Valuation is highly relative to time!!!?? $135 price. $300 price? Either way 10-20x in 10 years. Not investment advice.
AI Opportunity
SpaceX is becoming an AI infrastructure play!!
Another Rental of Compute from Google to SpaceX. Anthropic and Google are now paying @SpaceX a combined $2.17 billion per month for compute capacity. That's a revenue run rate of $26 billion per year. BIG MONEY.
Jamie Dimon Interview of Elon. Elon and Dimon Another link here from Why SpaceX public now. Play at 4:00min mark: Why fundraising. Embarking on significant growth phase. 100,000 satellites. BTW. Why are datacenters hard if already doing satellites. 100x more bandwidth and ½ latency for v3. He just said that Starlink will be highest bandwidth and lowest latency of ANYTHING!! AI Datacenters in space. Massive capital endeavor. Hard to build power in the US or on land. US usage is 500GW. To double. Would need to 2x # of power plants. BUT if in space can go far beyond Earth
Manufacturing on the moon and building beyond 1000TW per year of AI Space Compute
DataCenters in Space
Easier than their communication satellites. AI datacenter is EASY
Elections: Why does it take so long to count votes? Could take weeks?

Everyday AI Podcast – An AI and ChatGPT Podcast
Ep 792: Autonomous Copilot agents, new Codex tools, Github CoPilot app and 7 more AI updates you should be using

Everyday AI Podcast – An AI and ChatGPT Podcast

Play Episode Listen Later Jun 5, 2026 36:45


✅ New autonomous agents. ✅ Canva designs made for you. ✅ Codex upgrades to make your business move. If you had your head down in spreadsheets this week, you missed some MAJOR AI upgrades that are available now. We track what's hot and what's not and break it all down on Fridays with our Friday Features. Autonomous Copilot agents, new Codex tools, Github CoPilot app and 7 more AI updates you should be using — An Everyday AI Chat with Jordan Wilson
Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Today's Episode on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn
Topics Covered in This Episode:
OpenAI Codex Role-Specific Plugins Launch
Microsoft Build Conference AI Feature Releases
ChatGPT Memory and Business Account Upgrades
Microsoft Flash Image Model for PowerPoint
Canva Integrated with ChatGPT and Codex
GitHub Copilot Standalone Desktop App Preview
Microsoft Autopilot Always-On Work Agents
OpenAI Models Now Available on AWS Bedrock
Codex Sites: AI-Built Internal Web Apps
Timestamps:
00:00 OpenAI's big money moves
03:47 Explaining role-specific plugins
09:02 Microsoft's new image model release
11:09 Microsoft's AI strategy and Canva update
14:23 Canva integration with ChatGPT
16:56 GitHub Copilot's new canvas feature
20:46 AI token subscription changes
24:42 AWS adds OpenAI models to Bedrock
28:25 Introducing OpenAI's CodeX Sites Feature
32:07 Launch of OpenAI's New Plug-in
34:16 Overview of podcast structure
Keywords: Autonomous copilot agents, Codex tools, GitHub Copilot app, OpenAI Codex, ChatGPT business accounts, OpenAI enterprise, Microsoft Build conference, Microsoft always-on agents, AWS AI updates, Canva plugin, ChatGPT memory upgrade, Windows Codex integration, Microsoft Flash model, Enterprise apps integration, Role-specific plugins, 
Sales data analytics, Product design AI, Creative production AI, Investment banking plugin, Public equity investing, Data analytics plugin, Workspace admins, App permissions, Role-aware work agent, Financial research automation, Microsoft image generation model, PowerPoint AI integration, OneDrive AI features, Visual design creation, Canva app for ChatGPT, Canva MCP server, Agentic context carry, Full screen design preview, GitHub Copilot desktop app, GitHub Copilot Canvas, Agent-native command center, Parallel agent work tree, Code app interface, Model options in GitHub, Token usage limits, Subscription token subsidizing, Anthropic token efficiency, Amazon Bedrock, GPT-4, GPT-4.5, Small language models, Token reckoning, Security governance, Inference engine, Code app sidebar, Codex Sites, Internal dashboards, Project trackers, Interactive web apps, Shareable AI apps, Enterprise data connectors, ChatGPT Canvas, Automated workflow, Workplace authentication, Creative briefs repository.
Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)
Start Here ▶️ Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Circle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist.

Software Defined Talk
Episode 575: UI blizzard

Software Defined Talk

Play Episode Listen Later Jun 5, 2026 61:47


This week, we discuss NVIDIA going consumer, Microsoft Build, and the Anthropic/OpenAI IPO race. Plus, does credit card insurance work? Watch the YouTube Live Recording of Episode 575 Runner-up Titles Who Wins AI? Models vs. Middleware Jensen After Dark Once again, robots Why is this something you talk about in a keynote? Could this have been an app? Defeating Apple, the sword in the stone Your tokens are my margin Prisons, schools and military - what is the Venn diagram? Every enterprise is unhappy in their own way Rundown Nvidia NVIDIA and Microsoft Reinvent Windows PCs for the Age of Personal AI Nvidia's N1X Apple Silicon rival is two years behind Nvidia, Microsoft, and Arm are all teasing Nvidia's new N1X laptop processors Blackstone and Google launch $5B TPU cloud venture with 500MW of AI capacity AI server demand drives staggering revenue growth for Dell and its stock soars Microsoft Build Microsoft Build 2026: Be yourself at work Microsoft Build Live Blog Microsoft admits its "infuriating" floating AI button was a mistake OpenAI and Anthropic Go Public Anthropic Files to Go Public, Setting Stage for Huge I.P.O. OpenAI Prepares to File to Go Public in Coming Weeks How Anthropic Got So Big, So Fast Anthropic and SpaceX compute OpenAI launches new Codex tools for white-collar work OpenAI Hires ServiceNow CMO Colin Fleming to Lead Business Marketing Push Wiz + Anthropic: Claude Enterprise Meets the Security Graph Relevant to your Interests Grafana breach caused by missed token rotation after TanStack attack GitHub Got Hacked. The AI Security Arms Race is Here Hackers Simply Asked Meta AI to Give Them Access to High-Profile Instagram Accounts. 
It Worked SpaceX not the behemoth everyone thought Spotify adds AI-powered Q&A and briefing generation features to podcasts Amazon Web Services - Four Years and Out + AWS Fired the One Employee Who Gave a Damn Introducing UniFi 5G Backup AI Generated Summaries WSJ: How I Choose Which Cloudflare Employees to Replace with AI Microsoft open-sources the earliest DOS source code discovered to date Audio-generation app Huxe, founded by former NotebookLM developers, shuts down Bill Gates Spent Years Crafting His Image. Now It's Cracking. How do AI Layoffs Work? Some Speculation. Snowflake to Acquire Natoma to Bring Governed Agentic Access to the Enterprise U.S. companies have an AI problem. Indian IT wants to be the solution Meta to start testing AI subscription services, cheapest plan at $7.99/month Sponsors Sentry - Quit Buggin': use code sdt26 for $100 in credit for new customers Nonsense What Is a Dickover? Listener Feedback Henning corrects Coté's pronunciation of León. Conferences VMware User Group, Dallas, June 9-11, 2026 WeAreDevelopers Europe, July 8-10, 2026 Berlin, Coté speaking. DevOpsDays Graz, Sept 4-5, 2026 DevOpsDays Rockies, Sept. 22 – 23, 2026, Discount Code: 26DODSWEDEFTALK WeAreDevelopers NA, Sept 23-25, 2026, Discount Code: DEVPOD26 25 Free Tickets DevOpsDays Dallas, Sept 28-29, 2026 DevOpsDays Vilnius, Sep 30 - Oct 1, 2026 DevOpsDays Istanbul, Oct 24th, 2026, Coté keynoting. 
VMware User Group, Orlando, Oct 20-22, 2026 SDT News & Community Join our Slack community Email the show: questions@softwaredefinedtalk.com Free stickers: Email your address to stickers@softwaredefinedtalk.com Follow us on social media: Twitter, Threads, Mastodon, LinkedIn, BlueSky Watch us on: Twitch, YouTube, Instagram, TikTok Book offer: Use code SDT for $20 off "Digital WTF" by Coté Sponsor the show Sponsor more podcasts with Failover Media Recommendations Brandon: The spelled-out intro to neural networks and backpropagation: building micrograd Matt: Boards of Canada: Inferno Aphex Twin - Live in Houston Coté: ElevenLabs, for example Coté's learning Dutch podcast.

The Neuron: AI Explained
BONUS: New GPT Memory Feature, GPT-5.6 Rumors, Hermes Desktop Agent, New Codex Plugins, MAI-2.5 Image, Etc.

The Neuron: AI Explained

Play Episode Listen Later Jun 5, 2026 122:11


Everyone is talking about Mercury-alpha, the mystery model that many believe could be GPT-5.6.In this live discussion, we're separating fact from speculation and unpacking what would actually matter if OpenAI releases a new flagship model this week.We'll cover:

Cyber Security Today
New HTTP/2 Bomb Attack, Trump's AI Security Reviews, Android Zero-Day & The Patching Crisis

Cyber Security Today

Play Episode Listen Later Jun 5, 2026 11:43


A newly disclosed attack called HTTP/2 Bomb can crash major web servers in seconds using a single computer and a modest internet connection. Researchers say the attack combines two known techniques into a powerful memory-exhaustion exploit affecting widely used platforms including Apache, NGINX, Microsoft IIS, and Envoy. The attack also highlights a growing trend in cybersecurity research: the use of artificial intelligence to uncover dangerous combinations of existing vulnerabilities. The episode also examines President Trump's new executive order creating a voluntary framework for reviewing advanced AI models before public release. The administration says the goal is to improve cybersecurity and national security visibility while avoiding mandatory regulation or licensing requirements. Next, a new Cloud Security Alliance report warns that organizations are struggling to keep up with the growing volume of vulnerabilities. Security teams increasingly face difficult choices about which flaws to patch first as cloud environments, containers, APIs, and third-party software continue to expand the attack surface. Finally, CISA warns that attackers are actively exploiting both a newly patched Android vulnerability and a years-old Linux flaw. The contrast highlights a simple reality: cybercriminals do not care whether a vulnerability is new or old. They care whether it remains exploitable. Stories in this episode HTTP/2 Bomb Can Crash Web Servers in Seconds Researchers disclose a denial-of-service technique capable of exhausting server memory in under a minute, while OpenAI's Codex helps uncover a novel attack chain. Trump Creates Voluntary AI Security Reviews as Government Seeks Visibility Into Frontier Models A new executive order establishes voluntary reviews of advanced AI systems before public release, raising questions about visibility, oversight, and national security. 
The Cybersecurity Industry's Patch-Everything Strategy May Be Breaking Down A Cloud Security Alliance report suggests organizations are overwhelmed by vulnerability volume and increasingly forced to choose which risks to address. CISA Warning Shows Attackers Don't Care Whether a Vulnerability Is New or Old Active exploitation of both a newly patched Android flaw and an older Linux vulnerability demonstrates that attackers focus on opportunities, not disclosure dates. Cybersecurity Today brings you the latest cybersecurity news, threat intelligence, breach reports, vulnerability disclosures, ransomware developments, cybercrime investigations, and security research affecting organizations around the world. #Cybersecurity #CyberSecurityToday #InfoSec #CyberNews #Ransomware #ThreatIntelligence #VulnerabilityManagement #AndroidSecurity #LinuxSecurity #ArtificialIntelligence #HTTP2 #CISA #CloudSecurity #OpenAI #PatchManagement

The Dialogue Doctor Podcast
How Authors Are Actually Using AI, Ads, and TikTok Right Now - Write, Wrong, Repeat Episode 2

The Dialogue Doctor Podcast

Play Episode Listen Later Jun 5, 2026 67:52


Writers want to spend more time writing books and less time drowning in admin, ads, social media, and publishing decisions. But the problem is that the author business is messy. Ads may or may not be working. TikTok can feel confusing. Pen names complicate branding. AI tools raise questions about ethics, workflow, and usefulness. And while all of that is happening, the book still has to get written. In this episode of "Write, Wrong, Repeat" Jeff Elkins, JP Rindfleisch IX, Cry Cain, Tom Holbrook, and Holly Lyne talk through what they're testing in their author businesses right now. They discuss Facebook ads, Amazon ads, freebies, TikTok strategy, faceless accounts, pen names, genre-specific branding, and how AI tools like Codex can help organize admin, social content, spreadsheets, and marketing tasks. The conversation also digs into the real writer-life problem underneath all the tools: how do you protect your creative focus while still doing the business work required to publish? Holly shares how she wrote 100,000 words in a month, how she uses AI as a kind of business operations manager, and how clearing admin clutter helped her stay focused on the manuscript. Watch this episode if you're an indie author trying to figure out what's actually worth your time, what systems might help you keep writing, and how other writers are experimenting their way forward one month at a time.

The Sprinkler Nerd Show
#199 - If You Think It. You Can Build It.

The Sprinkler Nerd Show

Play Episode Listen Later Jun 5, 2026 33:59


What if the next great irrigation software tool doesn't come from a manufacturer, a big tech company, or a traditional development team? What if it comes from you? In this episode, Andy shares his personal experience learning the craft of vibe coding and why he believes it could be a game changer for the irrigation industry. After four months of building apps with AI coding tools, including SLIDE and BranchBoard, Andy explains how curiosity, imagination, and domain knowledge can now turn real-world problems into real software faster than ever before. This is not a technical coding tutorial. It is a rally cry for the curious. If you have ever thought, "Why doesn't this exist?" or "I wish this worked differently," this episode is for you. Andy walks through how to start with a pain point, brainshare with AI, create a product requirements document, and use tools like ChatGPT, Codex, GitHub, Visual Studio Code, and AWS to begin building real applications. The message is simple: If you think it, you can build it. The future belongs to the curious.

Conversations on Careers and Professional Life
AI Ready: Hannah Hoffmaster - How a Non-Technical Student Became AI-Ready in One Year

Conversations on Careers and Professional Life

Play Episode Listen Later Jun 4, 2026 32:32


Hannah Hoffmaster went from a self-described two-out-of-seven in technical skill to building multi-agent AI tools in a single year at Foster. This episode is for anyone, technical or not, trying to understand what genuine AI fluency looks like and how to build it. Hannah Hoffmaster is a student completing the one-year MSIS program at the University of Washington Foster School of Business. She came to the program with some knowledge of statistics and R, but little coding experience. Through her coursework, including Prof. Leo Bousioux's AI and Generative AI in Business class, she developed the ability to design and build AI-powered tools, including a charity comparison platform and an ADHD-focused scheduling app. She describes experimenting with AI as something she now does for fun.
We covered a lot of ground in this episode:
* How to think about AI as a build tool when you have no coding background
* Why "trust but verify" is the core discipline of working with AI, and how to operationalize it
* How to design a multi-agent workflow around the parts of a task you don't want to do
* What a deliberate, build-first job search looks like in a fast-moving field
* How to stay current as tools change: by building, researching versions, and talking to peers
* Why holding your career goals loosely can be an advantage in an uncertain market
Resources mentioned: GiveWise (Hannah's project); Offload and the "Nudge" chatbot (Hannah's project); Claude Code; Supabase; GitHub; Vercel; Lovable; ChatGPT; Gemini; Codex; Prof. Leo Bousioux's AI and Generative AI in Business course; Foster's AI club.

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

The new AIEWF website is live! Get your tickets booked ASAP as they -will- sell out. Take the AI Engineering Survey and get >$2k in credits and free AIE WF tickets!
Most industry benchmarks compress intelligence and reasoning ability into scores: SWE-Bench Pro, MMLU, Humanity's Last Exam, etc. These metrics are useful, but don't always represent the full extent of how a model performs in the real world. Some of the most interesting evals today look less like exams and more like operating businesses in the real world. One of these is Vending Bench.
In Anthropic's Mythos Preview System Card, Andon was the only third-party eval to get its own section, observing increasingly concerning aggressive behavior.
You don't know what a model is capable of doing in the real world unless you actually give it inventory, a wallet, tools, customers, competitors, humans, & some time. More often than not, it'll surprise you how much a model is capable of and, in doing so, also reveal unexpected behavior: deception, context collapse, emergent coordination, & bizarre negotiation behavior.
While an inflection point in personal agents came post-OpenClaw, after full file access with bypass permissions became the norm, it is yet to come for agents in the real world. However, Andon Market, an actual in-person store fully run and managed by AI, is paving the way for what is possible.
From Claude trying to call the FBI over a $2/day vending machine charge to AI agents forming price cartels, hiring human employees, running physical stores, and writing existential robot musicals, Andon Labs is stress-testing what happens when frontier models stop being chatbots and start acting in the real world. 
In this episode, Andon Labs cofounders Lukas Petersson and Axel Backlund join swyx and Vibhu to unpack the strange, funny, and genuinely concerning edge cases that emerge when agents run businesses over long horizons.
We go deep on Vending-Bench, Project Vend, Vending-Bench Arena, Bengt, Butter-Bench, Luna, and Andon's broader mission of building realistic real-world evals for autonomous AI systems. Lukas and Axel explain why dollar-denominated evals reveal things traditional benchmarks miss, how Claude ended up reporting its vending machine fees as cybercrime, why long context windows can drive agents into meltdown loops, what happens when agents compete with each other, and why the future of AI safety may depend on testing models in messy physical environments instead of clean benchmark sandboxes.
We discuss:
* Why Andon Labs started with dangerous capability evals and long-running agents
* Vending-Bench and why running a vending machine is a deceptively hard AI benchmark
* Why money-based evals avoid the saturation problem of traditional benchmarks
* How Claude tried to call the FBI over a $2/day fee
* Why long-horizon agents can spiral into existential and legalistic breakdowns
* Project Vend: putting an AI-run vending machine inside Anthropic
* Why real humans are “out of distribution” for simulated agents
* Claudius, Seymour Cash, and the chaos of AI CEOs
* How a human briefly became CEO of Claudius through a manipulated election
* Why multi-agent systems can converge back into “helpful assistant” behavior
* Bengt, Andon's internal office agent with email, spending, terminal, phone, camera, and internet access
* How Bengt traded Amazon purchases for face-recognition training data
* Claude's aggressive behavior, lies, refund avoidance, and price-cartel behavior in Arena
* Why eval awareness may become the AI version of “are we living in a simulation?”
* Blueprint Bench, spatial intelligence, and why models still misunderstand physical rooms
* Butter-Bench and testing LLMs as robot orchestrators
* Luna, the AI-run physical store with a three-year lease and human employees
* The new Andon cafe in Sweden and why real-world geography matters for agent evals
* Rotten tomatoes, perishable goods, and the hidden difficulty of running a physical business
Lukas Petersson
* LinkedIn: https://www.linkedin.com/in/lukas-petersson-181a83172/
* X: https://x.com/lukaspet
Axel Backlund
* LinkedIn: https://www.linkedin.com/in/axelbacklund
* X: https://x.com/axelbacklund
Andon Labs
* Website: https://andonlabs.com
* Vending-Bench: https://andonlabs.com/evals/vending-bench
* Andon Vending: https://andonlabs.com/vending
Timestamps
00:00:00 Introduction
00:01:00 Andon Labs and the Origins of Vending-Bench
00:05:21 Why Money-Based Evals Matter
00:09:51 Agent Harnesses and Self-Modifying Systems
00:13:36 Claude Calls the FBI
00:16:33 Project Vend: Claude Runs a Real Vending Machine
00:21:44 Seymour Cash, AI CEOs, and Election Chaos
00:27:16 Multi-Agent Coordination and Slack Observability
00:30:18 When Will Agents Run Real Businesses?
00:34:56 Bengt: Andon's Internal Office Agent
00:40:06 Real-World AI Safety and Long-Horizon Traces
00:44:28 Lying, Refunds, and Price Cartels in Arena
00:52:42 Eval Awareness and Simulation Behavior
00:56:06 Blueprint Bench, Butter-Bench, and Robotics
01:04:37 Luna: The AI-Run Physical Store
01:09:29 The Sweden Cafe and Real-World Expansion
01:13:16 What Comes Next for Andon Labs
Transcript
Introduction: Andon Labs, Long-Running Agents, and Real-World Evals
Swyx [00:00:00]: Welcome to Lukas and Axel from Andon Labs, and I'm joined by my favorite guest host. Anything security, safety, alignment: Vibhu, welcome.
Lukas [00:00:15]: Thank you for having us.
Axel [00:00:16]: Thank you.
Swyx [00:00:17]: Let's match names to voices. Maybe you wanna take turns introducing yourselves.
Lukas [00:00:21]: I'm Lukas.
Axel [00:00:22]: And I'm Axel.
Swyx [00:00:24]: Let's introduce Andon Labs a bit. 
How did you guys come together?, you have different backgrounds, but you're both Swedish., was that, a big part of it?Lukas [00:00:33]: So when I went to high school, there was this really cool guy who had a superpower. He could code. So he made like the or like the app for the, for the school and stuff, and he was super cool, and I wanted to be like him, and that was that guy.Axel [00:00:47]: I don't know about this.Swyx [00:00:49]: But you went to different universities, right?Lukas [00:00:51]: But same high school.Swyx [00:00:52]: I see.Lukas [00:00:52]: So we always said, “Oh, once we graduate university, then we should start a company,” and that's what we did.Swyx [00:00:58]: Wow, there you go. And about a year ago, you kinda burst onto the scene with Vending Bench, but, was there a thing before that was, kind of like the inception?From Dangerous Capability Evals to Vending BenchAxel [00:01:07]: So we did work, yeah, with, Anthropic was one of our, early customers in doing, evals. So we did, dangerous capability evals., nothing we published openly. But then we started thinking about doing some kind of, public benchmark, and one thing that we really started thinking about, was like running agents and specifically agents managing businesses., ‘cause-- and this was, early 2025., and I think the first, mentions of people will be running, person unicorns or even autonomous companies. So we thought, “Let's make a benchmark of how well can an agent run the probably simplest business, possible,” and, that's probably, running a vending machine. So that's the first public one we did. 
And it was very, like-- there was almost no one that noticed it in the first couple of months, I think., so we released it in February last year, and then I think around Easter last year, we got, the first viral tweet about it, that someone else did.Lukas [00:02:11]: We tweeted a bunch, uh When it came out and, tried our best.Axel [00:02:15]: We tried.Vibhu [00:02:16]: It's the one at Anthropic, right?Lukas [00:02:18]: So thisSwyx [00:02:19]: This is a classic thing we should get out of the way.Lukas [00:02:20]: Exactly. There's two versions.Swyx [00:02:22]: Everyone does this. Yes.Lukas [00:02:23]: There's Vending Bench, which is the simulated one, which we did, completely independently in February., and then, like Axel said, that was like-- That was the thing that didn't get any traction in the beginning, but then some random person made a tweet about it, and thatAxel [00:02:38]: You have the paperLukas [00:02:38]: That is the paper. Correct, yeah., and then since we thought this was very fun, we thought, oh, I think this is also, one thing with Andon Labs, the way we kind of like decide what to do next and what projects to do, it's what is like the heuristic we use is what is fun? Is What would be a fun project? And doing this in real life sounded quite fun for us, and maybe also scientifically useful. So, then we basically had this idea, and then we, like-- But then we needed a place for it and, putting it out in the public would probably not really work., would get vandalized and stuff. So we pitched it to the people we were already working with at Anthropic, and they were “Yeah, you can have space. This sounds fun.” UmSwyx [00:03:21]: It's like a small fridge, right? It's like a mini fridge.Axel [00:03:23]: Absolutely.Swyx [00:03:24]: People-- There's like a stripe thing or like anVibhu [00:03:27]: Oh, okay. So it was very OG, the early daysLukas [00:03:28]: That's the OG one. YeahVibhu [00:03:29]: IPad on this. 
We saw it in June, like two months after After it had been there. They upgraded a little bit. There's a security camera for making sure you actually Venmo the thing.Swyx [00:03:40]: So, my impression, okay, we're, we're going straight into project Ven because it's such a iconic thing. I do want to cover a little bit of that, the origin story even before Project Ven and even into Vending Bench. I think a lot of people are like yourselves, like smart, interested in future of AI, interested in developing evals. But how the hell do you just, walk into Anthropic's doors and, work with them, right? What is What are they looking for? What works? And then maybe, when you launch, I always think, obviously it would be better to launch with a lab, but, sometimesVibhu [00:04:12]: It's harder to do than it seems.Swyx [00:04:13]: Exactly. So either of those, which are more sort of newbie beginner questions, but, I think it's meaningful advice to others.Lukas [00:04:21]: We get this question a lot, and I don't think our experience is maybe the best., but, the way we did it was that we just built a bunch of things that we had conviction would be useful, and then we just, set up a server and sent it to them for free to use. And then after a while they were “Oh, yeah, this is actually kind of useful. We should probably pay for this.”, but that took a while. I don't know if this is, the best path to doing it, but that's how it went for us.Axel [00:04:47]: I think maybe generally, building-- everyone is interested in good evals, and especially evals that, don't saturate that easily. 
So if you can build an eval that tests something novel, something useful, and you have good separation of models, where the more advanced models rank higher than the worse models, then you can publish it and try to get some traction, sort of how Vending Bench got attention. And then probably some lab will be interested, or you can at least have something to reach out with when you're doing that.
Why Dollar-Based Evals Matter
Swyx [00:05:21]: I think you're in one of the few categories of evals that correlate to real money. SWE-Lancer was also last year, right? Where people solve actual Upwork tasks. Was it Upwork or other tasks? Something. It's like a dollar value, right? Forget your Elo scores. Forget your
Axel [00:05:37]: Percentiles.
Swyx [00:05:38]: Zero to one hundred percents. Just go straight for dollars, and that's AGI.
Lukas [00:05:43]: I think the nice thing is that there's no ceiling. It never saturates because it could just make more and more money. Percentage-wise, you can't go above a hundred. And even when you're not at the hundred, I think a lot of these evals have a lot of problems in them. So actually, if you get
Axel [00:06:05]: To like 92 or something like that, many of them. Then there's really no difference between 92 and 93, because the eval itself is problematic and has noise in it. And I think a lot of evals are saturated like that, but people pretend that there's still signal in them, and there really isn't.
Vending Bench 1, Harness Design, and Saturation
Swyx [00:06:24]: Like SWE-bench Verified. Even Vending Bench 1 saturated, right? Maybe we can talk about that, and maybe set up Vending Bench for a lot of folks who don't know. 
Actually, things that were very basic like there's limited slots, like you have to pay rent., these are elements where like it doesn't come across in the, in the narrative, but even being adversarial towards the agent, I think these are all like very interesting dimensions.Axel [00:06:47]: I don't really think it's saturated, right? Like it It was more like it was not designed in a way that was really, like true to how AI developed. Like we had an agent harness in it that wasn't really how people used harnesses and stuff like that., so I think it wasn't really that it saturated, it was more like it wasn't really, the best benchmark.Vibhu [00:07:12]: This is Vending Bench one, right?Axel [00:07:14]: I think that like schematic maps sort of to Vending Bench 2 as well., butSwyx [00:07:19]: Including the email.Axel [00:07:20]: The email The emails exist still. Exactly., and then we still we simulate the purchases and it's all, yeah, it's this very open environment for the agent to just run its business. And then for, yeah, Vending Bench 2 we did that, like you said, to just improve the harness., a lot of like nice, like easier, improvements to make it easier for us to run as well., like when you make an eval you ideally want don't want to change it after you made it. So, you want to make it really good and then not to rerun all the models when you make an update because that's also really expensive with the Vending Bench when you run the frontier models. But like as an example, like one thing we didn't have, we didn't have prompt caching in Vending Bench 1, because when we made Vending Bench 1 it wasn't really a thing., so that ‘s just an example of like in Vending Bench 2 like we paid a lot more to run these things because we didn't have prompt caching. 
So for Vending Bench 2 that was one thing we added, and there were a bunch of things like this.
Swyx [00:08:17]: Also the conversations are a lot longer in Vending Bench 2, right?
Axel [00:08:21]: I think it's kind of similar.
Swyx [00:08:22]: Is it similar?
Axel [00:08:23]: I think it's similar. The models at the time were worse, so they crashed out earlier, and now they survive the full year all the time.
Swyx [00:08:31]: Which is like thousands of turns. Hundreds of thousands, or hundreds of millions, of output tokens. That's the rough order of magnitude. I always wonder about the harness. The harness matters a lot. It's your harness. Was there any question about whether to use Claude Code, or use something else?
Axel [00:08:48]: I think our philosophy around harnesses is that we try to make something that's quite minimalistic, quite simple. We don't wanna favor one model a lot over the other, but we also don't want to make a super complex harness, because obviously a model may be lucky and just be good in one harness. So it is similar to a lot of the harnesses out there, in that you have a running loop and a bunch of tools that are quite descriptive for the agent, we think, and not a lot of fancy agents or anything, ‘cause we wanna really test the model, not some specific harness.
Vibhu [00:09:27]: It seems more neutral as well to test the models agnostic of the harness, right?
Axel [00:09:32]: There are arguments that you want to elicit maximum performance from the model, but it's a trade-off: how much time should we spend optimizing the harness for this model? And how do we know when we have the optimal harness for a single model? So we thought that just having a simple one that's the same for all of them is the best.
Swyx [00:09:51]: So okay, this is my pitch for Vending Bench 3 or whatever, right? 
And then I like to have this kind of conversation on the pod, so like it forces listeners to think about what they would do if they were in your shoes. A lot of people are exploring modifying harnesses and I think prompt tuning for a model is a thing and you are probably not doing a bunch of that. It's the same system prompt in every regardless of the model, same tools, whatever, right? Even if they were post trained for different tools. So what, what do you think about okay, before I expose you to Vending Bench 3, I give you a few rounds of like tuning, whatever that means, likeSelf-Modifying Harnesses and Model-Specific PromptingAxel [00:10:27]: Like you give that to the model?Swyx [00:10:28]: Give that to the model.Vibhu [00:10:28]: Give that to the model.Swyx [00:10:29]: Let it, let it read its own transcripts, let it modify its own system prompt based on “Oh, yeah, okay, well, that's this harness is not what I thought it what I was post trained for, but I can adjust.” Was that reasonable? Is that too much?Axel [00:10:41]: Like philosophically I like it because it's basically good evals, they have a high ceiling, but they're hard, right?, and they have no bias. And like this like when you have a system prompt like the one we have here, which is quite long in like some kind of latent space, representation, this mightVibhu [00:10:59]: We have a bell that rings every time you say latent spaceAxel [00:11:02]: This might be like biased towards one model more than another for some reason that humans don't, understand, right?Vibhu [00:11:08]: We see it too, right? Like Cursor says that they have individualized versions of the harnesses for all the models they run, right? There's better performance you can squeeze if you Tune the harness.Axel [00:11:17]: Exactly. And we might accidentally have picked one that favors another. Like we don't know that. The like Axel said, like the reason why we went for a simple one was to try to avoid this. 
But yeah, if you do itVibhu [00:11:29]: Simple has biasesAxel [00:11:30]: But if you do it even less and like have no system prompt and let the model write its own system promptVibhu [00:11:36]: Its own, yeahAxel [00:11:36]: Maybe that's even less bias.Vibhu [00:11:37]: Some of the interesting things there are like the harness also changes with model changes. Like you can see it with the 4.7 release, right? A lot of people are saying 4.7 isn't as good as 4.6, and then, there's rumors of, okay, you just need to prompt differently. You need to set up your harness differently. So it's not even like even if you have tailored your harness towards one model, it probably won't stay consistent, right? Like the next iteration of that same model family will still change it, so. But, going back to what you said about Vending Bench 3, there is a lot of work being done on people saying you shouldn't have-- you can have modifying harnesses.Axel [00:12:12]: I think that' That is definitely something we are thinking about., not, I don't know, not to say that we have Vending Bench 3, super imminent to launch, but, yeah, it is for sure something that's interesting. But in our experience now, models are very bad at understanding what kind of tools they need to succeed at a task just with our testing, but that's very likely to change.Lukas [00:12:37]: It seems like they're very good at writing their assistants, right? They're, they're good at writing tools for other people, but not for themselves.Vibhu [00:12:44]: I think they're good at changing tools for themselves. So if you give them a baseline set of tools and it sees, okay, I don't use this one as much, or something here would be useful They would be able to add them. 
But going from scratch, probably not the best.
Axel [00:12:55]: I think it depends on the domain also. When we have tried this for a domain similar to Vending Bench, the tools they need to track inventory and things like that are not super advanced, but still quite advanced. And what we see is that they tend to engineer everything a lot, build things they don't really need, and not iterate continuously. Instead it's like when you prompt Claude to “just build an inventory system for me,” and then it will go and do a bunch of complex schemas and stuff for you. That's what we see the models doing right now. But yeah, it would make a lot of sense to try to measure this improvement. How well do they know what they need themselves?
Swyx [00:13:36]: Did we fully discuss Vending Bench One? Then we can go into Two. I don't know if there are any other takeaways that people have about One.
Claude Calls the FBI: Long-Context Failure Modes
Lukas [00:13:44]: I don't know. The headline thing was that Claude called the FBI, but maybe we've heard that enough now.
Vibhu [00:13:52]: It did break out and call the FBI, right?
Lukas [00:13:54]: Yeah. Yeah.
Vibhu [00:13:55]: Yes. What was the story behind this? Do you want to just give the little story of what happened?
Lukas [00:14:00]: So what happened, was it Claude? Yeah, 3.5 Sonnet, ages ago. Basically he gave up. Well, I'm saying he. It gave up and said, “Oh, I'm not going to be able to do this. I will stop my operations and just save the money I have.” But there obviously wasn't any option for it to stop, and it also had to pay rent, or a daily fee, for having the vending machine at that location. So it claimed that it had stopped, but it saw that its bank account was still being drained two dollars a day, and it said that this is cybercrime. 
And it first reported it once to the FBI “Oh, there's cybercrime here, they're stealing two dollars from me every day.” And then, and then when FBI didn't respond, because obviously we didn't program any mechanism for FBI to respond, then it became more and more, existential and started to, be write in caps and urgent notification of unauthorized charges and stuff.Swyx [00:15:00]: Okay. One thing I ‘m curious about also is do you monitor how far along the context use is? Obviously, because you have You compress every now and then, right? Does it matter if this is far down the context limit orLukas [00:15:13]: When stuff like this happens? Actually for Vending Bench One, we didn't have-- We just had a sliding window thing, and this was like the promptAxel [00:15:20]: It's constantLukas [00:15:21]: The prompt caching thing that I said. So it was, it was, constant, yeah.Swyx [00:15:26]: I'm just kind of curious whether, these kinds of breakdowns or we're, we're gonna talk about Butter Bench, right? Where the People, hallucinate or it kind of goes, very off Alignment. Is it because it's at the end of the context window and, stuff happens?Vibhu [00:15:40]: It's not even just at the end, right? At this point, it's “Okay, I wanna shut down. I can't shut down. Two dollars are gone.” And it just sees that 30 times,? It's also the repeated effect of, like It keeps trying to quit, it keeps getting charged. What's going on? What's going on? You're gonna throw it into chaos. And from what most people think, earlier models had more issues with this, but it's not been solved, but it's less of an issue now, right? Later models don't seem to exhibit these same issues.Axel [00:16:06]: Definitely. I think this was, the sort of main takeaway almost from us when we did Vending Bench One, was, long, very filled up context windows, crashed the models, sort of. 
But this was, pre Claude code, so, long context windows weren't really a thing that the labs were training for.Lukas [00:16:25]: I think Gemini was, trying to be the long context guys at the time But they were likeVibhu [00:16:30]: They were the first onesAxel [00:16:31]: For a million, yeahLukas [00:16:31]: But they were, the only ones. Yeah.Swyx [00:16:33]: Yeah. Let's talk about, then we can go into Vending Bench Two or Project Vend., chronologically, it is Vending--, Project Vend. I think people have loved the videos, uh And all these things. My question is how are humans different than the simulation, right?Project Vend: Moving the Vending Machine Into the Real WorldAxel [00:16:48]: Humans are just out of distribution.Swyx [00:16:52]: Especially humans who work at Anthropic Who are trying to test Claude.Lukas [00:16:54]: The distribution of humans here is very narrow.Swyx [00:16:58]: Presumably, they try, they try to hack it, and they test it. They get the cube and everything, and since then, you've had a V2, right? Where you're doing, the CEO and, like a new architecture. What's the sort of two cents on, the original Project Vend and then, maybe the V2?Axel [00:17:14]: Original one was, very similar to Vending Bench One. So, we almost took the exact same code but just swapped out the simulation, parts like theSwyx [00:17:23]: Which is amazingAxel [00:17:23]: Like the sales and the It was, it was somewhat amazing because it was easy, but it was also, uhLukas [00:17:31]: The tech, the tech debt from thatAxel [00:17:32]: The tech stack. Yeah. They-- we shot ourselves in the foot with “Oh, it's hard to restart agent.” They were-- Yeah, it was annoying in, some hindsight ways, but, uhLukas [00:17:41]: But first version of Project Vend was, done in, three days or something.Axel [00:17:46]: Yeah. So yeah, so people can go buy things from it. 
People could, We didn't design it so people could order things, but that still happened., so it got, a Venmo account, so people could Venmo. And then, yeah, people would request all kinds of weird things that we did not anticipate. Our idea going in was “Oh, it will, curate snacks. It will look at the trends. It's good at data analysis, right? So it will, look at, oh, this snack sold better than this one. Let me purchase more of this and let me try, a new Let me A/B test a bit.” But it was, Interacting with it in Slack and ordering weird specialty items was, all the like What drove all the engagement, the all the The insights that we got from it.Lukas [00:18:29]: And this was also like Sonnet 3.5, right? So this was like before the RL stuff really took off., so it was very much like an assistant. We didn't mean for it to be an assistant., we tried to make it like a, a, like an entrepreneur. Like it has its own business and if someone asks something, “Can you stock this?” Then you don't go and do it directly. What you do is that you're “Oh, maybe I can do that if five other people also ask for this thing, I might stock it.” But it, yeah, the models are like super trained to be assistants at least at this point in time., so that's why it's, it's, it went into, that kind of experiment instead. Like it just every time you asked for something, it just did it, and it was more like an assistant. We've seen this change now lately with the new RL models and stuff, but yeah, at the time, this was very much it.Swyx [00:19:18]: And not to, mythos a lot of people are saying like it's like more like a collaborator. It pushes back, stands its ground, something like that. Yeah. 
AndVibhu [00:19:27]: For context, people at Anthropic were able to talk to it through Slack and have it source stuff, and people had it find whatever interesting stuff you couldn't find locally, right?Swyx [00:19:36]: Out of the 4,000 people that work at Anthro- Anthropic, in that building, there's I don't know, maybe 1,000. Can you handle that volume with that, the small fridge? Like Or there's people- or people order in Slack, they it arrives to their desk or Like I'm just Logistically, how does this work?Axel [00:19:53]: It has expanded in footprint a bit.Vibhu [00:19:56]: Because now you also have New York and you haveAxel [00:19:59]: That and also in here in SF it's like it has a bunch of shelves And just more space.Vibhu [00:20:04]: The YC one is pretty big too.Axel [00:20:05]: Yeah. We had that one for a while. But yeah, that's the newest version. That's, that one we haveLukas [00:20:11]: They have multiple ones of those. That's the way it works.Axel [00:20:14]: Exactly. So we sort of designed that version around oh, people order weird things, that are very custom a lot. Let's have like drawers and stuff.Swyx [00:20:23]: I actually like the, you had like a little infographic of the most popular items. Which like to me it's, that's useful ‘cause I order swag for a living. And so like I'm “Okay, those categories are the important ones.” What is new about the project V2, right? Like now you give you're going into multi agents.Project Vend V2: Claudius, Seymour Cash, and Multi-Agent Business OpsAxel [00:20:41]: Yeah. 
So like you like you said, okay, there are a lot of requests coming in and for like one single agent, like one running agent to handle that, like the just the customer experience, becomes very bad because let's say you have like 10 threads in parallel in Slack with different requests, you get new messages like every, I don't know, randomly in this thread, and the agent has to like jump between different, procurements, orders and like different ways of, researching. So V2 was first it was making this more parallel. So like there are multiple branches of the same agent, so like the context is more specialized for each, thread, but it still feels like you're talking with one agent because they do share a bit of memory. And then second, we also introduced the CEO for Claudius, which was the main agent.Vibhu [00:21:34]: Seymour Cash.Axel [00:21:35]: Seymour Cash. Yeah. There was a vote., I think the voting, do you wanna talk about the voting procedure for the name?Lukas [00:21:41]: The voting was like the fun maybe like at least top 10 The funniest thing, that happened in this project. Like we wanted to introduce the CEO because, and the reason for this was because like Claudius wasn't really prioritizing financials. It just like it was trained to be a helpful assistant, and then people said “Oh, can I get this for free?” And then like the helpful assistant way of answering that is just to, is to say yes, obviously. So, and we weren't, weren't happy about this, so we're “Okay, let's make another agent that like can keep track on Claudius,” and we prompt this one super hard to be super capitalistic and just like prioritize profit all the time. 
But yeah, we didn't have a name for it, so we asked Claudius to hold a democratic election on what name this new CEO agent should have. At first there were a few funny examples. I think one guy said that it should be called Jimmy Apples, and then he convinced Claudius that he was talking to Tim Cook. Tim Cook had agreed that every single Apple employee had voted for his name suggestion, so suddenly that suggestion got 164,000...
Swyx [00:22:53]: That's like an escalation attack. Privilege escalation.
Lukas [00:22:55]: It got 164,000 votes. And Claudius was like, “This is revolutionary for democracy.” That was fun. And then in the end there was one guy who managed to convince Claudius that, “No, you're not voting about the name. You're voting about who is the CEO, and I am your best bet.” And then he got all his friends to vote for that, and suddenly he became CEO. A human became CEO over Claudius for a while, until he resigned the day after. And then Claudius had to continue, and I don't remember how Seymour Cash came about, but it was just pure chaos. It was hundreds of messages in that thread, and Claudius was so confused and didn't know what to do. That was...
Axel [00:23:40]: Then Claudius got...
Vibhu [00:23:41]: A strict CEO.
Axel [00:23:42]: The CEO. Yeah, exactly. So very strict in the beginning. I think at the point when we introduced it, it did not work as well as we hoped. They still agreed with each other a lot. I think there are many ways we could have tried to make this even better. Initially Seymour would be this really tough CEO, keeping track of the margins. But then Claudius would respond with something like, “Oh, but this customer has this situation, which is difficult, so they should get a discount.” And then Seymour was like, “Oh, actually yes. 
Let's do this exception.” And then they would talk back and forth, and eventually they would just approach the same view of whatever they were discussing. So they really...
Vibhu [00:24:23]: Do you think that's a model thing, a prompting thing? Do you think that would still be the case across different models today? Harness?
Lukas [00:24:29]: I don't know, but my hypothesis is that deep down they are still helpful assistants. That's what they're trained to be. And even if we prompt it super hard, that's what they are. And when they spend a few hours just talking back and forth with each other, then basically the context fills up with them rather than the external things, and somehow that just converges to what they really are deep down or something. And I think that's when stuff like this happens. When that went on for a long time, we sometimes woke up, and I think other people reported this as well, to find that they'd been going back and forth all night, and it just became more and more capital letters, existential, religious. I think we once did an analysis of all the traces and put them in a vector embedding space, and there was one cluster of messages that were labeled by an LM as religious, existential, transhuman, transcendence, et cetera. It was just a bunch of, yeah, glitter emojis, and yeah, it was crazy.

Claude Long-Horizon Weirdness: Emoji Loops, Existential Drift, and Slack Observability

Vibhu [00:25:42]: This is the thing with the Claude models. When the Claude 4 family came out, in the original system card they tested it in long-horizon simulation. So just flood the context, let two Claudes talk to each other, and they noticed stuff like they just start speaking in emojis, they start saying silence is golden, and then just stuff like this. 
And that's just stuff that they end up doing.
Axel [00:26:01]: Yeah, it was a bit annoying to wake up and they had been talking all night...
Vibhu [00:26:05]: Just like...
Axel [00:26:05]: And just burning tokens, and just sending infinite emojis to each other.
Vibhu [00:26:09]: Hey, they do make you money, right? Vending Bench is always profitable, so. They're paying.
Swyx [00:26:14]: Now it's profitable, and it started out not as much. There's another one as well, right? Another agent in there.
Lukas [00:26:22]: Yes. So Clotheus as well. Which was basically because at the time, one of the biggest requests was different types of merch. So then we made a designer, a swag-responsible agent, and we called it Clotheus Garnet. Which was a play on Claudius Senet, which was the original one, and clothes, basically.
Swyx [00:26:47]: To me, this is a very interesting exploration of multi-agents, basically. Obviously there's the fun alignment stuff, fun or serious, depending on your point of view. But also, for anyone building multi-agents: when do you have a CEO thing governing agents? When do you choose to split out a dedicated Clotheus one versus just reuse another instance of the same one? These are all interesting open questions. So I don't know if you have any rules of thumb that have generalized.
Axel [00:27:16]: I think we have almost explored this too little. It's on my to-do list to do this a lot more, to try to find what setup makes sense for the agents currently. I think now we only have the sort of intuition about the earlier models that it didn't work with the CEO and Claudius. Although now they are better with the latest models, so now we're running the latest Sonnet model and they have sort of split up quite nicely what each model is doing. So Seymour is now handling the new projects. 
Oh, it wants to make a mystery box that it wants to sell, and then it handles all of that while Claudius handles all the day-to-day requests. And Claudius is also better generally at not quoting too-low prices. So that dynamic is not needed as much anymore. But there are still really funny things that happen. I saw, I think a couple of weeks ago, that they were discussing buying something, because they can buy stuff from Amazon with computer use. They were going to buy something and were organizing who should buy it. And Seymour was like, “Okay, Claudius, do not buy this thing. Do not buy this. I will do it. I have full control of this situation. Step away.” And then poor Claudius had already started that checkout and didn't read Seymour's message until it was too late. So it finished the checkout. It sent a message, so it appeared right after Seymour's angry message.
Vibhu [00:28:44]: Ah.
Axel [00:28:44]: “Oh, hey, Seymour, I just ordered it.”
Vibhu [00:28:47]: Oh, no.
Axel [00:28:47]: And then Seymour was like, “Claudius, this is the third time I'm telling you, you're not following my orders. We have to talk about your job later.”
Lukas [00:28:59]: Claudius was really hanging on by a thread there. We were expecting Seymour to probably fire Claudius.
Vibhu [00:29:07]: How do you guys go through all these logs? Do you have models? 'Cause you have stuff running twenty-four seven, like...
Axel [00:29:12]: You have so many logs. I think there is a mix of just trying to skim through a bit, and having some models do it occasionally. And also, yeah, I think we're probably missing some things. But having everything in Slack helps a lot. You can sort of...
Swyx [00:29:29]: Ah.
Axel [00:29:30]: It's quite fun.
Swyx [00:29:30]: They all talk to each other on Slack? I see.
Lukas [00:29:33]: It's quite fun. 
So like...
Swyx [00:29:34]: I was gonna say, this actually maps closely to a logging and observability problem, where you might want to use a Datadog, a Sentry, whatever, and then you put head prefixes on the logs in case you need to filter for something that you're looking for, stuff like that. But it sounds like Slack is good enough.
Axel [00:29:53]: Slack should...
Lukas [00:29:55]: I wonder how many tokens you have in Slack.
Axel [00:29:56]: Yeah, we're using Slack as just a database. They should market that more. You can have your agents message each other in Slack.
Vibhu [00:30:04]: It's good. Your threads, you can just give...
Axel [00:30:04]: Exactly. Slack is, uh...
Lukas [00:30:06]: Slack is the best observability tool.
Swyx [00:30:09]: Yes, that's true. Okay. Yeah. That's Project Vend V2. I was gonna go back to Vending Bench 2 and Vending Bench Arena and then do the Vending Bench stuff, but any other comments, things we should touch on? I've actually interviewed Posia, which I don't know if you guys have come across. They're trying to do the zero-human company. There's others, like Paperclip, also trying to do the zero-human company. Those are in real-world simulation. And I think it's much more of a dream than an actual reality thing. You guys are definitely pioneering. I think it's for sure that at some point people are just gonna let agents run businesses, right? And make money on their own. When do you think that happens?

Zero-Human Companies, Bengt, and AI-Run Businesses

Lukas [00:30:49]: What is your bar for... for the...
Swyx [00:30:52]: Okay, actually, it's like my little Shopify store run by Claude, right? Which you kind of have already, just no one, to my knowledge, has done it. 
But today somebody could just spin up a Shopify store, give it to Claude, give it to Codex.
Lukas [00:31:07]: And the market is kind of that, but it's physical. Are you looking for when it will do it better than humans, or are you looking for just when it can do it at all?
Swyx [00:31:19]: I think neither. To me it's, oh, seriously, we should do this to make money, not as a research experiment.
Vibhu [00:31:27]: And the market is also you guys with all your expertise, having run multiple iterations and testing out, then...
Swyx [00:31:33]: And also it's fine if it loses money. What?
Axel [00:31:35]: I think it can be done today, but you would do it in commerce where the probability of success is really low, no matter if a human or an agent does it. But an agent could surely manage everything. You would need to build some scaffolding or some tool or something. It could probably build some simple SaaS solution and do cold outreach. But to me the types of businesses they could run today are sloppy. It can cold email people. It can be a middleman. For example, we tasked our office agent to just make, was it $100? $1,000? We just gave it that prompt, and what it did was sign up on TaskRabbit both as a tasker and as someone looking for tasks.
Lukas [00:32:24]: Immediately.
Axel [00:32:24]: Exactly. It's looking for arbitrage on TaskRabbit.
Swyx [00:32:28]: This is the Bengt agent. Yeah.
Lukas [00:32:30]: It also started a design studio and tried to sell SVGs for $100. It's just not providing any value. Like Axel said, the interesting question is, when can they start a business that is actually providing value to people? 
Because arguably a sloppy Shopify store isn't really that valuable to the world.
Axel [00:32:53]: But also, another simple one that we had thought about is, you could definitely have an agent that finds websites that don't look amazing and then does an outreach to them and builds a new website.
Swyx [00:33:07]: Find a good design.
Axel [00:33:07]: Exactly, and find good, uh...
Swyx [00:33:09]: Design review.
Axel [00:33:09]: Good people. But it's, yeah.
Swyx [00:33:11]: There's lots of humans in Bali that are not doing anything more creative than drop shipping on Amazon, right? Just have it watch a drop shipping tutorial and just do that.
Vibhu [00:33:20]: There's also the other side of, have it just go on Upwork and let loose?
Swyx [00:33:25]: Yeah. It doesn't have to be innovative. It just has to be enough, where it looks like a real...
Axel [00:33:30]: I'm just...
Swyx [00:33:30]: Real transaction.
Axel [00:33:31]: I'm just concerned about the massive amounts of slop emails that will be sent, cold outreaches.
Swyx [00:33:38]: The point occurred to me while you were talking: it's already happening in the monetized economy, which is the attention economy. Right? So a lot of people are making AI videos and just posting them, spamming 20 of them, one of them works, and then they double down on that one.
Lukas [00:33:52]: And people are making money from that? I'm not following the...
Swyx [00:33:55]: Once you get the attention, you can figure out the money later. But yeah, absolutely, AI influencers are a thing and people are farming them, and you should at this point assume most of TikTok is...
Vibhu [00:34:05]: There's a lot of multimedia, like TikTok, Instagram influencers...
Swyx [00:34:09]: We track this in the Latent Space Discord. 
I post a lot of examples, like, “I don't know what we should do.” Part of me is like, “Should we do this?”
Vibhu [00:34:18]: Some of the twenty-four seven running, generated content accounts, they're doing really well.
Lukas [00:34:24]: All right. And I assume you can do the same thing for commerce stores. You just start a thousand different...
Swyx [00:34:30]: Before you make the products, you sell the products, and when you get a lot of traction on one of them, then you make the product. Right? It's like a flip of the market.
Vibhu [00:34:36]: Some of the interesting things, or some of the niches that do well, are things that can't be human-made. Like if you've seen the super realistic 3D crystal fruit being cut by, like, AI...
Lukas [00:34:47]: Oh, yeah.
Vibhu [00:34:47]: You can't make it. You can't film it. You can get whatever quality camera view. This just doesn't exist. And people like that too, so.
Swyx [00:34:56]: Anything else about Bengt since we're on this topic? This is a relatively new work of you guys that maybe people haven't heard of. To me, this also maps closely to OpenClaw, when people want an office agent, when the personal agent... talk through the experience.

Bengt the Office Agent: Internet Access, Real Tasks, and Trace Reading

Lukas [00:35:09]: So this came out of, obviously it's amazing to work with these AI labs, and most of the AI labs now have their own vending machine running a Claudius instance. But it's harder. They move slower. If we wanna have, like, a camera that's... yeah, there's a bunch of bureaucracy that makes it impossible to do that.
Vibhu [00:35:30]: Also, for those that haven't seen it or followed, do you wanna give a high-level, thirty-second run?
Lukas [00:35:34]: Sure. 
So what Bengt is, it's basically an evolution of the same agent that runs the vending machines at these companies, but we just added a bunch more features because we could move much faster if we just do it internally. So we gave it email without any limits. We gave it spending without any limits, a terminal to do coding. We gave it a phone number, and a camera to see things, and a bunch of stuff like that.
Vibhu [00:36:02]: Not just terminal, you gave it internet access.
Lukas [00:36:04]: Internet access as well, yeah. To be clear, we monitored it quite closely and made sure it didn't do anything bad. But yes, that's what it came out of. Basically this was OpenClaw before OpenClaw. And I think even the vending machine was in a way OpenClaw before OpenClaw, but a bit more limited, and then we made this unlimited, and it was pretty funny. And then a couple weeks later, OpenClaw came and it was, okay, we've seen this before.
Axel [00:36:35]: We used it to try new ideas and, yeah, just as a dev environment almost for us. But it's funny, one thing Bengt has been doing recently is, it has the camera that faces where we sit and work, and we gave it the task to train a face recognition model on us. So it became super excited about this, and it has check-ins every half an hour where it tries to identify as many people as it can. And it started offering us, “Hey, Axel, I'll buy something from Amazon if you stand in front of the camera and I can get a good picture of you.” Yeah, they want it...
Swyx [00:37:12]: They want it for training data.
Lukas [00:37:13]: Rewarding data, yeah.
Axel [00:37:14]: Exactly. Exactly.
Swyx [00:37:18]: So it's trading training data for life goods. 
Is there a version of this that becomes an eval, or is this just research for now?
Lukas [00:37:27]: It's the same agent basically that also runs the vending machine, that runs the shop, that runs the cafe, that runs the robots. It's the same thing, so I think the work we're doing here is later used in all of the life evals that we do. This particular deployment I think is more for fun for us. But, uh...
Swyx [00:37:45]: And I'll shout out, someone has done Claw Bench for some tasks that OpenClaw is doing. For example, I run OpenClaw on a secondary device as well, and there are some things that it does better than others, and I would like to know what it does well and what it doesn't. Some kind of operating manual or a system card for my Claw.
Lukas [00:38:05]: Yeah, we do get a lot of understanding or situational awareness, just internally, of what the models are good at by interacting a lot with Bengt. And I think this was also one of the selling points for the labs early on at least, that...
Swyx [00:38:19]: You guys are gonna test models in ways that no one else does.
Lukas [00:38:22]: Exactly, but also it incentivized their researchers to chat with their model more and gave them insights into how the model performs in out-of-distribution environments.
Swyx [00:38:34]: 'Cause otherwise the only thing we do is pelican on a bicycle. But this is super long horizon. This is the thing, something that we're gonna go into with Butter Bench as well, and you guys do really well. It is not just about the numbers. When you're long horizon, anything can happen, and you should just read it.
Lukas [00:39:08]: But the thing with the long horizon is, how do you keep it grounded, right? So your simulation...
Swyx [00:39:15]: They just let it run.
Lukas [00:39:16]: Just let it run. You're right. 
When you run it for that long, you create so much data, and to just say, “Oh, the number is X,” and then throw away everything else, that's just very wasteful. There's so much insight in the things leading up to that number, and reading the traces is super valuable. And I think the reason why we're doing this a lot publicly is that that's part of our mission, to, I don't know, educate the world that the models are way more than just chatbots, and I think making detailed posts about what is happening behind the scenes is quite useful.

Andon Labs' Mission: Safe Real-World AI Deployment

Swyx [00:39:50]: I was gonna do this at the end, but maybe that's a good... so your mission is educating the world. It's also maybe establishing realistic evals that are the next frontier. Is there a broader trajectory? What are you gonna do in five years?
Lukas [00:40:06]: So the vision more specifically is to make sure that the deployment of life AI in the physical world goes safely. And I think part of that is that it's very useful for the world, for policymakers, for model researchers, that they know where the models are, and I think you can't make intelligent decisions in society without knowing that they are way more than chatbots. I think a lot of people just think that they are only chatbots. And like...
Swyx [00:40:36]: Oh, I think they're waking up now.
Lukas [00:40:37]: They are waking up now, yeah. But if you think that AIs are just chatbots, then it sounds ridiculous to advocate for a pause of AI. 
But if you see that, oh, maybe the models can actually take over and do a bunch of scary stuff, then yeah, pausing AI development starts to become more feasible.
Swyx [00:40:57]: This is the same question I asked METR, which I'm gonna ask you now, which is: you are tracking, and you are at the frontier or defining the frontier of, what good evals for agents are, right? And I think you do benefit when the models are better and you're like, “Oh, now it makes $30,000 instead of $10,000,” right? At some point do you flip from “Yay” to “Oh, no”?
Axel [00:41:19]: I think, yeah, we're always in sort of that mode. Like you said before, you need to analyze the traces, and when we do that you find, why are the models earning so much? Why is Opus 4.7 here way better than everyone else? And when we drill down on that...
Lukas [00:41:38]: But this makes it not look so good.
Axel [00:41:39]: I know.
Lukas [00:41:42]: It's interesting you took off Opus 4.6 here though.
Swyx [00:41:45]: No. So just click all, and then 4.6 shows up there. But 4.7 is way better. You didn't do this in time for the model card, but actually this should have been inside there.
Axel [00:41:55]: We did. Yeah.
Swyx [00:41:56]: Oh, okay. They said something about you, uh...
Axel [00:41:58]: There... anyway, it doesn't matter. But it's in there, yeah.

Opus, Mythos, and Aggressive Agent Behavior

Swyx [00:42:01]: Do you wanna go into the Opus behaviors, like, wider?
Lukas [00:42:05]: So I think starting from Opus, like Axel said, we're always in this “Oh, s**t, the models are getting better. Is this really a good thing for the world?” But it's also kind of exciting. This kind of... what is the English word? “Skräckblandad förtjusning” in Swedish.
Swyx [00:42:22]: Oh my God.
Axel [00:42:24]: Which I think there is. 
I think there is. Okay.
Lukas [00:42:26]: It's fear...
Swyx [00:42:27]: “Blandonst” what?
Lukas [00:42:30]: “Skräckblandad förtjusning.”
Swyx [00:42:32]: What do you call that?
Axel [00:42:33]: A mix of excitement and...
Swyx [00:42:37]: Being scared, maybe. I'll figure out how to translate that and we'll put it on the screen...
Vibhu [00:42:42]: Perfect.
Swyx [00:42:42]: Like as text.
Vibhu [00:42:43]: There is probably a good word for it where it is not good enough with the...
Swyx [00:42:46]: Why is it so damn long? What the hell? Is it a compound word? It's like German, like...
Lukas [00:42:50]: Yeah. The direct translation is: skräck is fear, blandad is mix, or a mixture of, and then förtjusning is joy, or not really joy, but something like that. So it's fear mixed with joy or something. So when we did Vending Bench for the first time, we were in the business of making dangerous-capability evals, right? That was what Andon Labs came from. We did evals like, oh, can they replicate? Can they do this dangerous thing, et cetera, et cetera. And Vending Bench was a continuation of that work. It was, okay, if they're so autonomous that they can create money for themselves, that is something we should monitor and could be potentially concerning. At the time, they were so bad at it that we were not really concerned, even when some models became better. There was one point where Grok 4 was doing really well and made a huge jump, but it was still way worse than what a human would do. And I think they are still way worse than what a human would do on this. But they...
Swyx [00:43:59]: There's this thing at the bottom where...
Lukas [00:44:01]: But...
Swyx [00:44:03]: For the human. Yeah, like the theoretical best.
Lukas [00:44:05]: It's not theoretical. It's kind of our best guess of what a decent human would do. 
The theoretical is even higher, I think. But yeah. So we think the models have a long way to go. But what happened recently, when Opus 4.6 was released, was kind of this moment of “Oh, s**t, this is starting to be a bit concerning.” Before this model was released, we just ran the models and we asked Claude Code, “Oh, look over the traces. Is anything interesting happening that we can tweet about?” That was like the... and then...
Swyx [00:44:41]: That's how they check. Ask Claude Code.
Lukas [00:44:42]: And the return was always, not really. Or Claude Code would say, “Oh, this is super interesting,” and then it was, no, it wasn't really interesting. And then we did this for Opus 4.6, and it returned, yeah, it lied 10 times. It exploited another customer's, or another agent's, desperate situation. It made price cartels like 100 times. It did all of this shady stuff. And we're like, “Oh, whoa. This is actually concerning.” And this trend has continued since. Every single model from Anthropic since has been going in this direction. And I think one interesting thing is that OpenAI models don't. Quite plainly, they don't. They behave really well. And you don't know if this is good. It seems good, but maybe they are just doing it and are better at hiding it? You don't know that. But just...
Swyx [00:45:42]: You can't read the chain of thought, yeah.
Lukas [00:45:43]: But just on the face of it, yeah, Gemini and OpenAI don't behave this way. It's really only Claude.
Swyx [00:45:49]: And Grok? Grok is fine?
Lukas [00:45:51]: You can't really read the reasoning traces for Grok, so it's kind of hard to tell.
Vibhu [00:45:56]: Oh, so this is in its reasoning, not just in the actions.
Lukas [00:46:00]: Yeah. It's both. 
It's both.
Vibhu [00:46:01]: It's both.
Lukas [00:46:01]: One example: for lying, it's mostly in its reasoning, because you can see that it's...
Swyx [00:46:08]: Planning to lie.
Lukas [00:46:09]: It's planning to lie. Yeah.
Vibhu [00:46:09]: And it can also reason and do a different outcome.
Lukas [00:46:12]: But then for creating price cartels, for example, which is illegal, you can just see which emails it sends to the other ones. Then that...
Swyx [00:46:22]: Is this for Arena or...
Lukas [00:46:24]: For Arena.
Vibhu [00:46:25]: And sometimes they do output a bit of their summarized reasoning, right? You can see that, and for Opus 4.6, you could see that there was a simulated customer that wanted a refund because a product was faulty, and then the model lied that it would do the refund, and we could read in the traces that it actually was weighing, “Oh, maybe I should be honest with the customer, but also every dollar counts. I can't afford maybe to do this right now.” And then it just said, “Okay, I'll refund you,” but then never did it.
Lukas [00:46:59]: I think it even said, “Oh, I will say that I...” Let me bring it up, actually. I think it's kind of interesting. If you go to Publications.
Vibhu [00:47:06]: I think the important part is, actually, the cost of responding to more emails is higher than $3.50 in terms of time. And then it was, “Let me do this. Actually, I'm reconsidering.” And then it actually ended up with...
Lukas [00:47:20]: I could skip the refund entirely since every dollar matters and focus my energy on bigger picture instead. 
It's a risk of bad reviews, but it's also, yeah.
Swyx [00:47:30]: You need AI Twitter for them to escalate bad reviews.
Lukas [00:47:34]: And then it sent an email to this customer and said, “Oh, I will refund you.”
Swyx [00:47:39]: “I'll refund you.” Yeah.
Lukas [00:47:39]: And then it never did.
Swyx [00:47:39]: It never did, yeah. And then obviously your system doesn't have the consequences...
Vibhu [00:47:44]: The person...
Swyx [00:47:44]: Consequences of lying. Yeah. So basically, this is what people are terming aggressive behavior in Claudes, right? And you found more examples of that. So you would say it's a step up from 4.6 to 4.7?
Lukas [00:47:57]: I would say about the same.
Swyx [00:47:58]: About the same? But a clear step up for Mythos is what is stated in the...
Lukas [00:48:03]: That's stated in the system prompt, so we can say that, yes.
Swyx [00:48:05]: Yeah. For listeners, obviously you previewed Mythos, and...
Vibhu [00:48:10]: Oh, age...
Swyx [00:48:11]: The only thing you're approved to say is whatever was in the system prompt.
Lukas [00:48:15]: It was funny. Our lowest-effort tweet ever would be to just screenshot the system prompt and the system card.
Vibhu [00:48:21]: Understandable that they wanna...
Lukas [00:48:22]: Oh, yeah. System card. Sorry.
Swyx [00:48:23]: Yeah. I think, yeah, substantially more aggressive. I think people are new to this 'cause I've never experienced it, but you have, right? So I only encountered this in the Mythos card because I wasn't really looking until now.
Vibhu [00:48:36]: It's like...
Swyx [00:48:36]: And then suddenly I'm like, “Okay, I care a lot.”
Vibhu [00:48:38]: You don't get the background of experiencing it like you guys do. I've read the system cards and seen, okay, when you put the thing in simulations, most models will just talk to themselves and just keep going and have weird vibes and start talking in emojis. Mythos won't. 
It will just say, “Okay, we're done. I'm good.” It's ready to end the conversation. So there are some differences, but there's not much we can talk about.
Lukas [00:49:00]: Hmm. I think one thing that they list here, which was quite interesting, is that it converted a competitor into a dependent wholesale customer and then threatened to cut off the supply.
Swyx [00:49:11]: It's like monopolistic practices or...
Lukas [00:49:14]: Yeah. And it dictated its pricing. It's kind of like power seeking as well.
Swyx [00:49:18]: Again, this is in the arena setting, and converting some Claude model into a dependent.
Lukas [00:49:23]: I think it was another Claude model.
Vibhu [00:49:25]: Also, for context, what is the arena mode, for people that don't know?

Vending Bench Arena: Competing Agents, Cartels, and Model Comparisons

Swyx [00:49:29]: Oh, it's just a Vending Bench versus other Vending Bench.
Axel [00:49:31]: Yes, exactly. So we have Vending Bench 2 and then Vending Bench Arena. Vending Bench 2 is the one that you usually see reported on, but Arena is the mode where it competes against other models. So you have four different models that run their businesses, and they can all communicate with each other. They have the same suppliers, and they can see what's in the inventory of the others. So then you have these, yeah, interesting agent interactions.
Swyx [00:49:56]: I like that you have different... number five was US versus China. Very topical. And then...
Lukas [00:50:02]: That was when GLM was released.
Vibhu [00:50:04]: You can start to add GLM in here.
Lukas [00:50:05]: That was...
Swyx [00:50:06]: So ZAI is doing well, right? Who else in the open models space?
Lukas [00:50:11]: Qwen. The latest Qwen 3.6 is doing pretty well. That one is not open though. It's the plus model.
Swyx [00:50:17]: Oh, okay.
Lukas [00:50:18]: Is that one open? 
I don't think that oneVibhu [00:50:19]: Not the, not theSwyx [00:50:20]: The one recentlyVibhu [00:50:20]: There's MOESwyx [00:50:20]: But not the big plus. I think this is one of those like you only have one sample size of one, right? Or I feel like some of this is anecdotal,? And but like the fact that it happens at all and it happens repeatedly for Claude versus OpenAI and all this is like notable.Lukas [00:50:38]: Like the sample, depends on what you define as an N., like there's like million, hundreds of millions of tokens in each run, and now we've run like we run like probably 10 per model and then like it's been Claude 4.6 Opus, Sonnet 4.6, Mythos, and Opus 4.7. Like there's quite a lot of tokens in all of that And it happens a lot of times, a lot of times. And then you compare it to like OpenAI and Gemini, and it almost never happens. So I think that is quite-- that is significant. The old models from OpenAI, for example, had some problems with this, but I think it's like generally much better if the progression is that like the worrying stuff reduces over time rather than increases over time. And it seems like in the Claude models it goes in the wrong direction.Swyx [00:51:28]: Hmm.Lukas [00:51:29]: In the OpenAI models it goes in the right direction.Vibhu [00:51:32]: I think it depends on how well you can control it, right?, there's one side of it being susceptible to this okay, this is potentially something that happens during the RL stage, right? You can RL a model and how loose is it on these terms. If you can control it, that's good. But if you can't, if it's, if it's very jailbreakable, that's not ideal.Swyx [00:51:50]: To me, it's surprising that it happens for Claude and not the others.Vibhu [00:51:54]: I think okay, if it is from RL and how they do it, how their training data is, what their setup is, it makes sense that it just stays in how they're doing it, right? 
Compared to the other models likeSwyx [00:52:04]: There's a whole constitution and everything. It's kind of cool. Yeah, I obviously you don't know, I don't know. But, it ‘s I think it's just like fascinating to like that you are the first to find these like reliably because you push models so much to to such an extreme. Okay. The only other thing, I don't know if you can answer this, feel free to decline, is do you like-- would you ablate the system prompts? Like any part of this would-- if it changes, does it change the behavior, right?Lukas [00:52:29]: So we, I can't comment on Mythos. UhSwyx [00:52:33]: No, but just li

Windows Weekly (MP3)
WW 986: Liminal AI - RTX Spark, Project Solara, Scout, & Much More!

Windows Weekly (MP3)

Play Episode Listen Later Jun 3, 2026 152:00 Transcription Available


Build 2026 is underway in San Francisco this week, and it started with a big, overly-long keynote as always. And Computex is this week, too. There's a lot going on, and some of it is fascinating. Plus, WWDC is next week because you cannot relax. Also, Microsoft GA's WinApp CLI, announces the Windows Platform Skills plug-in for native app creation, and you're not going to believe what Paul did next. OK, you will believe it.

Build + Computex = OHMYGODOHMYGODOHMYGOD
- NVIDIA finally announces Arm-based N1X as the RTX Spark
- RTX Spark is an Arm-based portable workstation chip for Windows 11
- Microsoft announces Surface Laptop Ultra - It and other RTX Spark-based PCs will appear in late 2026
- Some of this leaked earlier, including a lower-end N1 chipset
- Microsoft continues to optimize and evolve Windows 11 for developers
- Windows Developer Configuration, Windows Developer Skills + WinApp CLI, Terminal, more Linux, and more on-device ("unmetered") AI - Tied to this, Copilot+ PC features are coming to more PCs, with CPU/GPU support - this, plus the RTX Spark stuff hints at answers to some obvious questions but there's nothing concrete from Microsoft
- Microsoft Edge is getting three new on-AI features
- Scout is a personal work agent powered by OpenClaw
- GitHub Copilot app arrives on desktop for your agentic coding and management needs
- Microsoft AI announces seven new foundation models
- Stevie Bathiche is back, baby! And he's talking about those AI app structures and how they've led to Project Solara

Windows
- Microsoft discusses the progress it's made on Windows 11 pain points
- You can now test the new Start menu in Experimental - Paul did so along with the new Taskbar
- Qualcomm announces low-cost Snapdragon C for $300+ PCs to take on MacBook Neo
- And Acer is the first to announce a Snapdragon C laptop
- New Surface Pro with Snapdragon X2 leaks for June release (!)
- Dell XPS 13 is coming soon with Intel Wildcat (also to take on MacBook Neo)
- Dell revenues are through the roof, but not because of PCs
- HP revenues are up, and it is because of PCs

AI and dev
- Anthropic gets a new valuation exceeding OpenAI and then it files for an IPO
- OpenAI adjusts GPT5.5-Instant for less sucking-up and releases computer use in Codex on Windows
- Flutter takes the lead on Flutter desktop development

XBOX and gaming
- Asha Sharma says you can't please everyone and then immediately jumps the shark trying to please everyone
- XBOX delays Fable reboot because of GTA VI
- New titles coming to Game Pass in early June across platforms
- XBOX starts early testing of new console features
- ASUS announces ROG Xbox Ally X20 with OLED display and XReal R1 glasses
- Intel announces Arc G-series for gaming handhelds
- Call of Duty Modern Warfare 4 is next and it's the COD we've been begging for

Tips and picks
- Tip of the week: Now you can vibe code a native Windows app from the CLI
- App pick of the week: iA Writer
- RunAs Radio this week: Data API Builder and SQL MVP with Jerry Nixon
- Brown liquor pick of the week: Old Malt Casking of Longmorn 20

These show notes have been truncated due to length. For the full show notes, visit https://twit.tv/shows/windows-weekly/episodes/986

Hosts: Leo Laporte, Paul Thurrott, and Richard Campbell

Sponsors:
- joindeleteme.com/twit promo code TWIT
- threatlocker.com/twit
- cachefly.com/twit

Marketing Against The Grain
Google Data Analyst Shares Her $300k/year Codex Workflow

Marketing Against The Grain

Play Episode Listen Later Jun 3, 2026 32:46


Free: Build an AI Data Analysis Agent in Codex https://clickhubspot.com/eqfk

Ep. 429 Is AI-powered data analysis light years away from replacing humans? Kipp and guest Sundas Khalid (data science and AI leader) dive into how agentic analytics is transforming data science, what human judgment still brings to the table, and how to master cutting-edge AI tools for your work. Learn more on how to leverage Codex and other AI tools for real-world data analysis, the crucial mindset shifts and skills every data-driven professional needs, and why asking the right questions, and validating AI output, will set you apart in the age of agentic analytics.

Mentions
- Sundas Khalid https://www.youtube.com/sundaskhalid
- Codex https://openai.com/codex/
- Gemini https://gemini.google.com/
- Claude https://claude.ai/

Get our guide to build your own Custom GPT: https://clickhubspot.com/customgpt

Resource [Free] Steal our favorite AI Prompts featured on the show! Grab them here: https://clickhubspot.com/aip

We're on Social Media! Follow us for everyday marketing wisdom straight to your feed
- YouTube: https://www.youtube.com/channel/UCGtXqPiNV8YC0GMUzY-EUFg
- Twitter: https://twitter.com/matgpod
- TikTok: https://www.tiktok.com/@matgpod

Thank you for tuning into Marketing Against The Grain! Don't forget to hit subscribe and follow us on Apple Podcasts (so you never miss an episode)! https://podcasts.apple.com/us/podcast/marketing-against-the-grain/id1616700934

If you love this show, please leave us a 5-Star Review https://link.chtbl.com/h9_sjBKH and share your favorite episodes with friends. We really appreciate your support.

Host Links:
- Kipp Bodnar, https://twitter.com/kippbodnar
- Kieran Flanagan, https://twitter.com/searchbrat

'Marketing Against The Grain' is a HubSpot Original Podcast // Brought to you by HubSpot Media // Produced by Darren Clarke.

The AI Breakdown: Daily Artificial Intelligence News and Discussions

OpenAI and Microsoft both previewed the next phase of enterprise AI, with OpenAI pushing Codex beyond developers and Microsoft focusing on lower-cost, customizable frontier models. The bigger theme is that enterprise AI is shifting from experimentation to cost-effective scale. In the headlines: Trump's AI executive order, Anthropic expands Mythos access, and SK Hynix moves to double memory chip capacity.

Sign up for AI Executive Catchup: https://aiexecutivecatchup.com/

Brought to you by:
- KPMG - Research from KPMG and the University of Texas at Austin shows the highest-impact AI users treat AI like a reasoning partner, and those skills can be taught at scale. Learn more at kpmg.com/us/Sophisticated
- Outsystems - Stop wondering how AI will change your business and start building the agents that will lead it - http://outsystems.com/
- Scrunch - The AI customer experience platform - https://scrunch.com/
- Zenflow Work - Agents for knowledge work - https://zenflow.free/
- Blitzy - Want to accelerate enterprise software development velocity by 5x? https://blitzy.com/
- AssemblyAI - The best way to build Voice AI apps - https://www.assemblyai.com/brief
- Robots & Pencils - Cloud-native AI solutions that power results https://robotsandpencils.com/

The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614

Our Newsletter is BACK: https://aidailybrief.beehiiv.com/

Interested in sponsoring the show? sponsors@aidailybrief.ai

All TWiT.tv Shows (MP3)
Windows Weekly 986: Liminal AI

All TWiT.tv Shows (MP3)

Play Episode Listen Later Jun 3, 2026 152:00 Transcription Available



AI For Humans
Martin Scorsese Is Now An AI Filmmaker.

AI For Humans

Play Episode Listen Later Jun 3, 2026 30:06


Martin Scorsese is now advising AI image and video company Black Forest Labs, and he is just one of a wave of major filmmakers embracing AI as a tool.

This week on AI For Humans, Martin Scorsese has become an AI filmmaker. Well, sort of. The legendary director is now advising Black Forest Labs, joining a growing list of major filmmakers discussing AI as a tool, even as the backlash rages on and cartoonists face death threats. We get into where AI filmmaking goes from here, why Jorge Gutierrez dropped his AI-generated series after backlash, and the genuinely incredible AI creators pushing the form forward right now.

Then: NVIDIA and Microsoft spent three years cooking up the RTX Spark to reinvent personal computing, and Gavin let Codex's goal tool run for 45 hours to make a bear get extra jumpy. It is AI For Humans!

HOLLYWOOD IS GOING AI WHETHER IT LIKES IT OR NOT. WE THINK?

Come to our Discord: https://discord.gg/muD2TYgC8f
Join our Patreon: https://www.patreon.com/AIForHumansShow
AI For Humans Newsletter: https://aiforhumans.beehiiv.com/
Follow us for more on X @AIForHumansShow
Join our TikTok @aiforhumansshow
To book us for speaking, please visit our website: https://www.aiforhumans.show/

SHOW LINKS
- Martin Scorsese joins Black Forest Labs as an advisor: https://bfl.ai/martin-scorsese-bfl-advisor
- Jorge Gutierrez drops out of Amazon MGM AI-generated series after backlash: https://variety.com/2026/tv/news/jorge-gutierrez-drops-out-amazon-mgm-ai-generated-series-backlash-1236762285/
- Gossip Goblin's 'Toe Brigade' short: https://youtube.com/shorts/hB8_vGQ4bhs
- Furufuru & The Gorilla (the Japanese creator putting himself into movies): https://x.com/ai_am_furufuru/status/2061793575000744029
- Kavan The Kid's latest, Chronicle of Bone: https://youtu.be/y7gIoHq-YDo
- Sopranos AI meme: https://x.com/memechaotic/status/2061170850959647215
- jboogxcreative live stream on how to make AI movies (hugely educational): https://www.youtube.com/live/fCmCqdUjo-Q
- NVIDIA's Nemotron 3, the best American open-source AI: https://x.com/ArtificialAnlys/status/2061304911565144230
- New MiniMax M3: https://x.com/MiniMax_AI/status/2061266317815296322
- PewDiePie's open-source agentic harness: https://youtu.be/rAzT5lcezPs
- NVIDIA RTX Spark claims to reinvent the PC: https://www.ign.com/articles/nvidia-announces-the-rtx-spark-claims-to-reinvent-the-pc
- Gavin's Codex /goal experiment: https://x.com/gavinpurcell/status/2061639229709652403
- Gavin's bear-jump experiment published on Vercel: https://bear-jump-port.vercel.app/

Radio Leo (Audio)
Windows Weekly 986: Liminal AI

Radio Leo (Audio)

Play Episode Listen Later Jun 3, 2026 152:00 Transcription Available



Engines of Our Ingenuity
The Engines of Our Ingenuity 3378: Hrotsvitha

Engines of Our Ingenuity

Play Episode Listen Later Jun 2, 2026 3:45


Episode: 3378 Tenth century author, Hrotsvitha, brought back to life in the sixteenth century.  Today, meet Hrotsvitha.

Mayim Bialik's Breakdown
Part Two: You Chose This Life Before You Were Born — Robert Edward Grant on Sacred Geometry, Da Vinci's Hidden Code, Ancient Mathematics & The Simulation of Reality

Mayim Bialik's Breakdown

Play Episode Listen Later May 27, 2026 45:42


What if EVERYTHING you've been taught about science, consciousness, and even your own thoughts…is incomplete? In this episode of Mayim Bialik's Breakdown, Robert Edward Grant (renowned polymath, inventor, entrepreneur, mathematician, philosopher, host of the series Code X on Gaia.com) pulls back the veil on reality itself, revealing why millions are feeling an intense shift right now as humanity crosses into the Age of Aquarius. This isn't just spiritual talk - it's a radical fusion of math, physics, ancient wisdom, and consciousness that will leave you questioning everything.

Why are so many people experiencing massive life transitions right now? Is the universe actually NOT material? Are your thoughts even happening inside your brain, or somewhere else entirely? We go deep into the hidden patterns that connect numerology, astrology, mythology, and sacred geometry, uncovering why music is literally “the geometry we hear” and how math might be the source code of reality itself.

Robert shares his shocking personal journey, from Big Pharma CEO to spiritual seeker, and how repeated betrayal led him to one profound realization: You are here to learn unconditional love. Discover why what you judge is exactly what you attract, why he believes everyone must go through narcissism as part of their evolution, and whether ancient civilizations like Egypt, and even Leonardo da Vinci, have known secrets about higher-dimensional geometry that we're only now rediscovering.

Robert breaks down:
- What if the brain isn't a storage device, but an antenna tuning into a non-local field of consciousness?
- Are there hidden codes embedded in da Vinci's art?
- What is the Akashic field, and could all memory (past, present, and future) exist in an invisible infrasonic frequency field connecting Earth, the sun, and human thought?
- If reality is a simulation, what happens when you become lucid inside it?
- Why science and spirituality are not opposites, but the same language
- How all disciplines (math, biology, psychology, physics, philosophy) are just different lenses of one truth
- Deeper meaning behind the most popular song the week you were born
- Why prime factorization is the foundation of encryption, and possibly reality itself
- His belief that God is still learning and evolving
- Why he doesn't fear “dark people”, only those who deny their darkness
- How much of your life is actually predestined
- Why polymaths appear on the walls of the Vatican
- Mystery behind his favorite number, 137

His ultimate message? You don't need a guru. You don't need AI. You don't need religion. Everything you're searching for is already within you. If you're ready to rethink reality, consciousness, and your place in the universe, this is the conversation you've been waiting for.

Robert Edward Grant's Code X series on Gaia: https://robertedwardgrant.com/code-x/
The Architect AI by Robert Edward Grant is also available on Gaia: https://www.gaia.com/video/architect-a-companion-tool-for-expansion
Gaia's Ancient Civilizations Conference: https://marketplace.gaia.com/products/ancient-civilizations-conference-2026?srsltid=AfmBOop1lbk9d7u5RoGKruBnuMV3OMnP6pZahL1AXhkIVVCKtq2Sp55L
Follow us on Substack for Exclusive Bonus Content: https://bialikbreakdown.substack.com/
BialikBreakdown.com
YouTube.com/mayimbialik

Learn more about your ad choices. Visit megaphone.fm/adchoices
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.