Podcasts about gpu

  • 2,147PODCASTS
  • 5,677EPISODES
  • 53mAVG DURATION
  • 2DAILY NEW EPISODES
  • Jun 3, 2026LATEST

POPULARITY

20192020202120222023202420252026

Categories



Best podcasts about gpu

Show all podcasts related to gpu

Latest podcast episodes about gpu

The Cloudcast
Cerebras is disrupting the market with Fast Inference

The Cloudcast

Play Episode Listen Later Jun 3, 2026 35:21


SUMMARY: After the first successful AI IPO of 2026, we dig into what makes the Cerebras WSE architecture unique in the market for fast inference. GUEST: Andy Hock, at Chief Strategy Officer at Cerebras AISHOW: 1033SHOW TRANSCRIPT: The Enterprise AI Show #1033 TranscriptSHOW VIDEO: https://youtu.be/ed2nVbOtZiASHOW SPONSORS:OutShift - “Scaling Out Superintelligence”  The Internet of Cognition architectureShareGate - ShareGate Protect. Microsoft 365 Governance, we got this!Nasuni - Activate your data for AI and request a demoSHOW NOTES:OpenAI announces 750MW partnership with CerebrasCerebras and AWS partnershipCerebras announces IPOTopic 1 - Welcome to the show. Tell us about your background, and what you focus on today. Topic 2 - For anyone that's not familiar with Cerebras, give us an overview of the company, and especially an overview on the Cerebras technologies (e.g. Wafer-Scale Engine).Topic 3 - Cerebras' WSE architecture is different from many of the GPU or GPU-like architectures in the market today. Centralized vs. distributed architectures always have their tradeoffs. Walk us through the technical and economic value of the Cerebras architecture.Topic 4 - Congratulations on the recent IPO (raised $5.55B). Let's use that as a point in time vs the previous planned IPO. How has the market changed in that timeframe, and how has the Cerebras position changed? Topic 5 - Cerebras (today) offer both WSE hardware, and Cerebras Cloud (API) - very different GTM paths. Can we expect both of those to stay top priorities, or have the market dynamics shifted such that the priorities shift more towards the WSE business - as we're seeing OpenAI, AWS and other engagements announced?Topic 6 - Is Cerebras a training and inference company, or are the economics of inference significantly different enough that it needs to be the sole focus of the company (for now)? Topic 7 - How much effort is it for any company to add support for the Cerebras chips if they have previously been using other architectures?Topic 8 - An IPO is a major milestone for any company, but the markets will now look for your future story. How do you see the AI market evolving over the next 2-5 years, and what are some things that people aren't understanding yet about how it will evolve?FEEDBACK?Email: show @ the enterprise ai show dot comeBluesky: @TheEntAIShow.bsky.socialTwitter/X: @TheEntAIShowInstagram: @TheEntAIShow

Marketplace Tech
Wall Street sets its sights on an AI futures market

Marketplace Tech

Play Episode Listen Later Jun 2, 2026 8:52


There is growing demand for time with GPUs, the chips that power artificial intelligence. AI companies need those chips in order to keep their models up and running. And to do that, they can reserve time with a GPU. Now, there's interest from Wall Street in creating a futures market for this AI compute time, essentially treating it like a commodity. Marketplace's Stephanie Hughes spoke with Liz Hoffman, business and finance editor at Semafor and host of the “Compound Interest” podcast, who recently wrote about this.

Marketplace All-in-One
Wall Street sets its sights on an AI futures market

Marketplace All-in-One

Play Episode Listen Later Jun 2, 2026 8:52


There is growing demand for time with GPUs, the chips that power artificial intelligence. AI companies need those chips in order to keep their models up and running. And to do that, they can reserve time with a GPU. Now, there's interest from Wall Street in creating a futures market for this AI compute time, essentially treating it like a commodity. Marketplace's Stephanie Hughes spoke with Liz Hoffman, business and finance editor at Semafor and host of the “Compound Interest” podcast, who recently wrote about this.

The Interchange
The grid's missing operating system: Why a $100,000 AI controller could defer trillions in hardware and why utilities won't buy it

The Interchange

Play Episode Listen Later Jun 2, 2026 43:46


The energy transition conversation focuses on what connects to the grid. Far less attention goes to whether anyone is coordinating what those assets do once connected. AI training runs swing hundreds of megawatts in seconds as GPUs checkpoint and restart a profile that looks like a generator tripping offline. At distribution level, millions of inverter-based resources create localised variability that overwhelms individual circuits even when aggregate models look healthy. The planning tools in use today were designed for neither problem.Host Bridget van Dorsten is joined by Kay Aikin, CEO and Founder of Dynamic Grid, energy engineer, grid architecture advisor to the DOE-supported GridWise Architecture Council, and contributor to the UN Environmental Program's building decarbonisation work. Kay unpacks what an AI training facility actually does to the grid with full GPU load for hours or days, then a drop to ten percent in seconds during checkpointing. She talks about how at the scale now planned, the Stargate project in Texas alone could represent ten percent of ERCOT disappearing in four seconds. The behaviour is stochastic and cannot be modelled with traditional statistical tools. At distribution level, virtual power plants responding to wholesale signals without circuit-level visibility can create competing oscillations, the kind of emergent dynamics that contributed to the Spanish grid failure.The proposed fix is an AI controller at the substation, sending price-based signals and flexible operating envelopes to large assets and VPP operators, giving them twenty-four-hour forecasts and real-time circuit visibility. Total cost: under a hundred thousand dollars installed. The reason it isn't everywhere is cost-of-service regulation. Utilities earn returns on deployed capital, so a million-dollar transformer replacement is more profitable than software that eliminates the need for it.Without new approaches, rebuilding the US distribution grid could cost up to ten trillion dollars by 2040. Kay is developing grid utilisation metrics with regulators in Maine, Virginia, and Maryland to incentivise extracting more from existing infrastructure. The episode closes on the need for distribution system operators and the affordability death spiral that looms if the structural incentives don't shift. See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The Look Back with Host Keith Newman
Becoming the VMware of AI Infrastructure: Lukas on Building the Operating System for GPU Clouds

The Look Back with Host Keith Newman

Play Episode Listen Later Jun 2, 2026 10:05


AI infrastructure is breaking the old data center model.In this episode of Liftoff with Keith, I sit down with Lukas Gentele, CEO & Co-Founder of vCluster Labs, to unpack what it really takes to operate GPU infrastructure at scale in 2026.As AI workloads explode and neoclouds race to meet demand, Lukas and his team are building the operational backbone for modern AI clouds — from managed Kubernetes and tenant isolation to automated node provisioning and GPU lifecycle management.We discuss:Why traditional data center assumptions are collapsing under AI pressureWhat's fundamentally changed since the VMware eraHow an early partnership with CoreWeave shaped vCluster's trajectoryAnd the one mistake AI cloud operators are making right now that could hurt them over the next 18 monthsIf you care about AI infrastructure, GPU economics, hyperscaler strategy, or building category-defining platforms — this conversation is essential.Sponsor Info: We are strategic business advisors with decades of leadership experience and a proven track record of driving businesses' growth. We specialize in creating custom-tailored strategies to introduce your company, drive growth, build leadership teams, and ensure companies implement appropriate compensation programs. Our mission is to utilize our expansive network to benefit your company https://www.compass-strategic-advisors.com/ Connect with Lukas Gentele: Website: https://www.vcluster.com/ LinkedIn: https://www.linkedin.com/in/gentele/ Subscribe for more founder insights and hit the bell for notifications! Follow us on our channels for exclusive startup content and behind-the-scenes insights from interviews like this one. Spotify: https://open.spotify.com/show/3cFpLXfYvcUsxvsT9MwyAD?si=f5a14e779777487d Apple Podcasts: https://podcasts.apple.com/ca/podcast/liftoff-with-keith-newman/id1560219589 Substack: https://keithnewman.substack.com/ Newman Media Studios: https://newmanmediastudios.com/ LinkedIn: https://www.linkedin.com/company/liftoffwithkeith For sponsorship inquiries, please contact: sponsorships@wherewithstudio.comFrom the Host: A special shout-out to our Great Host of the Ignite Studios: https://www.ignitegtm.com/ and Producers of AI Infra5 @Plug and Play World, HQ in Sunnyvale, CALiftoff is sponsored by a strategic consulting firm and the M&A specialists at Compass Strategic Advisors - https://www.compass-strategic-advisors.com/ and The GTM Firm - https://www.thegtmfirm.com/

That Tech Pod
What Happens When Critical Infrastructure Fails? with Robert "Max" Maxfield

That Tech Pod

Play Episode Listen Later Jun 2, 2026 29:35


What does it take to modernize the systems that keep water flowing, wastewater moving, and nine million New Yorkers served every day?In this episode, we sit down with Robert "Max" Maxfield, Chief Systems Architect at AITHERAS and the architect behind New York City's SCADA modernization efforts for the Bureau of Wastewater Treatment. Max takes us inside the world of critical infrastructure, where downtime isn't an inconvenience, it's a public risk. From managing decades-old industrial systems and balancing modernization against reliability, to defending essential services against cyber threats, Max shares what it really takes to operate technology that most people never think about until it fails.We also explore the realities of AI in critical infrastructure, the cybersecurity challenges facing utilities, the surprising longevity of legacy systems, and how Max's passion for motorcycles, racing, and building machines shapes his approach to engineering. It's a conversation about technology, risk, resilience, and why sometimes the most important systems are the ones nobody notices.Robert “Max” Maxfield is the Chief Systems Architect at AITHERAS, leading the SCADA Modernization Program for NYC's Bureau of Wastewater Treatment. In this role, Max designs and deploys the systems that keep critical water infrastructure operating for nine million New Yorkers. With 20+ years in industrial controls, 27 platform certifications, and prior architect roles on national operations centers and the Doyon Utilities Alaska modernization, Max specializes in the messy intersection of legacy industrial systems, modern SCADA, cybersecurity, and, increasingly, AI. He's been published in Forbes on industrial technology, runs his own GPU lab for local model fine-tuning, and spends his off-hours on custom motorcycles, off-road racing, and drag racing. Equal parts engineer, builder, and pragmatist, Max brings a field-tested perspective on what actually works when the stakes are critical infrastructure.

ARC ENERGY IDEAS
NVIDIA's Marc Spieler: AI, Data Centres, and Energy

ARC ENERGY IDEAS

Play Episode Listen Later Jun 2, 2026 47:57


The podcast opens with updates on the closure of the Strait of Hormuz, a German state-owned energy company contracting for Canadian West Coast LNG, and the Pope's theological document warning about AI. Next, Peter and Jackie introduce this week's guest, Marc Spieler, Senior Managing Director for the Global Energy Industry at NVIDIA, joining from Houston, Texas, to discuss the latest developments at the intersection of AI and energy. Energy and AI are deeply interlinked. Energy companies are using AI to improve efficiency across oil and gas, renewables, and emerging sources such as next-generation fission and fusion. At the same time, AI's explosive growth is driving significant new electricity demand, requiring a build-out of both generation and grid infrastructure. Predicting future power demand from AI remains uncertain; it depends on the pace of adoption and whether GPUs, along with other delivery components of the digital infrastructure stack, will become more efficient over time. Marc highlights that data centres are becoming more flexible, with the ability to reduce consumption during periods of grid stress. This would allow new data centre capacity to be added without straining the grid, while also lowering costs for all power consumers by improving system utilization during off-peak periods. Content referenced in this podcast: NVIDIA Blog with examples of energy company AI applications: Efficiency at Scale: NVIDIA, Energy Leaders Accelerating Power‑Flexible AI Factories to Fortify the Grid (March 2026) NVIDIA's NeMo Framework was used for asset integrity and reliability at Petrobras (March 2025) NVIDIA's Earth-2 library of open models, libraries, and frameworks that democratize global access to professional-grade weather and climate AI NVIDIA Vera Rubin DSX AI Factory reference design to maximize efficiency (March 2026) NVIDIA and Emerald AI, along with other energy companies, pioneer flexible AI factories (March 2026)  Pope Leo XIV, Magnifica Humanitas: On Safeguarding the Human Person in the Time of Artificial Intelligence (May 25, 2026) Please review our disclaimer at: https://www.arcenergyinstitute.com/disclaimer/ Check us out on social media: X (Twitter): @arcenergyinstLinkedIn: @ARC Energy Research Institute Subscribe to ARC Energy Ideas PodcastApple PodcastsAmazon MusicSpotify

Hacker News Recap
May 31st, 2026 | Cloudflare Turnstile requiring fingerprintable WebGL

Hacker News Recap

Play Episode Listen Later Jun 1, 2026 15:03


This is a recap of the top 10 posts on Hacker News on May 31, 2026. This podcast was generated by wondercraft.ai (00:30): Cloudflare Turnstile requiring fingerprintable WebGLOriginal post: https://news.ycombinator.com/item?id=48345840&utm_source=wondercraft_ai(01:55): Creatine raises brain energy levels and slows cognitive decline: studyOriginal post: https://news.ycombinator.com/item?id=48346947&utm_source=wondercraft_ai(03:21): Please Do Not Vibe Fuck Up This SoftwareOriginal post: https://news.ycombinator.com/item?id=48342705&utm_source=wondercraft_ai(04:47): The Website SpecificationOriginal post: https://news.ycombinator.com/item?id=48343683&utm_source=wondercraft_ai(06:13): Codex just found a "workaround" of not having sudo on my PCOriginal post: https://news.ycombinator.com/item?id=48348578&utm_source=wondercraft_ai(07:39): Dav2dOriginal post: https://news.ycombinator.com/item?id=48344961&utm_source=wondercraft_ai(09:04): The solution might be cancelling my AI subscriptionOriginal post: https://news.ycombinator.com/item?id=48345896&utm_source=wondercraft_ai(10:30): 1-Bit Bonsai Image 4B Image Generation for Local DevicesOriginal post: https://news.ycombinator.com/item?id=48346257&utm_source=wondercraft_ai(11:56): United Airlines 767 returns to Newark after Bluetooth name sparks alertOriginal post: https://news.ycombinator.com/item?id=48345248&utm_source=wondercraft_ai(13:22): I put a datacenter GPU in my gaming PCOriginal post: https://news.ycombinator.com/item?id=48345694&utm_source=wondercraft_aiThis is a third-party project, independent from HN and YC. Text and audio generated using AI, by wondercraft.ai. Create your own studio quality podcast with text as the only input in seconds at app.wondercraft.ai. Issues or feedback? We'd love to hear from you: team@wondercraft.ai

GreyBeards on Storage
175: GreyBeards talk Accelerated Object with SNIA TWG CoChairs, Jason Goldschmidt, DELL Distinguished Eng. & Nick Connolly, ARM Principal Eng.

GreyBeards on Storage

Play Episode Listen Later Jun 1, 2026 40:48


Jason Goldschmidt and Nick Connolly, co-chairs of SNIA's Accelerated Object TWG, discussed the importance of S3 over RDMA for AI processing. SNIAs work addresses industries need for faster data transfer to improve GPU utilization during model training and inferencing.

The Neil Ashton Podcast
S4 EP1 - Are AI Agents and Foundation Models About to Rewrite CAE?

The Neil Ashton Podcast

Play Episode Listen Later Jun 1, 2026 28:18


In this episode, Neil explores how agents, foundation models, and AI are set to transform the Computer-Aided Engineering (CAE) and Electronic Design Automation (EDA) landscapes. He shares a comprehensive historical perspective and predicts a near-future where AI-driven automation redefines engineering workflows, productivity, and innovation.Main Topics:The evolution of simulation codes from the 1960s to modern commercial softwareThe rise of cloud computing, GPUs, and their impact on CAE and EDA industriesThe integration of AI, surrogate modeling, and foundation models into simulation workflowsThe emergence of agentic AI systems capable of autonomously performing complex engineering tasksThe strategic responses of major software companies to AI and agent technologiesThe potential democratization and automation of engineering design through AI agentsCritical questions on model ownership, transparency, and industry adoptionTimestamps: 00:40 - Introduction: How agents and foundation models will disrupt CAE & EDA01:40 - Historical overview: From code writing in the 60s to commercial software03:10 - Growth of aerospace and automotive industry codes and commercialization04:40 - The impact of HPC, cloud computing, and hardware evolution06:25 - Rise of cloud SaaS models and "sassification" of simulation tools07:40 - Big tech entrance: AWS, Microsoft, and Google in CAE & EDA09:00 - GPU acceleration: Changed landscape in past three to four years09:10 - The role of AI startups offering surrogate models and real-time simulation10:40 - Industry consolidation: Mergers and acquisitions among software giants11:40 - The emergence of foundation models and surrogate systems in simulation13:00 - The significance of agents: Combining AI, models, and automation14:10 - Capabilities of autonomous AI agents in complex engineering workflows15:25 - Practical use cases: Running simulations, setting up experiments, and data analysis16:40 - How agent-driven automation could democratize engineering expertise16:10 - Questions about model ownership, open source codes, and licensing19:40 - The future of AI in engineering: Collaboration, transparency, and scientific rigor21:25 - Final thoughts: Opportunities, challenges, and the transformative potential of AI* Please note that this a personal opinion and not that of NVIDIA

The Six Five with Patrick Moorhead and Daniel Newman
IBM's $15B Day, Claude Opus 4.8, & Biggest Earnings Night of Spring 2026 | Ep. 306

The Six Five with Patrick Moorhead and Daniel Newman

Play Episode Listen Later Jun 1, 2026 58:04


Patrick Moorhead and Daniel Newman cover Daniel's acquisition of Enterprise Technology Research, IBM's historic $15 billion single-day commitment spanning quantum and open-source security, Anthropic's Claude Opus 4.8, and the heaviest single earnings night of the season featuring Dell, Marvell, Salesforce, Synopsys, Snowflake, HP, and Micron crossing $1 trillion in market cap. The handpicked topics for this week are: Anthropic Releases Claude Opus 4.8: Six Weeks After 4.7 Anthropic dropped Opus 4.8 just six weeks after 4.7, claiming it surpasses GPT-5.5 and Gemini 3.1 Pro on agentic coding, knowledge work, and computer use. Benchmark improvements across the board: agentic coding up from 64.3% to 69.2%, knowledge work from 1753 to 1890, agentic computer use from 82.8% to 83.4%. Three new features ship alongside it: Dynamic Workflows for multi-subagent orchestration inside Claude Code, Effort Control for managing token spend, and mid-task system messages via the API. Fast mode is now 2.5x faster and 3x cheaper. Pat's honest take: what it says on paper is good, particularly on tool triggering and citation precision, but he has lost significant trust in the company and is watching closely. (The Decode)   IBM Commits $10 Billion to Quantum: The Largest Single Quantum Bet in History IBM announced a $10 billion commitment over five years targeting a large-scale fault-tolerant quantum computer by 2029, landing the same day as the $5 billion Project Lightwell announcement for a single-day IBM strategic commitment of $15 billion. Pat has been calling 2029 to 2031 as the realistic commercial quantum window and calls this the strongest single corporate financial signal yet that the timeline is real. Daniel's framing: IBM wants to be the NVIDIA of quantum, and with a $10 billion commitment, it's sending a flare to the entire industry that pure-play quantum companies cannot compete at this balance sheet level. (The Decode)   IBM and Red Hat Launch Project Lightwell: $5B to Secure Open-Source Software IBM and Red Hat committed $5 billion and a global force of 20,000 engineers to secure open-source software for enterprises through frontier agentic AI, anchored by 11 of the largest US and Canadian banks including Bank of America, Goldman Sachs, JPMorgan Chase, Mastercard, and Visa. Pat's read: this is the productization answer to Anthropic Mythos. Mythos found the vulnerabilities. Lightwell is the industrial-scale patching and validation layer enterprises can actually buy on a subscription. Daniel adds that IBM is flexing its engineering talent base as a premium strategic asset, a direct counter to the narrative that AI replaces engineers. (The Decode)   Anthropic Project Glasswing: 23,000 Vulnerabilities Found Across 1,000 OSS Projects Anthropic's Claude Mythos scanned more than 1,000 widely deployed open-source projects and surfaced approximately 23,000 candidate vulnerabilities, with 1,094 confirmed as critical severity. The Cyber Verification Program now gates the strongest cyber-capable Claude variant behind vetted defenders only. While the tool creates real value, the surface of attack will likely grow as fast as any tool built to defend it. (The Decode)   Anthropic in Talks to Run Claude on Microsoft Maia 200 CNBC and The Information reported Microsoft is in active negotiations to supply Anthropic with its custom Maia 200 inference chip, which would make Anthropic the only frontier lab simultaneously running production workloads on four distinct silicon stacks: NVIDIA, AWS Trainium, Google TPU, and Microsoft Maia. Pat's context: Maia 200 delivers 30% better tokens per dollar than the latest Azure fleet per Satya Nadella, and this deal would be Maia's first major external deployment. Daniel's read: what can be built will be sold right now, and Anthropic chasing every available compute source is simply the structural reality of growing at 80x when you planned for 10x. (The Decode)   The Flip: Is AI CapEx Too Expensive to Earn Its Return? Pat takes the affirmative. With $725 billion in hyperscaler CapEx tracking for 2026, likely $1 trillion next year, memory has become the choke point making it even more expensive, and open-source models have closed enough of the quality gap for most enterprise tasks that the premium of frontier APIs is increasingly hard to justify. A recent Signal65 white paper shows on-prem payback at 18 months. Daniel's counter: Dell just booked $24 billion in AI orders in a single quarter. Agentforce crossed $1 billion ARR at 169% growth. NVIDIA guided to $91 billion. Only 20% of enterprises are using AI and only 2% of consumers. Both hosts admitted off the flip their notes looked nearly identical. (The Flip)   Micron Crosses $1 Trillion Market Cap Micron became the 12th US company ever to cross $1 trillion in market cap, surging 19% on May 26th as UBS raised its price target to $1,625, implying a $1.8 trillion market cap. Samsung's Q1 memory ASP jumped 146% year over year. DRAM spot prices spiked 55 to 60% quarter over quarter. Daniel has been pounding this call since sub-$100 and calls it a cycle elongated beyond anything seen in the 27 prior memory cycles, driven by HBM capacity reallocation away from consumer DRAM creating structural shortage. (Bulls and Bears)   Dell Technologies Q1 FY27: The Biggest Enterprise AI Infrastructure Print of 2026 Record $43.8 billion revenue, up 88% year over year, crushing the $35.7 billion consensus by $8 billion. AI-optimized servers at $16.1 billion, up 757% year over year. $24.4 billion in AI orders booked in a single quarter. FY27 AI server revenue guide raised from $50 billion to $60 billion. Non-GAAP EPS of $4.86 beat the $2.96 consensus by 64%. Stock up 18% after hours. Pat's framing: Dell was very clear about what they were going to do. Rack engineering, sales, and service. The basics. And they executed the basics at an extraordinary level while building a special relationship with NVIDIA who views Dell as a market maker for both enterprise and NeoCloud. Daniel's add: play nice and win. Michael Dell navigated the political landscape brilliantly and pulled the entire Dell brand along with him. (Bulls and Bears)   Marvell Technology Q1 FY27: Record Revenue, Data Center at 76% of Mix Record $2.418 billion revenue, up 28% year over year. Data center at $1.833 billion, up 27% year over year, now 76% of total revenue. Q2 guide of $2.7 billion at midpoint accelerates growth to 35% year over year. Operating cash flow a record $638.8 million. Daniel went on TV and said it's "written in the stars," arguing the market had misunderstood this one for too long by conflating its custom AI ASIC story with the full breadth of its connectivity and networking portfolio. Pat's closing: the shorts are eating it now and the custom AI ASIC versus merchant GPU debate is finally settling into the right answer, which is both in lockstep. (Bulls and Bears)   Salesforce Q1 FY27: Agentforce Crosses $1 Billion ARR Revenue $11.13 billion, up 13% year over year. Non-GAAP EPS of $3.88 crushed the $3.12 consensus by 24%. Agentforce ARR crossed $1 billion, up 169% year over year, with 28.6 trillion tokens processed, up 152% quarter over quarter. 50% of Agentforce bookings came from existing customers expanding. Daniel flagged the $25 billion accelerated buyback funded by new debt as an interesting signal worth watching. Pat's bottom line: it's not perfect, but certainly no "SaaSpocalypse" in those numbers. (Bulls and Bears)   Synopsys Q2 FY26: First Full Quarter With Ansys Integrated Revenue $2.276 billion, up 42% year over year, beating consensus. Non-GAAP EPS of $3.35 beat $3.15. FY26 guide raised to $9.665 billion midpoint. Daniel's framing: every chip runs through Synopsys tools, and the Ansys addition makes it the full-stack co-design platform Jensen Huang keeps talking about. Synopsys is not just the pick and shovel of current AI silicon. It is the pick and shovel of quantum, robotics, and space as well. (Bulls and Bears)   Snowflake Q1 FY27: Strongest Sequential Dollar Growth in Company History Product revenue $1.33 billion, up 34% year over year, the strongest sequential dollar growth in Snowflake history. Net revenue retention 126%. FY27 product revenue guide raised to $5.84 billion. Natoma acquisition announced for secure agentic enterprise connectivity. New $6 billion multi-year AWS commitment. Daniel's closing: proprietary unique data is the real moat of the agentic era, and that data has to live somewhere. It is going to go to platforms like Snowflake. (Bulls and Bears)   HP Inc. Q2 FY26: Eight Straight Quarters of Growth With AI PCs at 44% of Shipments Revenue $14.4 billion, up 9% year over year, the company marks its eighth consecutive quarter of top-line growth. Non-GAAP EPS of $0.86 beat the prior guide. Personal Systems at $10.2 billion, up 13%, with 30% operating profit growth. AI PCs jumped from 35% to 44% of shipments quarter over quarter, with HP guiding to 60 to 70% next fiscal year. FY26 EPS guide raised. Pat's note: they still need a permanent CEO, which would help investors sleep better at night. Daniel's add: the real explosive moment for device companies comes when AI moves to the edge and enterprises shift from expensive frontier model consumption to on-device inference. (Bulls and Bears)   Everpure Q1 FY27: Record Revenue, Rebrand Complete Record revenue of $1.1 billion, up 35% year over year. Product revenue $577 million, up 55%. Subscription ARR at $2 billion. FY27 guide raised to $4.41 to $4.51 billion. Pure Storage officially completed its rebrand to Everpure. Daniel's emerging thesis: the agentic era has focused enormous attention on memory and compute, but after the inference runs, the data has to sit somewhere. Storage has not seen its full inflection yet and Everpure is well positioned when that wave arrives. (Bulls and Bears)   The Decode Anthropic Releases Claude Opus 4.8 May 28  https://techcrunch.com/2026/05/28/anthropic-releases-opus-4-8-with-new-dynamic-workflow-tool/ IBM Commits $10B Over Five Years to Quantum Computing the Same Day as $5B Project Lightwell, Bringing IBM's One-Day AI https://www.barrons.com/articles/ibm-stock-quantum-computing-aafbb1eb IBM + Red Hat Announce Project Lightwell  https://newsroom.ibm.com/2026-05-28-ibm-and-red-hat-commit-5-billion-to-redefine-the-future-of-open-source-in-the-ai-era Anthropic Project Glasswing / Claude Mythos Finds 23,000 Potential Vulnerabilities Across 1,000+ Open-Source Projects https://www.securityweek.com/anthropic-mythos-detected-23000-potential-vulnerabilities-across-1000-oss-projects/ Anthropic Negotiating to Run Claude on Microsoft's Maia 200 AI Chips  https://www.cnbc.com/2026/05/21/anthropic-microsoft-maia-200-ai-chip.html OpenAI + Anthropic Walk Back the AI Jobs Apocalypse Ahead of IPOs https://finance.yahoo.com/sectors/technology/articles/ai-chiefs-walk-back-job-193605798.html https://x.com/RiskCentre/status/2059397756016611668 The Flip Is AI Capex Becoming Too Expensive to Earn Its Return — and Will the Result Be a Forced Shift to Open-Source and Smaller Use-Case-Specific Models, or a Continued $725B+ Hyperscaler Buildout That Vindicates the Capex on Productivity Gains? FOR:  The shift is to open-source + smaller use-case-specific models with better token economics, not away from AI https://x.com/danielnewmanUV/status/2059822712122400975 DeepSeek 75% permanent price cut + Anthropic Claude Code restriction reversal https://www.buildfastwithai.com/blogs/ai-news-today-may-26-2026 $190B Microsoft capex + $725B+ aggregate hyperscaler capex with no analog ROI yet  https://www.buildfastwithai.com/blogs/ai-news-today-may-26-2026   AGAINST:  Salesforce Agentforce ARR crossed $1B this quarter on 28.6T tokens processed  https://www.stocktitan.net/sec-filings/CRM/8-k-salesforce-inc-reports-material-event-3b8ead2852bb.html Lenovo +105% AI revenue, +84% Q4; Dell $43B AI backlog: the AI infrastructure flywheel is converting capex to revenue today https://investor.marvell.com/news-events/press-releases/detail/1023/marvell-technology-inc-reports-first-quarter-of-fiscal-year-2027-financial-results NVIDIA $91B Q2 guide + $1T Blackwell+Vera Rubin CY25-CY27 reaffirmed  https://www.cnbc.com/2026/05/20/were-raising-our-price-target-on-nvidia-after-another-knockout-quarter-and-guide-.html DeepSeek + Chinese price war is a Chinese export-controls story, not a US economic ceiling story https://www.cnbc.com/2026/05/21/anthropic-microsoft-maia-200-ai-chip.html   Bulls & Bears Micron (NASDAQ: MU) Crosses $1 TRILLION Market Cap for the First Time https://www.cnbc.com/2026/05/26/micron-stock-trillion-market-cap.html Dell Technologies Q1 FY27 ACTUALS  https://www.cnbc.com/2026/05/28/dell-q1-earnings-report-2027.html Marvell Technology Q1 FY27 ACTUALS https://investor.marvell.com/news-events/press-releases/detail/1023/marvell-technology-inc-reports-first-quarter-of-fiscal-year-2027-financial-results Salesforce CRM Q1 FY27 ACTUALS  https://investor.salesforce.com/financials/quarterly-results/ Synopsys SNPS Q2 FY26 ACTUALS https://investor.synopsys.com/events-and-presentations/events/event-details/2026/Q2-Fiscal-Year-2026-Earnings/default.aspx Snowflake SNOW Q1 FY27 ACTUALS  https://www.businesswire.com/news/home/20260527027931/en/Snowflake-Reports-Financial-Results-for-the-First-Quarter-of-Fiscal-2027 HP Inc. HPQ Q2 FY26 ACTUALS https://finance.yahoo.com/markets/stocks/articles/hp-q2-earnings-call-highlights-230459161.html Everpure (NYSE: P, formerly Pure Storage) Q1 FY27 ACTUALS  https://investor.salesforce.com/financials/quarterly-results/ Synopsys SNPS Q2 FY26 ACTUALS https://investor.synopsys.com/events-and-presentations/events/event-details/2026/Q2-Fiscal-Year-2026-Earnings/default.aspx Snowflake SNOW Q1 FY27 ACTUALS  https://www.businesswire.com/news/home/20260527027931/en/Snowflake-Reports-Financial-Results-for-the-First-Quarter-of-Fiscal-2027 HP Inc. HPQ Q2 FY26 ACTUALS  https://finance.yahoo.com/markets/stocks/articles/hp-q2-earnings-call-highlights-230459161.html Everpure (NYSE: P, formerly Pure Storage) Q1 FY27 ACTUALS https://www.prnewswire.com/news-releases/everpure-announces-first-quarter-fiscal-2027-financial-results-302783502.html

Crazy Wisdom
Episode #549: From MS-DOS to Vibe Coding: How Non-Technical Founders Build Complex Software

Crazy Wisdom

Play Episode Listen Later May 29, 2026 70:14


Stewart Alsop sat down with Michael Shackelford to discuss their experiences building applications through vibe coding—the practice of using AI to create software without traditional programming expertise. Stewart, who runs the AI Whispers community in Buenos Aires and hosts the Crazy Wisdom podcast (with over 660 interviews), shared how he went from teaching people prompt engineering to building his own video conferencing software as a Riverside.fm replacement, while Michael opened up about his year-long journey creating Genrupt Inc, an AI-powered content generation tool for e-commerce sellers. The conversation covered everything from the decline in quality of Claude's reasoning capabilities and how Chinese companies used distillation attacks to copy Anthropic's models, to the importance of spaced repetition systems for managing knowledge in the age of LLMs, with both sharing battle-tested prompting strategies like asking AI to "explain it to me in genius terms" and using deep research queries to reverse engineer how competitors build their products.Show Notes:- Dan Martell's book "Buy Back Your Time" was mentioned as one of the best business books for thinking about life and business- Check out John Vervaeke's "Awakening from the Meaning Crisis" for understanding relevance realization and why AI fundamentally cannot determine what's relevant to humans without being toldTimestamps00:00 Michael discusses being exhausted from getting his app ready for launch, working nonstop with AI to prepare landing page for podcast traffic driving beta signups05:00 Stewart explains starting AI Whispers in Buenos Aires after leaving OpenAI vendor company, meeting early adopters like Torin who was building mind-reading EEG technology10:00 Discussion of how corporations resist AI adoption due to political games and job security fears while some companies use AI as excuse for pandemic-era layoffs15:00 Stewart describes teaching workshops on using LLMs as linguistic tools rather than coding tools, noting technical people often lack humanities background needed for prompting20:00 Explaining chatbot wrappers, API calls, and how Anthropic's reasoning quality declined after Chinese distillation attacks copied their secret sauce developed with philosophers25:00 Technical discussion of model training, fine-tuning versus RAG for new information, and different approaches to updating AI knowledge beyond initial training30:00 Stewart describes building podcast recording software to replace expensive Riverside, struggling with syncing audio and video files across different computer clocks35:00 Discussion of critical factors in vibe coding, discovering unknown technical requirements, and how AIs don't automatically reveal missing information40:00 Stewart's reverse engineering process using deep research function to study competitors' hiring and technology stacks, separating planning agents from coding agents45:00 Prompting techniques including "explain like I know everything" and using spaced repetition systems to capture valuable prompts and technical knowledge50:00 Michael explains his Generux app for generating ecommerce content using Amazon review data analysis to inform high-converting listing images and videos55:00 Discussion of founder mentality involving self-delusion about project timelines, Michael working nine-plus hours daily for nine months on app development60:00 Comparing Amazon's expert software to prosumer software approach, discussing distribution challenges and future robotics applications for customized products65:00 Stewart demonstrates spaced repetition app for memory improvement and knowledge retention, explaining relevance realization problem that AI agents cannot solve without embodimentKey Insights1. Stewart Alsop started AI Whisperers in Buenos Aires after leaving his role at Invisible Technologies, which was OpenAI's largest vendor for RLHF work. He noticed that machine learning engineers at tech companies lacked the humanities background needed to properly interact with large language models, which are fundamentally linguistic tools. This led him to create weekly workshops teaching non-technical people how to use AI effectively, running events every Thursday for two years straight. The group attracted intense geeks from the start and eventually led to Stewart speaking right after Vitalik Buterin at DevConnect, marking a significant milestone for the community.2. Large corporations are resistant to AI adoption due to multiple factors including political dynamics within organizations and employees fearing job loss. Many companies that grew during the pandemic are now using AI as an excuse to downsize when the real issue is inefficiency from rapid expansion. Stewart observed that even technical people in machine learning often don't understand how to properly use AI tools because they lack linguistic and humanities training. The fundamental problem is educational, requiring companies to train people how to use these new tools while those same people resist learning them.3. Vibe coding has evolved significantly with Claude Code being a game changer that reduced the technical barrier to entry. Before Claude Code, developers needed substantial technical knowledge to work through constant doom loops and debugging cycles. The success of coding AI tools stems from thirty years of testing infrastructure that provides clear yes or no feedback on whether code works. This infrastructure doesn't exist in the same way for manufacturing, science, and other fields, which is why software became the dominant area for AI assistance initially.4. Claude's quality degradation over recent months resulted from multiple factors including distillation attacks by Chinese companies who reverse engineered Anthropic's reasoning capabilities. Anthropic had hired philosophers, sociologists, and psychologists to develop exceptional reasoning in Claude 4.5, but this was expensive to run. When Chinese models like Kimi copied these capabilities at one tenth the cost, and when mainstream users flooded the platform before Anthropic's planned IPO, the company had to reduce quality to manage computational costs. This represents a significant loss for power users who relied on Claude's superior reasoning abilities.5. Stewart built a podcast recording application to replace Riverside because he needed API access to automate workflows, which Riverside wanted one thousand dollars monthly to provide. The technical challenge involves syncing audio and video from local recordings on multiple computers with different clocks through a server, then merging them so voices match lip movements. This problem requires understanding complex timing issues across different network conditions and file formats. Stewart has been working through AI psychosis for months on this FFMPEG pipeline problem, illustrating how vibe coding still requires building intuition about technical problems even without traditional coding knowledge.6. The transition from expert software to prosumer software represents a major opportunity for AI-enabled tools. Expert software like Photoshop, Blender, and terminal interfaces have extreme complexity that intimidates beginners, but AI is making these capabilities accessible through natural language. The reign of specialists is ending as generalists with broad knowledge and curiosity can now build complete applications by leveraging AI to fill technical gaps. This shift particularly benefits entrepreneurs and founders who specialize in getting into difficult situations and figuring them out, even when they originally thought tasks would be easier than they turned out to be.7. Building applications with AI requires accepting massive time investments beyond initial estimates and developing strategies for overcoming knowledge gaps. Michael estimated his ecommerce content generation app would take months but spent nearly a year working over nine hours daily, while Stewart spent months solving audio-video sync issues. Success requires using tools like deep research to understand how competitors solve problems, maintaining separate planning and coding agents, and learning to ask the right questions. The key insight is that vibe coders can achieve ninety percent of functionality independently, but the final ten percent often requires understanding specific technical concepts that AI cannot intuit without proper context and domain knowledge.

MLOps.community
AI Is Fast. AI Projects Are Slow. Let's Fix That.

MLOps.community

Play Episode Listen Later May 29, 2026 56:47


Joe Maionchi (Co-founder & COO) and Rod Christensen (Co-founder & Chief Architect) of RocketRide join the MLOps Community to walk through AIDE — the AI Integrated Development Environment. RocketRide is an open-source AI pipeline platform that lets developers build, debug, and run production-grade agentic AI workflows directly from their IDE, with support for 13+ LLM providers, 8+ vector databases, and full multi-agent orchestration.AI Is Fast. AI Projects Are Slow. Let's Fix That. // MLOps Podcast #378 with JRocketRide's Joe Maionchi (Co-founder & COO) and Rod Christensen (Co-founder & Chief Architect)A huge shout-out to  ⁨RocketRide⁩  for this collaboration!

SANS Internet Stormcenter Daily Network/Cyber Security and Information Security Stormcast
SANS Stormcast Thursday, May 28th, 2026: Akira Ransomware; Vaultjacking; Poisoned Chatbot and Search Results;

SANS Internet Stormcenter Daily Network/Cyber Security and Information Security Stormcast

Play Episode Listen Later May 28, 2026 6:04


Reconstructing an Akira Ransomware Kill Chain from Perimeter and Endpoint Logs https://isc.sans.edu/diary/Reconstructing%20an%20Akira%20Ransomware%20Kill%20Chain%20from%20Perimeter%20and%20Endpoint%20Logs/33024 Vaultjacking: One Captured PIN, the Entire Google Password Manager Vault https://phishu.net/blogs/blog-vaultjacking-phishing-the-google-password-manager-vault-in-the-phishu-framework.html From poisoned search results to GPU mining: A cryptojacking campaign abusing ScreenConnect and Microsoft .NET utilities https://www.microsoft.com/en-us/security/blog/2026/05/26/poisoned-search-results-gpu-mining-cryptojacking-campaign-abusing-screenconnect-microsoft-net-utilities/

PodRocket - A web development podcast from LogRocket
Zed 1.0... GPUI, Rust, and the future of native apps with Mikayla Maki

PodRocket - A web development podcast from LogRocket

Play Episode Listen Later May 28, 2026 36:32


Mikayla Maki, software engineer at Zed, digs into what makes this Rust-built code editor tick... from GPUI, their GPU-accelerated UI framework with a Tailwind-inspired API, to CRDTs powering real-time live collaboration without merge conflicts. She talks about the Zed 1.0 release, their approach to AI, how the team builds popular features directly into core instead of relying on extensions, and why Rust might be the best language for agentic coding. Plus: native app comeback, GPUI on mobile, and where the framework is heading. Links LinkedIn: https://www.linkedin.com/in/mikayla-maki Bluesky: https://bsky.app/profile/rad.gendervibes.online GitHub: https://github.com/mikayla-maki Resources Zed 1.0 announcement: https://zed.dev/blog/zed-1-0 DeltaDB / Sequoia Series B post: https://zed.dev/blog/sequoia-backs-zed ACP overview: https://zed.dev/acp GPUI engineering post: https://zed.dev/blog/leveraging-rust-and-the-gpu-to-render-user-interfaces-at-120fps Builder.io "Is Zed ready for AI power users in 2026?": https://www.builder.io/blog/zed-ai-2026 Mikayla's RustConf 2025 talk: https://www.youtube.com/watch?v=rpEU9DNbXA4 filtra.io interview with Mikayla: https://filtra.io/rust/interviews/zed-aug-25 We want to hear from you! How did you find us? Did you see us on Twitter? In a newsletter? Or maybe we were recommended by a friend? Fill out our listener survey! https://t.co/oKVAEXipxu Let us know by sending an email to our producer, Elizabeth, at elizabeth.becz@logrocket.com, or tweet at us at PodRocketPod. Check out our newsletter! https://blog.logrocket.com/the-replay-newsletter/ Follow us. Get free stickers. Follow us on Apple Podcasts, fill out this form, and we'll send you free PodRocket stickers! What does LogRocket do? LogRocket provides AI-first session replay and analytics that surfaces the UX and technical issues impacting user experiences. Start understanding where your users are struggling by trying it for free at LogRocket.com. Try LogRocket for free today. Chapters

The Neuron: AI Explained
What Comes After GPUs? Great Sky's Bet on Brain-Like AI

The Neuron: AI Explained

Play Episode Listen Later May 27, 2026 59:54


What if the next big AI breakthrough is not a bigger model, but a completely different kind of computer?Jeff Shainline, co-founder and CEO of Great Sky, joins The Neuron to explain how his team is building brain-inspired AI hardware using superconductors, photonics, and analog computation. Great Sky's architecture, called Superconducting Optoelectronic Networks, or SOENs, is designed to move beyond the traditional GPU roadmap by co-locating memory and processing, communicating with light, and mimicking some of the high-connectivity dynamics found in biological brains.In this conversation, Jeff breaks down why today's chips can struggle with fast, multimodal inference; why transformers may be powerful but inefficient for some future workloads; how Great Sky's system differs from quantum computing; and why early applications could include fusion reactors, particle physics, video understanding, content moderation, and eventually new model architectures that do not map neatly onto today's hardware.Subscribe to The Neuron for grounded, practical conversations about where AI is going next—and what actually has to work before the hype becomes real.

Irish Tech News Audio Articles
AI Is Forcing a Rethink of Data Centre Cooling and Power Why cooling matters now? Data Centre Cooling trends to watch Rapid growth of AI is beginning to change the model What liquid cooling actually looks like? Why speed is becoming critical? Existing dat

Irish Tech News Audio Articles

Play Episode Listen Later May 27, 2026 6:27


Schneider Electric and Motivair showcase how AI is reshaping cooling and power systems inside modern data centres Irish Tech News joined a group of international technology and infrastructure media in Buffalo, New York last week for a Schneider Electric-hosted briefing and site visits focused on the infrastructure emerging around AI computing. The programme included presentations from Schneider Electric engineers and executives, a visit to Motivair, the liquid cooling company acquired by Schneider Electric in 2024, and tours of TeraWulf's expanding AI infrastructure campus in New York State, where Schneider Electric technologies are being deployed as part of the wider infrastructure buildout. A major theme throughout the visit was the speed at which new cooling, power and infrastructure solutions are now being deployed as AI systems dramatically increase computing density inside modern data centres. For years, most data centres relied mainly on air cooling. Chilled air circulated through server halls to remove heat generated by computing equipment. That model worked reasonably well for earlier generations of enterprise computing and cloud infrastructure where heat densities were lower. Companies such as NVIDIA have dramatically increased the processing power packed into these systems, particularly for AI training and inference workloads. The result is that modern AI racks can generate far greater levels of heat than previous generations of computing infrastructure. According to Tuan Hoang, Head of Product Development and Innovation for Schneider Electric's cooling business, traditional air cooling systems are now approaching practical limits at very high rack densities. "Liquid cooling allows us to do that," Hoang told me during the visit. "It has 4,000% more heat capacity than air." At the Motivair facility and later at the TeraWulf site, journalists were shown examples of liquid cooling systems now being deployed beside high-density AI infrastructure. See photo above. Instead of relying entirely on chilled air circulating through server halls, the systems use liquid circulating through cooling units connected into server environments to remove heat more efficiently and closer to the chips themselves. The visual impression is quite different from the public image many people still have of data centres. Large pipework systems, cooling units, power systems and engineering infrastructure increasingly dominate these environments as AI deployments scale. Hoang stressed repeatedly that air cooling is not disappearing entirely. "It is still necessary," he explained, particularly for wider facility environments and supporting infrastructure. What is changing is the balance between air and liquid cooling as AI workloads become denser and more power-intensive. Another major theme during the Buffalo visit was speed. AI infrastructure operators increasingly want facilities operational as quickly as possible because expensive GPU systems only generate returns once deployed and running. That pressure is driving growing interest in modular infrastructure, prefabricated systems and repeatable engineering designs which can be deployed more rapidly than traditional bespoke builds. The scale and pace of construction at the Lake Mariner TeraWulf campus reflected that urgency. One AI-focused facility is already operational while further expansion continues across the wider site. Hoang also discussed the growing challenge of adapting existing data centres for AI workloads rather than building entirely new facilities from scratch. "A lot of customers are trying to retrofit existing data centres," he said during the interview, explaining that many operators are now attempting to adapt infrastructure originally designed for lower-density cloud computing. That pressure is one reason modular cooling systems and repeatable infrastructure designs are becoming increasingly important as AI deployments scale. The discussions also highlighted how AI is be...

Ask The Tech Guys (Audio)
HOT 268: Laptop Recommendations - The Right Laptop For Your Photography Needs

Ask The Tech Guys (Audio)

Play Episode Listen Later May 25, 2026 28:54


In this week's episode of Hands-On Tech, Robert asks Mikah for help choosing a new Windows laptop suited for heavy photo and video editing work, including guidance on GPU and VRAM requirements for his specific software stack, as well as advice on whether switching to a Mac is a viable option after a disastrous previous migration experience. Don't forget to send in your questions for Mikah to answer during the show! hot@twit.tv Host: Mikah Sargent Download or subscribe to Hands-On Tech at https://twit.tv/shows/hands-on-tech Join Club TWiT for Ad-Free Podcasts! Support what you love and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Join today: https://twit.tv/clubtwit Club TWiT members can discuss this episode and leave feedback in the Club TWiT Discord. Sponsors: outsystems.com/twit shopify.com/hot

The Tech Guy (Video HI)
HOT 268: Laptop Recommendations - The Right Laptop For Your Photography Needs

The Tech Guy (Video HI)

Play Episode Listen Later May 25, 2026 28:54


In this week's episode of Hands-On Tech, Robert asks Mikah for help choosing a new Windows laptop suited for heavy photo and video editing work, including guidance on GPU and VRAM requirements for his specific software stack, as well as advice on whether switching to a Mac is a viable option after a disastrous previous migration experience. Don't forget to send in your questions for Mikah to answer during the show! hot@twit.tv Host: Mikah Sargent Download or subscribe to Hands-On Tech at https://twit.tv/shows/hands-on-tech Join Club TWiT for Ad-Free Podcasts! Support what you love and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Join today: https://twit.tv/clubtwit Club TWiT members can discuss this episode and leave feedback in the Club TWiT Discord. Sponsors: outsystems.com/twit shopify.com/hot

Brad & Will Made a Tech Pod.
340: Like a Bong for Your CPU

Brad & Will Made a Tech Pod.

Play Episode Listen Later May 24, 2026 70:13


Brad's tired of throttling his CPU due to an inadequate heatsink. Will's been spending a lot more time testing PC hardware of late. Between those two things, we thought it was a good time to do a check-in on CPU cooling, and primarily liquid cooling, so we can establish the facts on the ground about modern AIOs and custom loops with an eye toward helping Brad decide what to get. Turns out, there's more to know than ever, and yet it's also never been simpler. We also talk a little about modern air cooling, CPU spikes in Windows, and other stuff! The GamersNexus video on AIO placement: https://www.youtube.com/watch?v=BbGomv195sk Support the Pod! Contribute to the Tech Pod Patreon and get access to our booming Discord, a monthly bonus episode, your name in the credits, and other great benefits! You can support the show at: https://patreon.com/techpod

All TWiT.tv Shows (MP3)
Hands-On Tech 268: Laptop Recommendations

All TWiT.tv Shows (MP3)

Play Episode Listen Later May 24, 2026 28:54 Transcription Available


In this week's episode of Hands-On Tech, Robert asks Mikah for help choosing a new Windows laptop suited for heavy photo and video editing work, including guidance on GPU and VRAM requirements for his specific software stack, as well as advice on whether switching to a Mac is a viable option after a disastrous previous migration experience. Don't forget to send in your questions for Mikah to answer during the show! hot@twit.tv Host: Mikah Sargent Download or subscribe to Hands-On Tech at https://twit.tv/shows/hands-on-tech Join Club TWiT for Ad-Free Podcasts! Support what you love and get ad-free audio and video feeds, a members-only Discord, and exclusive content. Join today: https://twit.tv/clubtwit Club TWiT members can discuss this episode and leave feedback in the Club TWiT Discord. Sponsors: outsystems.com/twit shopify.com/hot

Alexa's Input (AI)
Intelligence Per Watt with Emilio Andere

Alexa's Input (AI)

Play Episode Listen Later May 24, 2026 73:30


On this episode of Alexa's Input (AI), I sit down with Emilio Andere, co-founder and CEO of Wafer, to talk about the future of AI infrastructure, inference optimization, and the economics driving the AI compute race.We discuss:why “intelligence per watt” may become one of the defining metrics of the AI erathe current GPU and accelerator landscape across NVIDIA, AMD, TPUs, and emerging hardware startupswhy software optimization is becoming just as important as hardware itselfinference optimization strategieswhy AI infrastructure companies are racing up the stackwhat it's actually like building an AI infrastructure startup todayand more!Emilio also shares lessons from founding Wafer, thoughts on the future of open-source AI infrastructure, and why he believes optimizing intelligence itself could become one of the most important engineering problems.General Podcast LinksWatch: ⁠⁠⁠⁠⁠⁠https://www.youtube.com/@alexa_griffith⁠⁠⁠⁠⁠⁠Read: ⁠⁠⁠⁠⁠⁠⁠⁠https://alexasinput.substack.com/⁠⁠⁠⁠⁠⁠⁠⁠Listen:⁠⁠ ⁠⁠https://creators.spotify.com/pod/profile/alexagriffith/⁠⁠⁠⁠More: ⁠⁠⁠⁠⁠⁠https://linktr.ee/alexagriffith⁠⁠⁠⁠⁠⁠Learn more about the host atWebsite: ⁠⁠⁠⁠⁠⁠https://alexagriffith.com/⁠⁠⁠⁠⁠⁠LinkedIn: ⁠⁠⁠⁠⁠⁠https://www.linkedin.com/in/alexa-griffith/⁠⁠⁠⁠⁠⁠Find out more about the guest at:LinkedIn: https://www.linkedin.com/in/emi-andere/Wafer Website: https://www.wafer.ai/Wafer AI / Y Combinator Article: https://www.ycombinator.com/companies/waferChapters00:00 Exploring AI Conversations and Recent Podcasts02:14 Intelligence per Watt: A New Metric for AI07:35 The Manifesto: Efficiency in Civilization12:40 Founding Wafer: The Journey Begins18:08 The GPU Hardware Landscape and Market Dynamics23:07 AMD's Growing Presence in the GPU Market24:07 Emerging Competitors in the AI Hardware Space26:04 Comparing TPUs and GPUs27:21 Acquisition and Availability of TPUs28:33 Navigating the GPU Marketplace30:05 Understanding Neo Cloud Economics33:30 The AI Bubble Debate36:25 Optimizing AI Models for Performance44:46 Bottlenecks in AI Model Performance48:08 Future Directions in AI Hardware Optimization54:39 Balancing Speed and Cost in AI Performance56:54 Kernel Arena: Benchmarking AI Performance01:03:45 Lessons from Founding: Sales and Emotional Resilience01:07:38 The Future of AI: Trends and Predictions01:13:03 Outro KeywordsAI hardware, inference optimization, intelligence per watt, GPU market, AI infrastructure, Wafer, AI bubble, TPU, GPU bottleneck, AI efficiency AI optimization, large language models, AI hardware, quantization, speculative decoding, benchmarking, AI infrastructure, model training, AI startups

The Dividend Cafe
Data Center Drama

The Dividend Cafe

Play Episode Listen Later May 22, 2026 22:04


Today's Post - https://bahnsen.co/3R9QgGV In this Friday Dividend Cafe, David Bahnsen explains why data centers have become a major economic story, tracing their evolution from 1990s CPU-based server facilities to 2010s cloud-driven hyperscale warehouses and today's AI-focused GPU centers that require far more power, cooling, and infrastructure. He argues data center construction and related spending may have accounted for roughly 80% of last year's GDP growth, even as other real estate and industrial activity has been muted, drawing an analogy to the shale/fracking boom. Bahnsen supports data centers and future productivity potential but opposes federal efforts to override local zoning, warns against cronyism, emphasizes the need for a stronger public relations case, and highlights investment implications in adjacent areas like power, water, natural gas, and pipelines. 00:00 Welcome and Setup 00:52 Why Data Centers Matter 01:43 Three Eras of Data Centers 03:51 AI Shift to GPUs 05:42 Data Centers Driving GDP 08:29 Future Productivity Payoff 09:32 What Growth Is Missing 10:12 Fracking Analogy and Backlash 12:15 Localism Versus Federal Override 14:57 PR Playbook Five Points 17:23 Investing Wisely in the Theme 19:35 Wrap Up and Disclosures Links mentioned in this episode: DividendCafe.com TheBahnsenGroup.com

The Lunar Society
Reiner Pope – Chip design from the bottom up

The Lunar Society

Play Episode Listen Later May 22, 2026 80:30


New blackboard lecture with Reiner Pope: how do chips actually work - starting with basic logic gates, and working up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do.Reiner is CEO of MatX, a new chip startup (full disclosure - I'm an angel investor). He was previously at Google, where he worked on software efficiency, compilers, and TPU architecture.Watch this one on YouTube so you can see the chalkboard. Read the transcript.Sponsors* Crusoe was one of only five GPU clouds that made the gold tier in SemiAnalysis' most recent ClusterMAX report. Gold-tier providers like Crusoe delivered 5-15% lower TCO than silver-tier clouds, even with identical GPU pricing. This is because optimizations like early fault detection and rapid node replacement don't necessarily show up in the sticker price, but still matter a ton in the real world. Learn more at crusoe.ai/dwarkesh* Cursor is where I do most of my work—from reading research papers to visualizing technical concepts to coding up internal tools for the podcast. Most recently, I used it to build two different review interfaces for my essay contest, one that anonymizes submissions for scoring and another that lets me see applicants' essays next to their resumes and websites. Whatever you're working on, you should try doing it in Cursor. Get started at cursor.com/dwarkesh* Jane Street let me ask Ron Minsky and Dan Pontecorvo, two senior Jane Streeters, a bunch of questions about how they use AI. We discussed everything from the types of models they're training to how they think about the future of trading to why they're more bullish than ever on hiring technical talent. You can watch the full conversation and learn more about their open positions at janestreet.com/dwarkeshTimestamps00:00:00 – Building a multiply-accumulate from logic gates00:16:31 – Muxes and the cost of data movement00:26:10 – How systolic arrays work00:39:11 – Clock cycles and pipeline registers00:51:51 – FPGAs vs ASICs01:03:25 – Cache vs scratchpad01:07:27 – Why CPU cores are much bigger than GPU cores01:12:00 – Brains vs chips01:15:33 – A GPU is just a bunch of tiny TPUs Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe

Algorithms + Data Structures = Programs
Episode 287: AI Takes & AI Taxes

Algorithms + Data Structures = Programs

Play Episode Listen Later May 22, 2026 45:16 Transcription Available


In this episode, Conor and Bryce chat with Marco Franzreb Salgado about the state of AI, whether AI should do our taxes and more.Link to Episode 287 on WebsiteDiscuss this episode, leave a comment, or ask a question (on GitHub)SocialsADSP: The Podcast: TwitterConor Hoekstra: LinkTree / BioBryce Adelstein Lelbach: TwitterAbout the Guest:Marco is a software engineer at NVIDIA, where he works on improving the nvCOMP library, which offers fast GPU implementations of multiple data compression formats. For the past couple of months he has been working on a GPU implementation of the rotate algorithm.Show NotesDate Recorded: 2026-05-05Date Released: 2026-05-22ADSP Episode 237: Thrust with Jared HoberockADSP Episode 284: GPU RotateADSP Episode 285: GPU Rotate (Part 2)NVIDIA nvCOMPGOSIM PARISLife update: Zig, AI, unemployment, and moreIntro Song InfoMiss You by Sarah Jansen https://soundcloud.com/sarahjansenmusicCreative Commons — Attribution 3.0 Unported — CC BY 3.0Free Download / Stream: http://bit.ly/l-miss-youMusic promoted by Audio Library https://youtu.be/iYYxnasvfx8

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Take the 2026 AI Engineering Survey and get >$2k in credits and AIE WF tickets!On the product side, everyone is getting Computer - Perplexity, Manus, Cursor, and so on. Meanwhile on the research side, agentic evals like TerminalBench and GDPVal are also assuming computer (Harbor). On both ends, the consolidating LLM OS stack has become a standard toolkit, and Daytona is one of a small set of AI Infra companies that are booming because of it.“The end of localhost” has been Ivan Burazin's obsession for more than a decade.Something that is all too familiar…Long before agents became the default way people talked about software development, Ivan was already chasing the idea that development should not depend on a fragile local machine. CodeAnywhere, one of the first browser-based IDEs, was an early attempt at that future: move the development environment into the cloud, make setup reproducible, and free developers from the endless “works on my machine” tax.The thesis was directionally right, but the market wasn't ready yet.However, agents changed that. They do not care about a laptop, desk setup, or favorite editor. They need a computer they can access through an API: something stateful enough to keep working, fast enough to spin up instantly, flexible enough to resize, isolated enough to be safe, and composable enough to run the messy real-world workflows that real software engineering actually requires.Daytona isn't just selling “sandboxes” in the narrow code-execution sense. It is the latest version of Ivan's original localhost thesis.In this episode, Daytona's CEO joins swyx to explain why AI agents need more than code execution boxes: they need composable computers, stateful sandboxes, instant startup, dynamic resources, and infrastructure that can survive workloads going from zero to 100,000 CPUs.We go deep on the new agent compute market: Daytona's hard pivot from human dev environments to AI sandboxes, the New Year's Eve MVP that customers begged for, why Daytona runs on bare metal with its own scheduler, how one customer runs almost 850,000 sandboxes a day, and why RL/eval workloads went from 0% to roughly 50% of usage in just months. Ivan also explains why agents need Windows and macOS machines, why CLI may matter more than MCP, why Kubernetes is painful for this workload, and why the future AI cloud may look more like Stripe than AWS.We discuss:* How Daytona grew out of CodeAnywhere, Shift, and the “end of localhost” thesis* Why Daytona pivoted from human dev environments to AI sandboxes* Why agents need composable computers instead of disposable code execution boxes* The New Year's Eve MVP that customers chased API keys for* Why Daytona chose bare metal, stateful snapshots, and its own scheduler* How Daytona spins up one sandbox in ~60ms and 50,000 sandboxes in ~75 seconds* Why Daytona's biggest customer runs ~850,000 sandboxes a day* How RL/eval workloads create zero-to-100,000 CPU spikes* Why RL workloads went from 0% to roughly 50% of Daytona usage* Why customers compare Daytona against EKS/GKS and say they're “never going back”* Why every AI agent may need a computer, including Windows and macOS environments* The Apple licensing constraints that make macOS sandboxes hard* Why CLI gives agents more power than MCP* How open source helps agents integrate Daytona* Why agent-generated PRs may break today's CI/CD assumptions* Why AI SaaS companies reselling tokens may face a cold shower* Why the AI cloud may look more like Stripe than AWSIvan Burazin* LinkedIn: https://www.linkedin.com/in/ivanburazin* X: https://x.com/ivanburazinDaytona* Website: https://www.daytona.io* X: https://x.com/daytonaioTimestamps* 00:00:00 Hook* 00:01:12 Introduction* 00:03:15 CodeAnywhere, Shift, and the end of localhost* 00:05:58 What Daytona is: composable computers for AI agents* 00:08:07 The pivot from dev environments to AI sandboxes* 00:10:17 The New Year's Eve MVP and customers begging for API keys* 00:12:56 Bare metal, stateful sandboxes, and Daytona's scheduler* 00:17:28 60ms startup, 50,000 sandboxes, and 850K daily runs* 00:21:53 Spiky RL/eval workloads and the new agent infra problem* 00:28:12 RL workloads, Kubernetes pain, and dynamic resizing* 00:33:31 Why every AI agent needs a computer* 00:38:48 macOS sandboxes and Apple's licensing problem* 00:44:28 Why CLI may matter more than MCP* 00:48:11 Open source, GitHub stars, and agent integration* 00:53:11 Git, CI/CD, and agent collaboration bottlenecks* 00:58:15 Founder life and building a 25-person infra company* 01:02:44 AI SaaS, token resale, and API-first business models* 01:06:10 GPU sandboxes, data centers, and compute growth* 01:09:48 Why the AI cloud may look more like Stripe than AWS* 01:11:26 Closing thoughtsTranscriptIntroduction: Daytona, CodeAnywhere, and the End of LocalhostSwyx [00:00:02]: Okay, we're in the studio with Ivan Burazin, CEO of Daytona. Welcome.Ivan [00:00:07]: Thanks for having me, man.Swyx [00:00:08]: Ivan, you and I go back.Ivan [00:00:10]: Way back.Swyx [00:00:11]: How I don't even know how, you found, did you reach out or, for Shift.Ivan [00:00:17]: I reached out to you. The reason was you - we were just - we were thinking about I was one of the co-founders of CodeAnywhere, the first browser-based IDE, and so we were thinking a long time of, localhost should die. And you had this article.Swyx [00:00:29]: End of localhost.Ivan [00:00:30]: Then I reached out to you because of that, and then we talked, and I was actually at a different job and learning about I was the head of, developer experience, and you were quite well-versed in that, and I actually reached out to you, among other people, how do we go about that? What are the key things and whatnot at this point in time? And you were nice enough to take the call, and I remember I was late on your call with you.Swyx [00:00:51]: I don't remember.Ivan [00:00:52]: I remember because I was with my then I'm thinking of a girlfriend or wife at that point in time, I'm not sure. It's the same person, so that's great, and I was late ‘cause we were, in, Italy on, vacation, and then I was late for something. I felt so bad, and you were so nice to be, good about.Swyx [00:01:10]: The reason I'm nice is because I'm also late to other people, so it's like, who's, who's without sin here, yeah, so I have to, for those who don't know, InfoBip Shift, there's this whole thing that, you did in the past, and, and that was basically one of the inspirations for me starting AI Engineer, which is like, I have to thank you for giving me that push to be like, “Oh, you can, you can build and sell conferences?”Ivan [00:01:34]: I remember you asked you asked me at the beginning to give me advisory shares, and I was so focused on what we were doing, I said no, and I should've took the advisory shares. So I'm sorry, dude. But anyway.Swyx [00:01:43]: We're not, we're not venture backed.Ivan [00:01:44]: No, it doesn't matter.Swyx [00:01:45]: It's Yeah, anyway, so I think what's impressive about you is that CodeAnywhere is the thing that you've been trying to build, and, you kind of put it on hold and then came back after InfoBip. Just give us the story, do you - the story and the origin story, going into Daytona.From CodeAnywhere and Shift to DaytonaIvan [00:02:05]: Sure. Like, really way back, me and my co-founder have been together. I say this, I've said this multiple times, it's like we were married and divorced and married. Some people actually ask me is my co-founder my partner. they thought it literally. It's not literally, but we have done multiple companies together, and to your point, we had this shift where we went from the CodeAnywhere to the conference called Shift, and then back to, Daytona. We originally started stacking servers, doing like virtualization in the early 2000s and, routers and doing basically all these things, at a foundational level, and that was a services company which we sold to focus on what my co-founder actually invented, which was the very first browser-based IDE, right, I say the first. Before us was actually Heroku. They did it for a very short time until they became Heroku. But outside of them, we were the only one, and it was called.Swyx [00:02:55]: There was Cloud9.Ivan [00:02:57]: Cloud9 came out slightly after us. There was Replit, which came out when we stopped doing it, Replit came out, and they have been successful since then, which is great. There was Nitrous.io. There was quite a few that existed at the time, but it was like too early. But the interesting part is that we, at that point in time, because there was no VS Code, there was no Kubernetes, and Docker had just started when we Or I'm not sure if it was even public at that point in time. And so we had to build everything to the whole stack ourselves and that was the key learning that we brought into and that we've been using in Daytona today. So it was super early. There's about 3 million people used CodeAnywhere. It was slightly, it was angel-backed more than venture-backed. We ended up paying everyone back because it didn't have that sort of scale. But, three years ago, we started something similar with Daytona, which is not what we are today, but it was automating dev environments for human engineers, the basically the underlying stack of CodeAnywhere. And then we did a hard pivot last January to sandboxes. And so here we are.Swyx [00:04:01]: Historic pivot, yeah, and, it's one of those things where, I had independently invested in CodeAnywhere, but also in E2B, and then both of you pivoted into the same thing, and I'm like, “F**k.”Ivan [00:04:12]: You invested, you invested in Daytona. You invested in Daytona. But you were the first If we had not got your check, we wouldn't have done it.Swyx [00:04:18]: No way.Ivan [00:04:19]: No, it was like, “We have to get him on board first,” and you were that kicker that we, that got us off the ground.Swyx [00:04:23]: No, because you were putting me on your pitch deck, man. I was like, “Man, this is like a good trip if I don't invest.”Ivan [00:04:29]: That's because it was your quote. It's like we.Swyx [00:04:30]: Yeah. It's the end of localhost.Ivan [00:04:31]: Did a bunch of research about end of localhost and who was interested in that,.Swyx [00:04:34]: No, that's like, I put, I wrote that blog post, and every single company in that field reached out to me, and then every VC who was receiving those pitches then also had to call me and, talk it, talk through it with me.Ivan [00:04:47]: It's finally happening though.Swyx [00:04:48]: It was really super interesting.Ivan [00:04:48]: It's finally happening.Swyx [00:04:49]: It's finally happening.Ivan [00:04:49]: Yeah, it's finally.Swyx [00:04:49]: It's finally happening, with maybe sort of non-human users. Yeah, so what is Daytona today? Let's get like a quick description. I'm wearing the shirt.What Daytona Is Today: Composable Computers for AI AgentsIvan [00:04:58]: You're wearing the shirt. Yes,.Swyx [00:04:59]: It says, I think your branding is very good. Like, it's very consistent. It runs AI code. Like, it cannot be simpler.Ivan [00:05:05]: Exactly, but we're gonna probably have to change that.Swyx [00:05:07]: Oh, s**t.Ivan [00:05:07]: It's also a subset of what we do. Unfortunately, we really love this, Run AI Code is super simple. People interpret it different ways. I think we've given out 5,000, 6,000 of these shirts. People wear them with pride because it doesn't really market about us.Swyx [00:05:21]: Yeah, Daytona's on the back.Ivan [00:05:22]: It markets the back. It markets to the person itself, so I think we did a really good job on that one. But it is also a subset of what we do, because people, when they think about Run AI Code, they just think about these small, let's call it isolates, code execution boxes that, you send some code, you get an output. Whereas what Daytona is today is essentially composable computers for AI agents. It is, the market calls them sandboxes which can be misleading.Swyx [00:05:44]: All these things. All these things on.Ivan [00:05:45]: Yeah, exactly, ‘cause it can be misleading ‘cause people usually think about sandboxes as a demo or a test environment versus a production-grade environment. But what Daytona does, if you think of the laptop that you have in front of you or the computer that's over there, or, my wife is an architect, so she has like a Windows with a 3D graphics card inside to do 3D rendering. Like, as humans, we have different computers or different compositions of computers. And our belief is strongly that agents today and going forward will need all these different compositions of computers to do different types of tasks. And so we offer that basically through an API.Swyx [00:06:19]: Yeah, to give people - I'm trying to sort of front-load all the aha moments or the wow moments so that people can, stay engaged and click like and subscribe. the market is exploding, right? Like, you have been reporting 74% month-on-month growth, and it also, it's just been growing for a while. Like, it's been going like this. And every single - It's not just you guys. It's every single.Ivan [00:06:41]: Everyone, yeah.Swyx [00:06:42]: Sort of, compute provider. I don't know if you agree with me saying compute provider or not.Ivan [00:06:48]: It's fine.Swyx [00:06:48]: Yeah. So like organically PLG-driven growth, but also enterprise is doing super well, I think I wanna rewind to January of last year when you did the pivot. Like, so you obviously called this market early, and you were positioned for it, and you are now one of the market leaders. But what was the insight that made you do the pivot?The Pivot: From Human Dev Environments to Agent SandboxesIvan [00:07:06]: The insight that made us do this pivot is the quarter before that, so end of 2024, when we had - Basically, we did a demo with - I don't I think we discussed this as well, Devin was not public. You actually gave me access to Devin at that time. So Devin.Swyx [00:07:25]: I did?Ivan [00:07:26]: Yeah, you gave me access.Swyx [00:07:26]: I don't think I was supposed.Ivan [00:07:27]: Yeah, exactly.Swyx [00:07:28]: Yeah, I.Ivan [00:07:28]: So it doesn't matter. You.Swyx [00:07:29]: Yeah. I gave like three friends access.Ivan [00:07:31]: Yeah, or it was a call and you showed it to me. It doesn't matter. but OpenDevin was available, which is now called OpenHands. And so we're like, “Oh, this seems to be a thing. This is not public. Let's take our for human automation of dev environments and take, OpenDevin and launch that as a SaaS.” And we did that. Not very many people signed up and used it, but a lot of people reached out that were building agents, and they were like, “Hey, my agent needs a compute sandbox runtime,” whatever you wanna call it. I forgot what it was called at that point. And then we were like, “Oh, amazing. This is a new market. Here is our infrastructure. Here's our product, and go.” And what we found really fast, soon, was that people did not like what we had built. It didn't work. And I remember talking to people at the beginning when we're doing this, the sandbox we're building for agents. People were like, “Oh, why is it different? It's the same thing. We have like EC2, we have VMs, we have all these things.” But we saw that everyone we gave it to, it was like 20, 30 people, they all said, “No.” Like, “This is not what we need. This sort of breaks.” And basically, me and my co-founder not knowing a lot about - ‘cause we're infra people. We're not AI people. So I basically took it upon myself to like watch every single podcast that exists, including all of, all of these and all that, and sort of get up to date, read all the blogs, like get, understand what's going on.Swyx [00:08:45]: Do you wanna shout out who else was useful, just in case people are also looking.Ivan [00:08:49]: Generally we -, I looked at There's a few of podcast, different segments and different types. So there's you guys, No Priors, Bill Gurley's was great while.Swyx [00:09:04]: VG2, yeah.Ivan [00:09:05]: Yeah, while it was around. So there's a few. 20VC is interesting from a different dynamic, and some are different dynamic. But there was, also Red Points.Swyx [00:09:14]: We're not really about the compute market.Ivan [00:09:15]: It was also already - Sorry?Swyx [00:09:16]: You're, you want - You're looking at the agent infra market.Ivan [00:09:19]: I was looking at the agent market and the AI market in general and sort of understanding who are the players, what the perception, and how that goes. And like obviously you complement this with like going to conferences, going to events, going to meetups, reading white papers, like doing all the things that you have to do to understand what's happening. And so when we figured, when we sort of had an idea of what we had to build, literally over the New Year's Eve, literally on New Year's Eve, I half vibe coded the first MVP, first minimal viable product of what Daytona is today. And I went to sleep at like 3:00 AM or something like that. I was doing - I just put my like baby daughter and wife to sleep and, Happy New Year's, and go back to just, doing this. And I sent it to my co-founder, my CTO, and he saw it in the morning. He's like, “This is absolute garbage.” “Do not show this to anybody at all, but the idea is good.” And so he took two weeks, and he rebuilt it.Swyx [00:10:09]: Did it like look like that? Listen, I - It was rough idea.Ivan [00:10:12]: Oh, not even, not even close. Like it was it was way worse. But it was like a very - It was a simplistic view of what it should be. Like, it worked, but it was not ideal. And so he went, we went down the whole, which is his job as CTO, to go, and he came back with this version. We then called all the people that had said like, “This is garbage,” a quarter ago. And we set up these calls, and we gave it to - We just demoed it to everyone. And all the calls went long, every single one. They were 15-minute calls, and they all went to like 25, 30 minutes or whatnot. And everyone said, “We need, we want access.” There was no login, just an API key, ‘cause it was just a beta or an alpha. And they said, “Oh, we want access.” And we're like, “Sure, yeah. Okay, thank you very much.” But after like the next day, if we'd not send it, every single one, like every call that we did, everyone came back, “Where is my API key?” Like everyone wanted it. We're like, “S**t.” Like this is it. Like I've never felt So one, the understanding to your point was like most people thought it was the same infrastructure for humans and agents. We understood a quarter ago it's not. We just didn't know what was the right primitive. And then when we came, and we can talk about what that is, and we gave it to these people, I've never seen, I've never experienced - I've done multiple companies in my life. I've never experienced this, that people literally call you if you do not give them access. Like they want access right now. And so it's like, okay, they don't want this. the thing that they want doesn't seem to exist, or they have not found it, and they really want what we want. And then when we understood that we're onto something, and then when you think about the size of the market, like the market for human engineers and enterprise is a very large market, so think GitLab or whatnot. But the market for every single agent that will exist ever in the future is just like, what is that market? How big is that? And we're like, “We are all in on this.” And so that is where we made sort of the cut between the old product and the new one.Bare Metal, Stateful Sandboxes, and the Lambda + EC2 ModelSwyx [00:12:02]: Yeah. But it wasn't composable at the time?Ivan [00:12:05]: It was very - It was basically just a Linux box that you could change, that you could define number of CPUs, disk, and RAM. Like that is what you could do, but you couldn't have multiple operating systems, you couldn't resize it on the fly, you couldn't add a GPU, you couldn't do like all the things. It was just the, just the first sort of variation of that, yeah.Swyx [00:12:22]: Was it bare metal from the start?Ivan [00:12:24]: It was bare metal from the start. And so the interesting thing that we thought about right away, so our.Swyx [00:12:29]: Which, give people the background, what is the normal path?Ivan [00:12:32]: Yeah, so, basically most providers run this on top of VMs. And also.Swyx [00:12:37]: Firecracker.Ivan [00:12:38]: Yeah, they run on Firecracker and VM. And so we also fire - We can get - We have multiple isolation layers and we can do that. But the common way to do it is that they, one, that the state of the machine, or the hard disk is not part of the sandbox itself. And the other thing is they're not meant to last forever. So most of them are preemptible, like they can There's a time that they can live. And so our thought was when we were going into this is, agents will be like humans in the sense of you don't want your laptop to be shut down until you're done with work. Like, and you want to close the lid and open the lid, it's the same state. So you - Agents would want that, like the pause and come back. They want those two things. But also agents really want speed, right? Can they get it? So when we thought about it's like we need something insanely fast, how to make it fast, how to make it long-running, and stateful. And so those two things, it's like combining a Lambda and an EC2, right? Those two things together. And so we didn't have an idea how others did it, ‘cause we didn't know too that there was a market around this. It was more like, okay, this is what we need, what they need. And we looked at Kubernetes, it wasn't wasn't good enough for that. We looked at Nomad, it didn't enable that. And so our history in rewriting our own scheduler at CodeAnywhere is basically what my CTO came up with. Like, he's like, “Oh, the learnings from there,” and he brought it. And the funny thing is, our third co-founder, when he saw it, he's like, “Dude, what is this? This is like 2008.” Like, we went back in time, and he's like, “Exactly.” And so the reason why Daytona is like super fast, and you see this on benchmarks, is we essentially, we run on bare metal. We have our own scheduler, we use the underlying, disk, CPU, and RAM of the underlying machine, which means your IOPS are insanely fast because there's no, there's no network between an EBS or something like that. But also the snapshot, the point in time, the templates, are also preloaded on the bare metal machines. So when you fire off a sandbox from a template or a snapshot, you're essentially directed to the bare metal machine where that snapshot is based on that NVMe drive, and then it literally just turns on that machine, and it's local. There's no network latency, anything on there. And so that is sort of the specificities that we, when we're thinking from first principles, what a computer would look like for an agent, that is what we came up with, and that's what we created.Benchmarks, 60ms Startup, and 50,000 SandboxesSwyx [00:15:02]: Yeah. I should maybe, I don't know if you endorse this, but there's someone that does compute SDK, you guys do very well on there, with like the TTI, right? I. is this a, is this a is this a relevant benchmark for you guys? I don't know.Ivan [00:15:16]: I don't know, and it changes every day. So today RKL is.Swyx [00:15:18]: I don't know what RKL is. Never heard of it.Ivan [00:15:20]: Yeah. RK, yeah, so it is there.Swyx [00:15:22]: You are, at least a third of the next tier of performance, and then, there's a lot of other better-known names that are very slow to start.Ivan [00:15:31]: Yeah. We've been the number one by far for a long time, and now there's different, there's different definitions also of sandboxes, different isolation patterns, different other things. So RKL runs it literally on the S3, the data, so it's very different, and they spin up a sandbox, spin up a container for that, so it's a different type of thing. So the definition of a sandbox is something that we can all, we all need to get along with. But yeah, we're insanely fast on getting these things, up and running. And so you can see even there that it's a zero point 0.10 to 0.11, so.Swyx [00:16:03]: Close enough. Yeah. what else do you need, right?Ivan [00:16:05]: Yeah. So the benchmarks itself, so, in this, in I don't think the benchmarks equate to market ownership or revenue or anything like that. and I've seen this with multiple benchmarks, not just in sandboxes, but in general benchmarks around.Swyx [00:16:20]: It's table stakes. It's just like.Ivan [00:16:21]: Exactly. But it doesn't hurt.Swyx [00:16:22]: Just roughly check.Ivan [00:16:22]: Like you definitely have to be up there and you have to be competing so that people know that, oh, this is definitely one of the top. Because this is only one dimension of what customers look for. There's other things like how many can you spin up consecutively? There's a feature set, there's support, there's like all different things that people look at, but you definitely have to be there, on the benchmarks.Swyx [00:16:40]: How many people do people spin up consecutively?Ivan [00:16:43]: So we have.Swyx [00:16:43]: Or concurrently, is the Concurrency, right?Ivan [00:16:45]: There's three metrics that we look at. And so one is like time to spin up one, and so our time to spin up one is 60 milliseconds with network latency. So request, spin up, reply, 60, the whole thing, 60 milliseconds. That is one. But if you wanna spin up 50,000 at once, we are now at about 75 seconds. So it takes about 75 seconds to spin up concurrently 50,000. Some others, there's public data around this, like take 2,000 seconds, which is 30 minutes. Like there's different variations of that. And then there is the so it is speed of one, speed of like multiple, and then how many can you consistently have up and running. And so we basically have right now no limit to how much we can add because we basically own our own metal. But the biggest customer of ours does like about 850,000 every single day is sort of where they're, where they're just shy of a million every single day that they're running, we do have a request for half a million concurrent, which is literally half a million CPUs somewhere running. So that's an interesting.Swyx [00:17:44]: They pay by like vCPU seconds.Ivan [00:17:47]: By seconds, yeah.Swyx [00:17:47]: Or whatever. Yeah. Okay, and so and then, and the other thing is, the sleeping and the resuming, ‘cause it's all the stateful resumption of all these things, how, what kind of workload are people putting through this, right? Like how is it Do we measure by gigabytes in memory, gigabytes in storage? I don't In like network attached storage. I, what are the costly ones of, out of all these features?Workload Economics: CPU, RAM, Network, and StorageIvan [00:18:15]: The most expensive thing are CPU.Swyx [00:18:18]: Okay. Yeah, of course.Ivan [00:18:18]: The second one, yeah Then it's RAM, then it's disk. We actually don't charge.Swyx [00:18:22]: Which is snapshotting, right?Ivan [00:18:23]: No, it's actually the, snapshotting's part of it, but basically the size of your hard disk, of your machine. So do you have 10 gigabytes, do you have 20, do you have 50, do you have whatever? And then the transference of that. Right now, currently we don't charge for, network at all at Polychron.Swyx [00:18:37]: Oh, you gotta, yeah, you gotta fix.Ivan [00:18:38]: Yeah. It is very much a it's a larger and larger part of our bill, so we're working around, that part there. Obviously, that is the least, expensive, so the hard disk is the least expensive, so it's basically CPU, RAM, for us network, ‘cause we don't charge the customer, and then hard disk, is how it's split up. But there's also different types of workloads, so we basically split it up into two types of workloads in Daytona. One is what we call background agents or long-running agents. and the other is, basically RLs and evals, which I put sort of together. And so they have very different patterns of usage, and if you look at the usage of a background And I'll just name names of companies, not specifically.Background Agents vs. RL/Evals: Two Usage ShapesSwyx [00:19:21]: Yeah, open, all hands.Ivan [00:19:23]: Yeah. So like a background agent's a Cognition, a Lovable, a like all these things are Harvey. These are all long-running, background agents. And so if you look at their usage patterns, their usage patterns are similar to human, which is like follow the sun. Basically, the usage patterns of that is like noon is probably the highest, and the midnight is the lowest, and then weekends are lower. weekday is higher.Swyx [00:19:42]: Yeah, that's a fun question. How global is it? Is it very US-centric or?Ivan [00:19:46]: The US is a large part, but we have currently, we have Asia, Europe, and the US regions.Swyx [00:19:52]: So it's quite global.Ivan [00:19:53]: Yeah, it's quite global. We have it all over. It's interesting that our I talked to you a bit about this. Our number one city by user.Swyx [00:20:01]: Hmm.Ivan [00:20:02]: Is Singapore.Swyx [00:20:04]: Oh, wow. Amazing.Ivan [00:20:05]: Which is an interesting one, right? Not by revenue, just by just like by individual head count.Swyx [00:20:09]: Really?Ivan [00:20:09]: Just like an interesting thing.Swyx [00:20:10]: Singapore is, Singapore is weirdly high in the adoption charts of AI for the population. It's like an, seven, eight million population. And it's like keeps showing up.Ivan [00:20:20]: No, it's quite interesting. We were quite shocked, and I was like, “Oh, this is interesting.” And also one that's up there.Swyx [00:20:24]: There's a reason I'm doing AI using Singapore. it's because I'm from there.Ivan [00:20:27]: We're there. We're gonna, we're gonna be there as well. and it's interesting that Japan is in the top or like Tokyo's in the top, which is in all the tech cycles it has never been. It has never been, so it's quite interesting that they're.Swyx [00:20:39]: I think the Japanese just love AI. Yeah. It's that, and then it's Brazil. That's it.Ivan [00:20:44]: Brazil has always been in.Swyx [00:20:45]: I think.Ivan [00:20:46]: Even when I look, if you look at like GitHub's data and ask historically with CodeAnywhere, it was always like US, Western Europe, and then you'd have like India, Brazil, China, like that would be there. But like Singapore was not in, specifically Japan was never in sort of that top, that top.Swyx [00:21:01]: Yeah. Weird pockets.Ivan [00:21:01]: Weird. Yeah, so it's very global.Swyx [00:21:02]: Okay, so actually that, but that's helps you to distribute your load through, all time?Ivan [00:21:08]: The interesting thing is like we have those kind of loads, but if you look at the researcher loads, they're quite different. So what they are is like if you give them concurrency of 10,000 or 50,000 or 100,000 CPUs at ARMb, when they fire off a run, it's just 100%. And then it just runs, and then it stops. So it's very, the usage pattern is squares basically, right? And it's also not follow the sun, because people will fire it off at midnight before they go to sleep but then wake up and so it's very unpredictable, so you don't know where that is. So the shapes of the usage are quite different than we have had before. And also what's interesting is when it's sort of a follow the sun, even if you have a high growth company, you can sort of predict your usage patterns and have enough capacity for that, because it's sort of, it grows in a, in a way you can project. When you have companies doing sort of like evals and RL, they're super spiky. So they're gonna come in, it's like, “We're gonna use nothing, then can we have 100,000?” Right? And then go back down. And then 100,000, go back down. So it's very different, right? And.Swyx [00:22:09]: Do you want to lock them into commits so.Ivan [00:22:11]: Yeah, we do.Swyx [00:22:12]: Yeah, okay.Ivan [00:22:12]: We so we have to lock them into some sort of commits to have that capacity, because we have to have, basically we have to have the capacity for peak. Right? And so right now, Daytona's mean utilization is 15%, 1-5.Swyx [00:22:25]: Oh my God.Ivan [00:22:26]: So it's very low.Swyx [00:22:27]: Because it's very spiky.Ivan [00:22:27]: It's very spiky, but we get up to 90%. so we have these things. And so what we're, what we're looking at right now as a company is similar to Cloudflare where you can like geo move things around, but that works really well for basically the background agent where it's follow the sun. But this, it's not. Like it's a very different shape. Obviously with scale you figure these things out, but that's an interesting new problem that we have, as a compute provider in the agent space. And when we were doing the conference recently, and so we talked to like Nikita from Neon and.Swyx [00:22:57]: I should bring it up.Ivan [00:22:58]: Parag from Parallel and whatnot, everyone has the same problem. Whereas the usage is super spiky, and this is something that has not happened before, that you have these types of like it was always, it the amplitudes were not this high, right? So it's quite interesting use case and problem solve.Compute Conference and Spiky Agent InfrastructureSwyx [00:23:12]: Yeah, I don't know if we're gonna bring this up again, but let's just talk about the conference, you had like 1,000 something people at the Warriors game, at the Sorry, where is it? What's.Ivan [00:23:22]: Chase Center.Swyx [00:23:23]: Chase Center.Ivan [00:23:23]: Chase Center.Swyx [00:23:24]: I went. It was, it was very impressive. Obviously, you can, how to throw a conference, what did you learn? you put, you pulled together all these impressive names.Ivan [00:23:33]: What I.Swyx [00:23:34]: What were you looking for?Ivan [00:23:35]: My thesis behind the Compute Conference was let's bring together people that are building infrastructure for AI agents. Because when I think of what we're building, it is the agent is the primary user, what are the ergonomics and usage patterns of agents, and so we can do that. And what I found, this was a theory, it wasn't proven, is that we all have these problems, as I touched onto. And I was, as I was talking on stage, it was like we all have the same underlying infra problems, which is this spiky workloads, unpredictable workloads that we've never had before, in human, compute or human infrastructure. And it's, again, it's the same when I was talking to Parag or when I was talking.Swyx [00:24:20]: Lynn. Nikita.Ivan [00:24:21]: Lynn, Nikita. Lynn especially, I was talking to her the other day as well. Like the It is a very interesting type of problem to solve because I can touch on Cloudflare because there's a lot of like talk about that recently as to how they solve that, which is they have a bunch of geos, and basically, as users work in different places, and depending on your tier, they can move you around the geos. And so that how, that's how they get the higher utilization. But you can sort of predict these, and it's If it's something in You'll rarely get a spike that is 10 orders of magnitude. Like you'll get a like let's say one of your customers has some like an exponential curve. What is that to I'm using Cloudflare as an example. 10%, 20%, whatever it is. I don't, I don't have this data, I'm just assessing. It's surely not 10x, right? It's surely not something there. And so how do you go out and solve this problem? And we're all solving this in different ways. So we have.Swyx [00:25:11]: She also has the same thing.Ivan [00:25:12]: Yeah, I know specifically that like Neon had that issue as well. Like how are we solving these spiky loads and things like that ‘cause we talked about it. And so the interesting thing for me to actually internalize was, yes, everyone that's building for agents first is going through this, and we're all solving similar problems, which is quite.Swyx [00:25:28]: Let me let me double-click on this. Okay. So for example, Neon, I happen to know that they're very sort of S3 oriented, right? so they're just like fully bet on S3. And you get to benefit from S3's distribution and infrastructure. So I would imagine that Neon doesn't have to care, whereas Lynn maybe has to care a bit more because obviously she's doing GPU inference. And, for listeners, we did an episode with her, one and a half years ago. And you have to care. But like, right?Ivan [00:25:54]: Parag cares for sure, and Nikita.Swyx [00:25:58]: And Parag is C of, Parallel.Ivan [00:25:59]: Parallel, yeah.Swyx [00:26:00]: Former CTO of Twitter.Ivan [00:26:01]: Twitter, yeah.Swyx [00:26:02]: They are the search.Ivan [00:26:03]: Yeah, they're search, yeah.Swyx [00:26:03]: I You and I know but the listeners don't know.Ivan [00:26:08]: Yeah, we can put it down in the screen, and so ‘cause we, when we were talking.Swyx [00:26:11]: I'll put it up on the, on the screen.Ivan [00:26:12]: Yeah, right.Swyx [00:26:12]: People can look it up if they need.Ivan [00:26:14]: Look it up. And, yes, but they still have CPU and RAM, allocation that you have to have up and running. And so CPU and RAM, you have to allocate that and have that ready. And so there's basically two ways to do it. One is you either over-provision and you can handle the bursts, or two, you basically have, I don't know if this is a term, just-in-time compute, which is like as your load becomes, as your usage comes in, you can fire off requests for VMs or bare metals at other cloud providers and then get them up and running.Swyx [00:26:43]: This is if you go above 100%, right?Ivan [00:26:45]: Yeah, this is.Swyx [00:26:46]: Like your overflow.Ivan [00:26:46]: If your overflow, like spillage or whatever you do.Swyx [00:26:48]: You probably lose money on it, but it doesn't matter, right?Ivan [00:26:50]: It, not Well, you might, you might not That is a more cost-effective way to do it but it's a slower way to do it. Because basically what you have to do is you have to like queue your requests, spin up these just-in-time compute, get it all ready, provision it, and then get your workload there. And so if the time isn't important that much, that's fine, and you can do that. But if your customer, and especially for, let's say, the RL training runs, the reason why a lot of people come to us is because GPUs are more expensive than CPUs, right? So you want your GPU running at, what, 100% the entire time. And so when you're running runs on CPUs, when the when the CPU cycle is like down and spinning up the next one, you want that to be instantaneous so that your GPU doesn't go down, right? And if you then have to like go out and provision machines, you're essentially telling the GPU that it has to wait, and that's incurring our cost. So there's things that you have to try to solve for there.RL Workloads, Declarative Images, and Kubernetes ReplacementSwyx [00:27:43]: Yeah, let's talk about the different workload, right? You said that, what was it? A few months ago, you had zero RL workload and now it's 50%.Ivan [00:27:52]: It will be this one, 50%, yeah.Swyx [00:27:54]: Let's talk about how different it is, right? Like I imagine, for example, a lot less dynamic code generation of like arbitrary code. Like here, it's probably all the same code. You're just doing parallel runs or something, I don't know.Ivan [00:28:05]: Yeah. So you'll have multiple Depends on the like for each run, you'll have a snapshot. And they, for the most part, they actually do use our declarative image builder, which is like, “Oh, we, the agent wants these dependencies, these env vars.”Swyx [00:28:17]: These ones, yeah.Ivan [00:28:18]: Yeah, the declarative image builder, it.Swyx [00:28:20]: Which is a very modal like thing that they.Ivan [00:28:22]: Yeah. And so we build it on the fly and then we propagate that snapshot, and you can spin up as many sandboxes as you want against that snapshot. And then if you have to do changes, the model can, or like it could be also be automated. It's like, “Oh, now for the next run, we need to install these things or remove these things or whatever to get, a task done,” and then it goes off and runs that. So yes, that is something that it seems that they prefer. The number one reason I found, or should I say, let's take a step back. What we are competing against in that environment is essentially managed Kubernetes. So EKS, GKE, whatever. That is what the vast majority run on. And anyone that has tried Daytona versus GKE, EKS is like, “I'm never going back.” That has always been. There's a few reasons. One is the ergonomics. So if you have, if you're using Kubernetes to spin that up, you have to essentially manage the interface interactions with that. Daytona, although as a compute provider, it's more akin to a Twilio and Stripe from a consumption perspective than it is an AWS. Like you have an API, an SDK, it's quite like easy and seamless to get these things up and running, that's one. The other is the speed to which we spin up, which we mentioned earlier, which is much faster, and the scale to which we can go to. We haven't got into features, but an interesting feature is that it's very hard to OOM, or out of memory, our sandboxes, because we can dynamically on the fly.Swyx [00:29:48]: Resize.Ivan [00:29:49]: Resize, which is like impossible on almost any other thing. There are some technologies that enable you to do that, but it's like a very hard thing. And so we actually saw this when, the Terminal Revenge team is, brought us actually. So thank you, Alex and the team, that brought us into this whole space.Swyx [00:30:05]: It's just very rare that, a framework would just say, “Guys, just use Daytona.”Ivan [00:30:11]: Yeah, I think it says it somewhere. Yeah.Swyx [00:30:13]: Yeah. I was like, “What is this?”Ivan [00:30:15]: There's all, there's multiple there, but they also mention a few other places. and so Daytona specifically-We have, the, just jumping on themes here We, I don't know where it says Data Center.Swyx [00:30:27]: I, there.Ivan [00:30:27]: Doesn't matter.Swyx [00:30:28]: There's a very strong recommendation, which is, very unusual. Which is, it's.Ivan [00:30:33]: We do not pay them for this, just.Swyx [00:30:34]: I know, yeah. They just like you.Ivan [00:30:35]: Yeah, they like us. yeah, and also a thing, so, Data Center has multiple isolation sets underneath. The customer doesn't have to know what they are. But basically we have Docker, which is a container, that's hardened with Sysbox. So it's Docker's, isolation that is a security equivalent to a VM, but it's still a container. And that is the default, and they, especially in these training workloads, really like that as an interface to be able to use just a basic Docker container, and we enable Docker and Docker. Which for these RL runs, if you need to do a Docker compose or Kubernetes, you can spin up a K3S inside of these things, which unlocks a huge amount of workloads that you can do that you cannot do on other providers. So just on that part is much more interesting. And so we went that, through that. We showed them that we could do that, and they enjoyed that quite a bit. They being the general venture people.Swyx [00:31:28]: Those people, yeah.Ivan [00:31:29]: And Harbor people.Swyx [00:31:29]: Harbor people, do are they, are they a company yet?Ivan [00:31:33]: As far, I do not know.Customer Pull, Slack Connect, and the Computer Use BetSwyx [00:31:35]: Okay. All right. Yeah. It's like super obvious that like, there's a lot of excitement and success around these things, okay, so yeah, tell us more, right? Like, this is an exploding workload, Harbor adopted you, which helped speed things along. But what are you learning as this new workload comes online?Ivan [00:31:53]: There's a couple things that we learned, which we chat about in the beginning. We, and this has led our story, as we mentioned, we like talked to a lot of customers along the way, and we add more features and more tool sets as we talk to customers. And it's interesting that And I think it's that the ecosystem is so small and/or the models get smarter, where when we see one user come with a request, we know it goes on a roadmap if like three to five customers come with the same request in that week. It's like very bizarre. It happens so many times, which is.Swyx [00:32:27]: Because they're all friends.Ivan [00:32:28]: Sorry?Swyx [00:32:28]: They all, they're all friends. They're all in the same group chat.Ivan [00:32:30]: Yeah, probably, yeah. ‘Cause and they're like, “Oh, can you do this?” And I'm like, “Okay, this is interesting. We'll put it on a feature request.” And then the next one's like, “Oh, can you do this?” “Okay.” It's all the same, right? It's always the same. And so what we try to do, and I personally try to do, I try to be on as many call, quote-unquote “sales calls” I can. I'm in every Slack channel. We literally have about 1,000 Slack Connect channels, something like that. It's an interesting, there's so many interesting things you find out when you have all the Slack channels. You can also see where people, transfer between companies. You see leave Slack channel, enter Slack channel. It's an interesting thing. Also, just I digress, I feel that Slack Connect is literally LinkedIn what it should be. You have a list.Swyx [00:33:08]: LinkedIn charges you to, use your own connections, but Slack doesn't, right? Slack is like, do it for free. It's more lock-in. It's great.Ivan [00:33:15]: Yeah. It's amazing. Yeah. It's one of the reasons.Swyx [00:33:17]: You're gonna pay Slack for life.Ivan [00:33:18]: Exactly. You're there for life. So that's interesting. And so one of the things, the newer things we were talking about earlier is we made a big bet and put a lot of investment on computer use. that is not seen publicly the light of day. We haven't GA'd that yet, but we have.Swyx [00:33:32]: Is there a thing I can pull up?Ivan [00:33:33]: There is computer use there. It's right up a bit.Swyx [00:33:36]: Oh, yeah. Okay.Ivan [00:33:38]: What we have, what we talked about and what we've seen publicly is there's this theme now about, the human emulator where And Elon from XAI has talked about this publicly, and if you think about the models today, they're actually quite sophisticated and they can do a lot of work, but they still don't have access to all the tools. Like, I'm a strong believer that the most efficient way for an agent to work is essentially headless or through, terminal or whatnot. But if we, if we look at knowledge work in general, there's about 100 million knowledge workers in the US, about a billion in the world, and knowledge workers, and the salaries of them aggregate to 10 trillion in the US 50 trillion worldwide.Swyx [00:34:24]: Wow.Ivan [00:34:25]: Something like that. And if we look at, the five most important sectors of that, so like healthcare and government and financial services and whatnot, that's about 56% of that. So let's say it's about half of that. So in the US it's about 25 trillion, and most of them, most of that work is actually still locked into legacy apps inside of Windows, which is not going anywhere for a very long time. Like, people just won't invest in that. How much of it? our assumption is the following: if, in the RPA market, which is similar market, well, not the same 25% of, these white collar, workers', work is automated. If an agent is more sophisticated, can go through more runs, figure stuff out, let's say it's, 40%, right? And so if you take 40% of that, you get to essentially, $10 trillion a year.Swyx [00:35:17]: That's a TAM.Ivan [00:35:18]: That is a that is a TAM. So that's the TAM of the models, right? That's not our, essentially ours. But you get to that size, and to be able to do that, you essentially have to give agents these computers with the legacy. So computer use, either Mac or Windows or Linux. Linux we also obviously have and others have. But Windows specifically is something very new, and the only option right now is an EC2 with, Windows or on Azure. Both of them take anywhere from three to five minutes to spin up. We've created an actual sandbox, so it's a second instead of milliseconds, but you have, point in time snapshots, you have, forking, you have all the things that you have from a sandbox, but essentially enables you to hopefully unlock all this value. And so that's been our big push and bet, but we've sort of, kept our ear to the ground. What is sort of the next things in the market?RPA Returns: Why Agents Still Need ComputersSwyx [00:36:06]: Yeah, knowledge work, and building, and sort of RPA, the next wave of RPA. I got very excited about RPA kind of during COVID times. The UI path was IPO-ing. And it was, a very hot Isn't it, Eastern European?Ivan [00:36:20]: It is, Romanian.Swyx [00:36:21]: Romanian?Yeah, it might be the only Romanian, big unicorn okay, yeah. This I don't I don't, I don't have like a I think there's, I think there's a stage being set for the resurgence of RPA, ‘cause everyone understands that, yeah, no one wants to deal with these shitty apps and no one's gonna rewrite them. Like, you just have to do, a remote operation and programmatic operation of them.Ivan [00:36:45]: If you wanna unlock it, my own setup was basically the following. So I was doing a board deck recently, last month, whatever, and I'm like, “Okay, let's just, let's just do automated.” So, all our data's in, ClickHouse and PostHog and QuickBooks, where everyone else's is, and I'm basically, connected that all to, my Cloud code, like go off and go Cloud code whatever. Go off and, here's the integrations, go do that. It pulled out the first report, which was great. It connected to Brex and all these things, pulled it, which was great, and then I say, “Okay, now pull out this, and this,” and I kept getting, really well McKinsey-style design reports, but the data said partial data. all the missing data, partial data. Like, it can't access all the things, and I got so frustrated, and so I got, I got, my Mac Mini virtual sandbox with OpenClaw. I gave it its own account in our company, and then I went to all these services and created a read-only account, so literally like an intern in your company. And so I would say, “Now go and do this report,” and it would get the same, or like, “I can't via the MCP or the API or whatever. I can't get all the information.” I'm like, “Go log in.” And it will log into the website, then go in, export the data. It'll export the data and do the thing end to end. So even for things that have today APIs, not all of it is exposed, and I to get value, I get immense value right now, but it has to be a computer usage, unfortunately, and so I spend a bunch of tokens just on that, but I get the job done. And so if even a startup like ours, and using all the hottest tools, still needs a computer agent what hope does, Goldman have to have a headless, right?Swyx [00:38:22]: Yeah, what a - Why isn't Microsoft doing this?Ivan [00:38:27]: I'm pretty sure, Satya had a post yesterday.Swyx [00:38:29]: Oh, okay. I see.Ivan [00:38:29]: Which was like, “Every agent needs a computer.”Swyx [00:38:31]: I see, I see.Ivan [00:38:32]: So they have launched something recently.Swyx [00:38:34]: Yeah, they have Microsoft Power Automate, I'm sure, I'm sure, they're gonna have their version.macOS Sandboxes, Apple Constraints, and the Windows OpportunityIvan [00:38:39]: Version of that, yeah.Swyx [00:38:39]: You're gonna try to do yours, and it - I always know there's always demand for Mac, but I know it's, tricky to host, macOS sandboxes.Ivan [00:38:49]: We will have macOS sandboxes fairly soon. The problem with macOS, OS sandboxes is, I'm deep in this, I don't know how much interesting is.Swyx [00:38:55]: No, it's.Ivan [00:38:56]: MacOS has this problem.Swyx [00:38:57]: It's a licensing thing, right?Ivan [00:38:58]: Licensing thing. So one, you're allowed to run only two parallel VMs per machine, so that's one. Two, you can only license to a different user every 24 hours. So if you come in and theoretically, if I wanna charge you per second and I charge you one second, I have to have it idle for the rest of the day. I can't have anyone else doing that. So the pricing will be different in the sense that I will have to - we would have to charge for 24 hours, and that's not even, that's not even the most difficult thing. But the, thing above that is, from a security perspective, they enable you to do memory snapshot, pause, resume, but only on the same physical drive, physical machine. And so what you can do in, Windows world or Linux world is that I can move in the background, your snapshot from one to the other and manage load, right? Here, if you wanna do that, you essentially have to have your.Swyx [00:39:49]: Yeah, snapshots. Yeah.Ivan [00:39:50]: Your.Swyx [00:39:51]: It's like.Ivan [00:39:51]: Physical machine.Swyx [00:39:52]: You can't break it up.Ivan [00:39:53]: You can't, you can't move things around that, and all of that is, that part is, from a security standpoint, if it is written. Like, I understand the security aspect of that, but it disables you from doing these agentic, like really scalable agentic workloads.Swyx [00:40:08]: You need to do a vibe-coded, clean room implementation on macOS that you can then - That's like Clean OS or something. I don't know.Ivan [00:40:17]: So. We have.Swyx [00:40:18]: ‘cause like Linux was originally like a clean room rewrite of Unix.Ivan [00:40:21]: Okay. Yeah.Swyx [00:40:21]: Or something like that, right? Like same thing to macOS. Someone needs to do it.Ivan [00:40:25]: Someone will do that, and someone will have some long-running agents for a few days to figure this stuff out. But yeah. So definitely we - we're really close to offering something ‘cause people do want it, but the pricing will be different, and the feature set will be sort of stringent.Swyx [00:40:38]: Yeah, nobody's gonna use this. like, the labs, the labs will because they want to automate macOS.Ivan [00:40:42]: They have to do RL. They have to do RL again. But even if you The - So the point is with the RL part, if you, if you do RL on macOS, then the next iteration of the model comes out, it will be able to use these tools significantly. Then you actually need to run those, that somewhere. So you're gonna have to have that, later on. And from, if anyone at Apple is listening, I very much feel that they are shooting themselves in the foot of the scale of the revenue of compute or licensing they could get if they would just enable a concurrency model similar to what you can get on a Windows and a, and Linux.Swyx [00:41:17]: Yeah. Yeah. And I'm sure they've heard this before. They just don't care. Yeah, it's And maybe they will change their mind with the new CEO.Ivan [00:41:24]: Yeah. We'll see.Swyx [00:41:25]: We'll see.Ivan [00:41:25]: High hopes.Swyx [00:41:26]: High hopes.Ivan [00:41:26]: High hopes.Swyx [00:41:27]: Okay. But I, it's very clear the market opportunity is huge in Windows, and you can go for a long time on just Windows, but your customers are gonna want both. and I think, it is interesting to me that, this is the sort of God application of agents, right? Like, I don't It was - How big was OpenClaw for you guys? Like, was it, was there, a significant bump.OpenClaw, Agent Labs, and the B2B2C Sandbox MarketIvan [00:41:54]: Not for us because we.Swyx [00:41:54]: Because you already.Ivan [00:41:55]: We're kind of positioned differently. Whereas although it's completely PLG and we have individual developers that use it, most of the users that use Daytona are sort of a B2B2C. Sort of it's either B2B or B2B2C. So, in the researcher world, it's B2B, so you're selling to, labs and neo labs and things like that. But on the long-running agents, it's mostly, from a scale revenue perspective, it's mostly B2B2C, where you have a app layer agent that uses you at a big scale.Swyx [00:42:26]: Like a Manus. Yeah.Ivan [00:42:28]: Like a Manus Lovable type of thing.Swyx [00:42:31]: Yeah. I think that's the question of, well how, um-Uh, yeah, B2B to C is basically to me what I've been calling an agent lab, which is kind of like you're not in a model lab, but you're making a very good wrapper that is a platform that other people can sign up so they don't have to code those things. Yeah, it sound, it sounds like a much better market than the direct OpenClaw market.Ivan [00:42:56]: I've like - We I've done multiple things. So the CodeAnywhere's part of our career path R in the calendar, was very much an end user developer product. And so that is great. It You can get a lot of developer love, and I feel that we do as a company have a bunch of developer love. But it's a different type, where it's people building these things. Again, it's more akin to a Twilio because you don't really run - As a person, you wouldn't run Twilio. I don't know how many people remember. It was like ask your developer billboard and whatnot. And people really love Twilio, but they only used it inside of like, “Oh, I'm building this app or service for thing.” And so we're very much directly to that. And you also know that I used to work for a competitor for Twilio, so it's kind of ingrained, in my DNA.Swyx [00:43:35]: People don't know InfoBip is that big.Ivan [00:43:38]: Yeah, it's.Swyx [00:43:39]: Because.Ivan [00:43:40]: It's a billion euro.Swyx [00:43:40]: They're all American. They're like, “Whatever's in Europe doesn't matter to me.” But like it's the, it's the same size or bigger? Same size?Ivan [00:43:46]: It's about half the size.Swyx [00:43:47]: Half the size?Ivan [00:43:48]: Yeah, about half the size.Swyx [00:43:48]: It's like, yeah.Ivan [00:43:48]: Still huge. Multiple billions a year. Yes.Swyx [00:43:51]: That's crazy.Ivan [00:43:51]: Exactly, and so that - These are like really interesting and large revenue-generating, very sticky businesses. Whereas when you're selling to the - When your focus is the end developer, it is a very hard sell because they're very price sensitive, very price conscious, very around that. And there's very It's very hard to scale. Your cap is the number of people that are willing to spin up - First of all, wanna spin that up, and then spin up multiple of these. Whereas if you're in the enterprise one, like we know everyone's talking about like how many tokens they're spending, I'm spending. Like a lot of companies today are like, “If this is our company, spend as much as you can.” Like basically that is where we're going. And so if you think about that paradigm, where you're selling to companies that say, “Spend as much as you can to generate, productivity,” versus, “Oh, I'm a single person. I have this much budget, and I'm doing this thing because it's fun or it's helping me out or whatever.” Like it is a different, it's a different go-to-market, I think, strategy.MCP, CLIs, and Sandboxes as the Agent RuntimeSwyx [00:44:50]: Yeah, there's a lot of discussion. I'm just kind of going through like the mental list of things that are in your favor, which is, for example, MCP versus CLI. Like obviously you want CLI. It's been very good for you. I feel like it's maybe a drop in the bucket or maybe it's huge. I'm just checking whether it's like these are big trends.Ivan [00:45:10]: Those things you - work well in our favor, to your point just because every.Swyx [00:45:13]: They're kind of drop in the bucket, right?Ivan [00:45:15]: I think it's like sort of all the things come together. And so there's so many things that impact that. To your point, like OpenClaw wasn't huge for us, but like having the agent SDK, from Anthropic, so or Cloud Claude Code was very interesting. The reason why it was interesting is that a lot of, let's call them app I don't know what to call them, app layer agent companies, essentially they are like, “Oh, I can create this new app, this new agent. All I need, I just use Claude Code, and I throw it into a sandbox, and then I have my interface to the human to that.” And so that enabled so many more companies to actually offer this, and then they would pull on sandbox. So that was, that was interesting. And to your point, like MCP, versus the CLI, the MCP is an interface against an API, whereas the CLI is like you can actually go do things. Like this is it. The difference between integrations and actually running scripts or data or analysis against a thing. So being able to use a CLI very well enables the agent to do more things, and it's because that people will invoke a sandbox, they'll run it in the CLI, and but it'll do anal-analysis on that data and then give you an actual result versus just, pulling data from an API source.Swyx [00:46:29]: Yeah, it's a layer of indirection basically, it's the same thing as agentic search versus RAG, which where you're.Ivan [00:46:34]: Exactly, yeah.Swyx [00:46:34]: Just like you just win whenever people put more agents into their workflow. And so like it doesn't really matter, but I'm just kinda teasing out like what else have people heard about that like it's sort of, “Oh yeah, this is another sandbox use case. Oh yeah, that's another one.” Am I, am I missing any big ones?Ivan [00:46:51]: The thing, the thing that people, which is the computer use stuff, which I think is probably the most interesting one, is, and to your point, we've talked to so many people over the last year. It's like, “Oh, like why do you need a sandbox? Why do you need this? Why this?” And to your point, it's like, “Oh, I need sandbox for this. I need sandbox for that. I need sandbox-” It's like, “Oh, I need it for every single thing.” And so basically what I, what I - and it sounds like a broken record, it's like you use a laptop every single day, right? And you are n of one. It's just you. But now imagine how And by the way, the laptop, the computer PC market, the PC market is about equal to the cloud market in total. So it's about 150, 180 billion a year. Something like that. It's about roughly the three cloud hyperscalers is about equal to like Apple, HP, Lenovo, whatever, It's a little bit less, but it's sort of like that. And now imagine And that's just like, so how big is the addressable market? What, how many people are there in the world now? What's the last data?Swyx [00:47:45]: Let's call it eight billion.Ivan [00:47:46]: Eight billion. And so let's say you can have two computer, like you have one personal and one business, whatever. Like so it's double that, right? and so that's 16 billion, right? How many agents are gonna be running in two years, in 10 years, in 100 years? Like And for every single task, they will need one of these. And so how big is that? That market is essentially quote unquote “infinite”. You will get to the point, and Dylan Patel was at the conference talking about, from SemiAnalysis, that talks usually about GPUs, was also talking about how CPUs will now be a bottleneck because it will be the constraint. You won't be able to grow, or we won't be able to have enough of these because there won't be enough CPUs to basically do.Swyx [00:48:23]: Yeah. Well, I actually had a really good podcast with Doug Oliphant, who, which was his president at SemiAnalysis, where they've basically been like, yeah, it's been a GPU shortage first, but then it's cascaded down to memory and now to CPUs.Ivan [00:48:35]: CPU, yeah.Swyx [00:48:35]: It-What's next? So networking. So, networking actually has been in shortage for a while if you're looking at, just GPU networking. But, yeah, it's really crazy the amount of computer use that's going on, yeah, cool. I, other questions are, just the one very big part is the open sourceness which you didn't have to do, your competitors don't do, like it's not, a lot of people are worried about keeping their projects open source because some competitor can just slot fork it. I don't know if there's any reflections on just being an open source company.Open Source, Trust, and Enterprise ProcurementIvan [00:49:15]: Yeah. There's a bunch. So we the original product that we did was open source.Swyx [00:49:19]: Yeah. CodeAnywhere.Ivan [00:49:20]: So doing that was actually very good for us. There's basically a saying of, What's the saying? Like, companies that are, that are doing really well, measure themselves against, free cashflow, that are kinda okay, it's EBITDA, then, it's, it goes all the way down.Swyx [00:49:36]: The worst is like GitHub stars.Ivan [00:49:37]: GitHub stars. GitHub stars are the worst, yeah. So you go all the way down to GitHub stars. And so our original one was GitHub stars. That's what we talked about, we're at the point we're talking about revenue, so we're we've gone up the stack on that. And so we started.Swyx [00:49:47]: No, profit.Ivan [00:49:48]: Yeah. We haven't, we're, we'll get there. We'll get there. But basically at that point we did stars and GitHub and it was useful, and the original variation that we did, it we split the core into its own repo and it was Apache 2.0, so very, permissive. And then we basically would bundl

Castle Super Beast
CSB373: Never Reheat 2 Year Old Beef

Castle Super Beast

Play Episode Listen Later May 20, 2026 156:53


Download MP3 | Watch Video Episode Full Timestamps: https://docs.google.com/document/d/e/2PACX-1vT44TUsVZDuKxJqgbhZrh6_hEVMU02wcpfzsQ_7Rfei8DkTcgVVBO3S6sKmPIS8v3-gY5vb0P1CDeeJ/pub The Hardest Card Game I've Ever Played  ONE HUNDRED DOLLARS?!  Here's The Thing: It's a Bad Version of The Thing  Pokemon Cards Anti-scalping Tech: Answer Our Riddles Three  Seeing The Matrix on Opening Weekend HIT DIFFERENT Watch full episodes: https://www.youtube.com/@CastleSuperBeastArchive Reggie In The Lab Limited-time Plushie only available this month! https://www.makeship.com/products/reggie-in-the-lab-plushie - Visit http://drinkag1.com/SUPERBEAST to get a FREE AG1 Flavor Sampler and a bottle of Vitamin D3+K2 in your AG1 Welcome Kit! - Head to http://factormeals.com/castle50off and use code castle50off to get 50% off and free daily greens per box - Sign up for your 1$-per-month trial today at http://shopify.com/superbeast - Invincible VS is out now on PlayStation, Xbox, and PC. Docket: PS5 Class Action Lawsuit Targets Sony Over Price Hikes Jess Cox - This Is A Genius Way To Prevent Scalping PlayStation Has Started Revealing Public Player Counts - Insider Gaming Microsoft has confirmed that Windows Update has been downgrading newer GPU drivers that users install manually from Intel, AMD, or NVIDIA websites. Marvel Tokon: Fighting Souls Was Almost A 1v1 Fighter PlayStation CEO Hermen Hulst says single-player Sony games won't come to PC going forward Random Avatar Matches and Avatar Arcade, two brand-new modes for Street Fighter 6, will be added on May 28 alongside the release of Ingrid GenAI In Games? Most Players Just Don't Care, Study Finds Capcom Execs Still Excited About GenAI, Is Still Ramping Up Hiring  

Saxo Market Call
Pivotal days for tech and AI stocks, FX and rates

Saxo Market Call

Play Episode Listen Later May 20, 2026 27:26


Today, markets are looking pivotal across the board ahead of the single biggest earnings report of the quarter as Nvidia reports after the market close today. Is the AI chip focus shifting a bit more toward inference and away from GPU's? Meanwhile, in the background, US and global yields have pressurized focus on broader equity market valuations and even FX is trying to come a bit more alive here on the rates focus. And that's all without considering the ongoing headline risk from the Hormuz Strait and Iran war. This and more on today's pod, which is hosted by Saxo Global Head of Macro Strategy John J. Hardy. Links A radically new commercial and military airplane concept, the JetZero Z4 is getting serious funding for actual production and is set to break ground on a production facility next month. AI radio stations, DJ and all - listen at your own risk, listeners, or rather, biological processors. A very funny and fascinating experiment pitting four of the top LLM's against one another. WSJ covers the declining popularity of AI in the US- a slight clash with the scale of growth currently priced, no? WSJ with another piece, this once on US juries and rising lack of trust in the US justice system as well as the points of view of others on the jury. The implications of the "post-truth" society we live in - how can institutions every find renewed trust?  Molten lead nuclear reactors coming to Sweden? It's not as crazy as it sounds. There is even an Oklo angle with this Swedish company Blykalla. About twice per week, you will find links discussed on the podcast and a chart-of-the-day over at the John J. Hardy substack. Read daily in-depth market updates from the Saxo Market Call and the Saxo Strategy Team here. Please reach out to us at marketcall@saxobank.com for feedback and questions. Click here to open an account with Saxo. Intro music by AShamaluevMusic DISCLAIMER This content is marketing material. Trading financial instruments carries risks. Always ensure that you understand these risks before trading. This material does not contain investment advice or an encouragement to invest in a particular manner. Historic performance is not a guarantee of future results. The instrument(s) referenced in this content may be issued by a partner, from whom Saxo Bank A/S receives promotional fees, payment or retrocessions. While Saxo may receive compensation from these partnerships, all content is created with the aim of providing clients with valuable information and options.

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Take the 2026 AI Engineering Survey and get >$2k in credits and AIE WF tickets!This was recorded before Railway suffered a major GCP outage on May 19, despite being a multi-AZ, multi-zone mesh ring, with HA fiber interconnects between their Metal GCP AWS, because workload discoverability was unintentionally still tied to GCP. All has been resolved with a post-mortem.Railway did not start as an AI infrastructure company.It was founded in 2020 years before agents became the default way people thought about deploying software. Jake Cooper, formerly at Bloomberg and Uber, started Railway with a simple obsession: the activation energy to ship something to production should be near zero. Push code, get a URL, iterate. No Docker files, no Kubernetes manifests, no Ansible scripts stacked on Ansible scripts.For years, this was a slow grind. Railway spent its first 18 months hand-acquiring its first 100 users with Jake personally greeting every Discord signup on a second monitor.Today, Railway has raised $124m and is growing very fast. A 35-person team supports 3 million users, adding roughly 100,000 signups a week. Their bare metal data centers have a 3-month payback period vs. renting in the cloud, with 70% margins funding aggressive cloud bursting when needed. The servers they own have actually appreciated in value as RAM prices have climbed basically meaning the value of their hardware now exceeds the capital they've raised.From rebuilding Railway's network overlay over a weekend to moving the vast majority of workloads onto its own bare metal data centers, Jake Cooper is trying to build a new cloud for an agent-native world. In this episode, Railway's founder and “conductor” joins swyx and Alessio to unpack why the next era of software infrastructure is not just “Heroku but newer,” what agents need that humans did not, and why the old deployment loop of Git, PRs, CI/CD, and static cloud resources may be heading for a rewrite.We go deep on Railway's infrastructure stack: own-metal data centers, three-month cloud payback periods, cloud bursting, data center debt, Railpack, Nixpacks, Temporal, feature flags, Central Station, content-addressable filesystems, agent-safe production forks, and why the CLI may become more important than the canvas in an agent world. Jake also shares the founder journey behind Railway, how the company survived losing $500K/month, why it now serves millions of users with only 35 people, and why he believes the pull request is dying.We discuss:* How Railway went from a slow six-year grind to adding 100,000 users a week* How Railway thinks about agents as the next dominant software species* Why agents need version control, observability, compute, storage, and orchestration at 1000x scale* The economics of Railway's own-metal data centers and three-month payback* How Railway uses cloud bursting while scaling its own infrastructure* Why data center debt can be a better tool than venture debt for infra startups* Central Station, Railway's internal system for clustering customer feedback and incidents* Why responsible disclosure and over-communication matter for platforms* Why feature flags, progressive rollouts, and shadow traffic are essential for agents* Temporal's strengths, pain points, and why workflows matter for agents* Railpack, Nixpacks, Nix, and lazy-loaded content-addressable filesystems* Why “cattle, not pets” may change if you can clone the pets* Why Railway is building a new cloud from scratch instead of copying hyperscalers* The solo founder path, focus, writing, and how Jake thinks about company buildingRailway:* Website: https://railway.com/* X: https://x.com/RailwayJake Cooper:* LinkedIn: https://www.linkedin.com/in/thejakecooper/* X: https://x.com/JustJakeTimestamps00:00:00 Introduction: What Is Railway?00:02:07 Jake's Path to Railway00:06:13 Railway's Six-Year Growth Story00:08:52 Rebuilding the Business After the Free Tier00:11:17 Agents as the Next Software Platform00:13:29 Railway's Infrastructure Philosophy00:15:42 Bare Metal, Cloud Economics, and the Compute Crunch00:17:22 Cloud Bursting and Five-Cloud Networking00:20:20 Data Center Debt and Infra Financing00:23:31 Data Centers in Space00:25:24 What Agents Need From Infrastructure00:28:24 CLIs, Canvas, and Agent-Native UX00:35:15 Central Station, Incidents, and Responsible Disclosure00:40:30 Safe Rollouts, SRE Agents, and Production Forks00:45:00 AI SRE, Specs, Code, and Tests00:48:24 Self-Replicating Infrastructure and the New Serverless00:53:18 Heroku, Temporal, and Workflow Engines01:04:07 Railpack, Nixpacks, and Lazy-Loaded Filesystems01:06:01 Coding Agents, Token Spend, and Roadmap Acceleration01:10:56 The Pull Request Is Dying01:12:28 Feature Flags and the Agent-Era SDLC01:16:15 Cattle, Pets, and Cloning Machines01:19:29 Solo Founder Lessons01:24:12 Focus, GPUs, and Building a New Cloud01:28:20 Closing ThoughtsTranscriptAlessio [00:00:00]: Hey, everyone. Welcome to the Latent Space Podcast. This is Alessio, founder of Kernel Labs, and I'm joined by Swyx, editor of Latent Space.Swyx [00:00:10]: Hey, hey, hey. Today we're in the studio with Jake Cooper of Railway.Alessio [00:00:14]: Conductor of Railway.Swyx [00:00:15]: Conductor at Railway. Yeah.Alessio [00:00:16]: Choo-choo.Swyx [00:00:17]: Do you actually have that anywhere, like on your business card?Jake [00:00:20]: We call some of our volunteer moderators conductors. I don't have a business card. We're not that big yet. At some point I will. I got handed a nice business card from the Supermicro folks, and I was like, “Damn, this is pretty official.”Swyx [00:00:30]: Business cards are coming back.Jake [00:00:32]: They're cool. They're hip. The conductor thing is good. We're trying to figure out what we want to call each other internally. Some people think it's super cringe and say, “You don't need a name for people internally.” Some people want to call each other something. We still don't have a really good one.Jake [00:00:55]: We've got New Railcrews, Trainiacs. Nothing has stuck yet.Swyx [00:01:00]: I like Trainiac. Trainiac sounds good. Railwayians. For those who don't know, what is Railway? Let's give people a crisp definition up front.Jake [00:01:09]: Railway is the easiest way to ship anything. You go to the canvas, or you talk with Claude, and you say, “Deploy a Postgres instance, deploy my GitHub repository, run this code,” and you're off to the races.Swyx [00:01:22]: You've got a nice animation on the landing page.Jake [00:01:24]: Thank you. None of my work, by the way. They don't let me touch the design stuff anymore.Jake [00:01:25]: We want to make it trivially easy not just to deploy things, but to evolve applications over time. Most tooling right now stacks entropy on top of entropy: Docker, Kubernetes, Ansible scripts, and all these other things. If we can version all of your software and keep track of all the changes, then we can make it trivial to clone environments, fork into a parallel universe, get copies of production data, get copies of any services, make changes, validate them, and collapse them back in without reproducing everything across a staging environment.The Railway Origin Story: From Uber Systems to a New CloudSwyx [00:02:07]: I was looking at your background: Bloomberg, Uber. Nothing immediately stands out as, “This guy is going to found the next great platform as a service.” What prepared you for Railway?Jake [00:02:21]: It was curiosity to keep going deeper. I started out on front-end stuff, working on Wolfram Mathematica and porting it over. Then I briefly moved to Bloomberg, then toward Uber and distributed systems, taking the Jump Bikes systems and moving them to a distributed system built on top of Cadence, the pre-Temporal Temporal.Swyx [00:02:44]: Which, by the way, I'm happy to talk about, pros and cons.Jake [00:02:48]: Totally.Swyx [00:02:51]: But let's do the Railway story.Jake [00:02:52]: It has been a continual step of wanting an experience. Whether it's walking up to a bike, unlocking it, and having it work frictionlessly, or something else, the depth required to make that happen follows from the experience. A lot of the work I do, and a lot of the team does, is in service of that experience. We fundamentally don't care how deep we have to go. We will swim to the bottom of the swimming pool to get the experience.Jake [00:03:17]: I don't have a physics PhD. I did an EECS degree. It has always been about figuring out the next step: how do we get there? That's what led to starting Railway for that experience and then moving all the way to bare metal data centers. I was adding patches to the kernel this week to get the experience there because I can see how much better it can be.Swyx [00:03:49]: Other patches to the Linux kernel this week?Jake [00:03:51]: Yeah. Not upstream. Our fork.Swyx [00:03:52]: That's a flex. Railpack? No, this is different. This is the OS on top of Railpack?Jake [00:03:57]: No, this is an actual kernel patch. It's always literally: what do we have to do to get that experience? Then figure it out. Anything is figureoutable.Swyx [00:04:10]: Would you send the patch upstream, or does it not fit other use cases?Jake [00:04:13]: Maybe. We have to work out the experience internally. It has to do with the storage layer we're building for some of the agentic stuff. Maybe it'll be useful upstream, but it's deeply useful for us internally.Open Source, Forks, and Non-Deterministic VersioningSwyx [00:04:29]: You mentioned open source before. How do you think about starting from open source, and then coding agents letting you do a lot more from forks of it?Jake [00:04:38]: GitHub's original sin is that it's almost a series of broken pointers. You have this thing, then you clone it, and now you've lost the whole upstream. How do we make it trivial for people to modify really small pieces of it?Jake [00:04:51]: We think of Git in a discrete sense: I've either made a change and merged upstream, or I haven't. What would it look like if it were percentage-based, a little more non-deterministic, or a stream of changes that users traverse as a percentage rolled out in general and then rolled all the way up?Jake [00:05:13]: We have the open-source kickback program and let you deploy templates because we want to make it trivial for people to version these shards over time. It solves a large problem around authentication, authorization, and security. NPM has a way to define, “Don't take any new packages.” The ideal end state is that you roll out progressively to users with the minimum impact zone and continue rolling up. JPMorgan should probably be the last one on the patch line, for all our sakes, because our money and livelihoods are there.Jake [00:05:53]: It's okay if Johnny Vibe Coder gets a broken patch because there's so much entropy in the system that the rubber has to meet the road at some point. You have to test at varying levels.The Long Grind: First Users, Free Tier, and Making the Business WorkSwyx [00:06:13]: I wanted to pull up this glorious chart, which is your usage or number of daily signups?Jake [00:06:22]: Daily signups, I think.Swyx [00:06:24]: You started six years ago. It was a slow grind, and now you're on a rocket ship. You say, “Don't doubt your fight and don't quit.” Maybe pick out certain points that were key inflections for the company.Jake [00:06:40]: At the start, it's about getting your first 100 users, hell or high water. We had a website and a support link. The support link was the Discord channel. I had notifications on with two monitors: the monitor I was working on and the other monitor with Discord. If anybody came in, I was immediately like, “Hey, how's it going?” It was rare, so getting those first 100 users to come back was the start.Jake [00:07:14]: Then you build a consultancy factory because users want all these things. You have to go back to the board and ask, “What is the actual product offering I want to build on top of this?”Jake [00:07:28]: VCs want charts that always go up and to the right, but in reality you don't necessarily want charts that look like that. For us, there have been periods of expansion where we add features to test use cases, and periods of compaction where we ask, “If the experience we have is good, how do we make it significantly better?” Maybe we strip out features that don't fit our ICP anymore.Jake [00:07:57]: The boom from 2022 to 2023 came from the free tier. Everybody under the sun was using it.Swyx [00:08:09]: A lot of Reddit bots and Discord bots.Jake [00:08:12]: And crypto miners. When you build an open product on the internet where anybody can sign up, the internet is a horrible place with so many things. You go through periods of asking, “How do I reach as many people as possible?” Then, “How do I fit the exact use case for the people who really matter and are really excited about this specific thing?”Jake [00:08:39]: Then there was a two-year period of making the actual business work. During the free-tier era, we were losing about half a million dollars a month.Swyx [00:08:59]: On a $20 million bank account.Jake [00:09:02]: On a $20 million bank account with maybe $50,000 a month in revenue. That's a horrible business. I don't know how anybody invested. But you have to go through it and say, “We have an experience people love, but the business has to work.”Jake [00:09:17]: There are two schools of thought. You can run the horrible business all the way up with bad margins, or you can go back and make it work. We've always wanted a super lean team. We're 35 people right now. It's very small.Swyx [00:09:36]: Supporting three million already?Jake [00:09:38]: Yeah. We're adding 100,000 users a week right now, so it's growing fast. We don't want to add headcount for the sake of headcount or throw bodies at problems. We want to build systems. It's hard to build systems during expansion because you're adding things to the system because people are asking for them or things are breaking.Jake [00:10:00]: We had to cut off the free users for a little while, rebuild the business, and make sure it worked. We want to reach as many people as possible because software is important. It's become difficult to create things in the physical world, so it's important to make it easy for people to build in the virtual world and have access to creation. But there are legs to that journey.Jake [00:10:30]: You can see divots in the charts. If you follow between 2025 and 2026, it's either summer or winter. People go on holiday with family.Swyx [00:10:50]: It affects that much?Jake [00:10:51]: Yeah. It's kind of B2C and kind of B2B. People are shipping constantly, then they stop. Our activation curve now shows more people activating on weekdays because we have more business users, so it smooths out over time.Agents as the New Interface to DeploymentSwyx [00:11:17]: Was there a point where you started prioritizing AI development or agent development?Jake [00:11:24]: We've prioritized agentic as a top-of-funnel thing. Over the last six months, we've deeply prioritized agentic as a mechanism to build and deploy things because we believe the curve is so steep and that is how people will build and deploy software.Jake [00:11:42]: It almost fundamentally doesn't matter whether this is dot-com or not because we're all on the internet anyway. If agents are going to deploy a bunch of things and we hit an inference wall at some point, we'll fix those problems. The dominant species over the next 10 years is that we've moved from assembly to C to C++ to JavaScript to words. You're going to need to close that loop.Swyx [00:12:13]: When you say this is dot-com, did you mean buying the domain, or the general case?Jake [00:12:17]: I mean the dot-com era, when companies had a huge run-up because people understood the internet was important. Then they hit bottlenecks, fundamental laws of physics, math didn't work, and everybody came back down to earth. But it didn't matter because the internet became so impactful. If you operate on a long enough time horizon, you should build these things anyway because you can see where it's going.Jake [00:12:45]: That's where I think a lot of agent stuff is. You get to a point where you're running thousands of agents in parallel. What is the inference cost? What is the compute cost? How do you make that efficient? How do you coordinate all this? We have issues coordinating humans; we don't even have good tooling for that. Now we have to figure out how to get agents to coordinate, safely version changes, and know when to raise their hand for someone to intervene. Otherwise it becomes an interrupt factory.Railway's Infrastructure Thesis: Network, Compute, Storage, and MetalSwyx [00:13:19]: Let's go right into the technical side. What are the core infrastructure or architectural beliefs of Railway that allow you to do what you do?Jake [00:13:29]: The primitives matter a lot for us. We need network, compute, storage, and orchestration around it. You need control over a lot of those things. We've talked a lot about how we don't really use Kubernetes because we want higher-order control to place workloads in very specific places.Jake [00:13:48]: The reason is that you have to be very efficient with agents: memory reuse and all these other things, or you're going to massively blow up your cost structure. Being able to rack and stack your own servers and build your own metal unlocks performance and cost. Experiences where you're running 1,000 agents in parallel are not massively cost prohibitive.Jake [00:14:13]: Token use and compute use are blowing up. Over time, those things have to get a lot more efficient. You can get a lot of margin to make those experiences solid by building your own metal. That's all in service of offering a differentiated experience to as many people as humanly possible.Swyx [00:14:51]: You have a data center in Singapore.Jake [00:14:53]: Yeah. We have two in every other region now. In Singapore, we're adding a second one in Q3.Swyx [00:14:58]: What's it like? I've never built a data center. Do you go to Equinix and say, “I want some slots?”Jake [00:15:05]: Yeah. Equinix. You basically go and say, “I want power and I want a cage.” They say, “Great, here's what it's going to be.” You rent the cage for a period of time, fill it with racks and servers, and hook up internet to it. That's all the pieces.Swyx [00:15:36]: Then you handle everything else.Jake [00:15:37]: You handle everything else.Swyx [00:15:39]: What's the math versus clouds doing it for you?Jake [00:15:43]: If we rented in the cloud, our payback period when we go to metal is about three months.Swyx [00:15:50]: Which is crazy.Jake [00:15:51]: It's nuts. That's four years of depreciated hardware. You're going to see a lot of this compute crunch because hyperscalers are buying up a lot of stuff. We're working directly with OEMs, resellers, and people building these machines: Supermicro, Dell, and others.Jake [00:16:11]: Upstream, there's a bunch of supply pressure. When we raised our last round, between deploying capital for servers and now, the amount of money we've raised is less than the amount of money we have in the bank plus the value of the servers because the servers have appreciated as RAM has gone up. It's nuts how valuable hardware has become.Jake [00:16:50]: If you look at hyperscalers, they deployed around $80 billion of capital expenditures this year, and next year will be more. That's a massive infrastructure build-out. You look at that and think it's crazy that they're spending way more than the Manhattan Project. But if every person is going to run dozens or hundreds of agents in parallel, you have no conceptual idea how much compute is required to make that experience happen, even if you're deeply efficient and sharing resources. And that doesn't even count inference.Swyx [00:17:22]: How do you plan the build-out? The growth chart is so vertical. Are you usually at 100% utilization as soon as racks are live? How far ahead are you planning?Jake [00:17:33]: We still maintain cloud presence for bursting. We work with AWS, GCP, and a few other clouds. We can rent, and then the moment we get space or power, we compact those workloads off the cloud. We started on the clouds, then built a system to migrate to our own metal. There's nothing that says you can't continually do that again, and that's exactly what we do. We never want to be compute constrained.Jake [00:18:09]: At the start of the year, we actually became compute constrained because one upstream provider wasn't able to give us quota at the rate we needed, and the hardware was slower. I spent a weekend rebuilding our entire network overlay so we could straddle five clouds: Oracle, AWS, ourselves, GCP, and one other one. We can do more than that now.Jake [00:18:38]: We got into a spot where we were trying to pack instances tight because we couldn't get enough compute. That led to a few reliability issues, which are now past us. I made a tweet pointing out that it's becoming harder and harder to acquire compute at the rate these models need to acquire compute. We got bit by it.Swyx [00:19:15]: How do you think about pricing knowing you might not have your own metal available at all times? Are you pricing assuming you need extra margin if you end up going into the cloud?Jake [00:19:26]: Because we've built out our metal data centers, our margins on metal are around 70%. We can deeply subsidize the cloud business if we want to scale at a reasonable rate. We have a few levers: metal, which makes the margins; cloud burst; debt to buy servers; and venture capital. It's an interesting operational problem: how much cash do we have, how much should we raise, how quickly can we deploy it, and can we scale revenue as quickly as we scale compute?Jake [00:20:05]: If we continue making it trivially easy for people to build and deploy, then the faster we close that loop and the more operationally excellent we are with capital, the faster the business can scale. It's almost a straight linear deployment rate.Financing Infrastructure: Hardware Debt, VC, and Operational LeverageSwyx [00:20:20]: I think infra startups raising debt is a tool people don't utilize enough or know enough about. What can you tell us about that? Is it secured against your CPUs?Jake [00:20:32]: It's secured against our hardware.Swyx [00:20:37]: What rates do you get? Who are the lenders?Jake [00:20:39]: We pay prime plus a spread, and we can refinance any of the debt as rates go down. The terms are pretty good. The unfortunate thing is that Twitter has no nuance, so people say, “Venture debt bad.” But as with all things, there are specific tools and areas where you can be deliberate instead of using one tool as a hammer. Venture capital is not the hammer for everything. You have to explore and figure out what works.Swyx [00:21:12]: VC is usually the most expensive financing you can get.Jake [00:21:15]: Yeah. I also think people think about VC incorrectly from a capital-raising perspective. Most people think, “How do I raise as much money as possible from whoever is probably the best I can get at that time?” That's close to right, but what we've tried to do is figure out what unfair advantage we can buy with that equity.Jake [00:21:34]: It's the most expensive equity you're going to give away at that point in time, assuming the company keeps getting better. How do you use it to work with someone stellar who complements you? In the seed stage, I had never started a company. Ray Tonsing had good advice, and I could text him all the time. He was really fast. Awesome.Jake [00:22:01]: Then with John and Erica at Unusual, they said, “You roughly know what you're doing building a product. We'll mostly leave you alone and be available for advice.” Amazing. Then we got to Series A and the business was an operational tire fire because we didn't know how to scale a business. Work with Erica, and Jordan is over at Redpoint, so bonus.Jake [00:22:28]: Now we've raised from TQ and FPV as we're moving into enterprises. Every step of the way, we've asked: who can we partner with at this specific time to unlock the next section of the journey? I don't know enterprise sales. As an engineer, I can eyeball what features we might need, and we have wonderful people internally who can help. But you want boardroom dynamics where everyone is aligned and asking, “How do we win this?” instead of bickering about strategy.Data Centers in Space and the Physics of ComputeSwyx [00:23:31]: You had a tweet about data centers in space. Why no data centers in space?Jake [00:23:37]: It's not “no data centers in space.” My hot take is that I think it is solvable. I've just never seen anybody solve it.Swyx [00:23:49]: You said, “How are you going to dissipate that much heat in a vacuum?” You're making a physics claim.Jake [00:23:55]: I haven't seen anybody prove how you're going to dissipate that much heat in a vacuum. It doesn't mean it's not possible. It just means nobody has brought it up yet.Swyx [00:24:05]: Astrophage.Jake [00:24:06]: I don't know what that is.Swyx [00:24:07]: The Martian thing. Okay, you're very logical.Jake [00:24:09]: It could work. A lot of people are putting the cart before the horse. They say, “We're going to put data centers in space.” Okay, but how? “We have time to figure it out.” It's like in The Martian where they ask how they're going to intercept something and say, “We'll figure it out.”Swyx [00:24:36]: Making a bet on human invention is weird because you blind trust that it can be solved. But with physics, there are first-principles bounds you can put on it. Maybe not. Maybe you're asking to travel time or break a fundamental thermodynamic law.Jake [00:24:57]: I don't know how VCs do this either. How do you know what's not possible and a grift versus what's possible but sounds completely insane? “We're going to put data centers in space.” Coin flip as to which it is, and I guess you'll know in 10 years. That's one cycle.What Agents Need: Versioning, Observability, and 1,000x ScaleSwyx [00:25:23]: Moving back to agents. The branching, fast spin-up, and orchestration you do feels like pre-work that happened to be exactly what agents want. What do agents want differently than humans?Jake [00:25:37]: They want the ability to version things. It's not that different; it materializes slightly differently. Agents want a way to test changes incrementally. Engineers have feature flags. Is there a reason agents can't use feature flags? I don't think so.Jake [00:25:54]: They want version control. Can we use Git or not Git? That one is up in the air. I think something outside Git will emerge for how we version these things over time. They need observability. You need to query what happened, when it happened, which steps failed, traces, logs, metrics, and all the rest. They need network, compute, and storage. They need to write files, save files, iterate on files, and snapshot file systems.Jake [00:26:25]: A lot of what humans needed is in line with what agents need. Branching and forking are not different; we're just moving 1,000 times quicker. It can look like you need something massively different, but what you need is something massively better than what existed. You need orchestration massively better than Kubernetes. You need networking probably better than Envoy. It goes all the way down the stack.Jake [00:26:55]: If the workload profile doesn't change so much as it gets massively compressed because you need thousands of these things, what assumptions change? etcd is going to melt. You need to replace it with something. You can go all the way down the stack and say, “That part has to change, that part has to change, and that part has to change.”Jake [00:27:19]: The interesting thing about the super-exponential curve is that you have to build systems where you can rip out those parts at any time because a new bottleneck might emerge. You get good at parallel agents, and a different part of the system breaks. So it's similar to what humans needed, but at 1,000x scale.Jake [00:27:55]: How do you do code review in the age of agents?Swyx [00:28:00]: You throw more agents at it.Jake [00:28:01]: You don't. But then who reviews for CVEs and all these other things?Swyx [00:28:07]: More agents.Jake [00:28:08]: And that's how we hit the inference wall. You can continually throw agents at the problem, but I think there's a limit to the number of agents you can throw at a problem.CLI, Agent Handles, and Closing the LoopSwyx [00:28:24]: You already had a CLI before it was cool. How is the shape of what you're exposing changing, if at all?Jake [00:28:28]: CLIs have always been cool. The CLI changes because we think about how to give Claude, Codex, ChatGPT, or any model a handhold.Jake [00:28:50]: A CLI is a single command: deploy, get logs, and so on. Things that were prohibitively annoying to humans are not annoying to agents. They're nice. If I handed you a CLI with 40 arguments and 600 flags, you'd think, “I'm never going to use all of this.” But if you hand it to an agent, it says, “This is excellent. I have so many handles to work with.”Jake [00:29:24]: If you're going to expose things to agents that way, you want as many handles as possible where they can get information, query dynamic information, and close the loop quickly. Most problems right now are about how to close the loop as quickly as possible. Where does the agent get stuck, and how can you remove that?Jake [00:29:49]: Telemetry is important. If you can tell where the agent gets stuck from the CLI and say, “12% of people deviate from the happy path because of this, and now I add this argument and drive it down to 2%,” you massively increase the rate of loop closure.Jake [00:30:03]: That's how we think about not just the CLI, but every point in the dashboard. It's a user journey: I hear about Railway. I get something deployed. I get my first green build or aha moment. I see an endpoint, logs, whatever. Then I iterate. The iteration loop is indefinite. The user wants to deploy a new thing, a Postgres instance, change code, and keep iterating.Jake [00:30:36]: If you focus on the iteration loops and what's blocking them from closing quickly, one thing we say internally is: you never want to be waiting on compute anymore. You always want to be waiting on intelligence. If you're waiting on compute, there's a bottleneck that needs to be destroyed because eventually that bottleneck becomes so large that another workflow emerges to change it.Jake [00:31:04]: We've built a product where you push code, build it, and so on. But I fundamentally believe the push-pull loop is going away. We'll get to a point where you make a small change in production, that change is versioned across your infrastructure, you're working alongside copy-on-write versions of your database and infrastructure, and then you merge it in and it's instantaneously live. That's the holy grail of loops. The push-pull-rebuild thing is a point of friction that we're removing entirely.Canvas as Output: Dashboards, Context Anchors, and HyperstructuresSwyx [00:31:43]: It's incredibly fast. If anyone hasn't tried it, that fast feedback is great. My hot take is that Railway was famous for its canvas, which visualizes your infrastructure and lets you manipulate it visually. But that was for humans. For the next phase of growth, Railway CLI is more important than canvas.Jake [00:32:05]: The canvas is funny because it's a mechanism to show changes over time. You're right that previously we used it a lot as an input. Moving forward, its goal is more like an output. You would go to the canvas, make changes, see them, and watch your infrastructure evolve. Now agents have access to the CLI and can make those changes. So the canvas becomes an output: what information does the human need at this moment to make suitable decisions about control requests? Do I approve this or not?Jake [00:32:57]: It also has to be an anchor for your context, a port in the storm. Think of it like layers in a file system. You start with a project, then drill down into services, then into a function or code, because you want to represent the entire thing not just in your head, but in the canvas. Other people can share that representation, think on the same wavelength, and move quickly.Jake [00:33:33]: A lot of organizations get in trouble as they scale because all the context lives in someone's head. “How does this microservice work?” “I have no idea; go ask this person.” Then you have whole categories of products built around context discovery. A lot of that melts away if you have a solid hierarchy and can infinitely nest services, code, context, and everything else all the way down. That's what lets you build these structures over time.Jake [00:34:18]: It's also what lets us build what I've called hyperstructures: things that are way bigger. You look at the Golden Gate Bridge and ask, “How did we build that?” There's a meme that we lost the technology. To some extent, yes, because the coordination that built those things evolved and changed. We lost some of the art of building structure as we jammed everything into Slack.Swyx [00:34:52]: But you jam everything in Discord.Jake [00:34:53]: Same point. It doesn't matter. It's message passing and interrupts, message passing and interrupts.Swyx [00:35:00]: So you're arguing there should be something better and more structured than Slack?Jake [00:35:04]: Yeah. For sure. I think Slack is awful, and Discord is awful too.Central Station: Context Routing, Support, and Incident ClustersSwyx [00:35:09]: This is the equivalent of my mom test. What have you done that has your solution to this?Jake [00:35:15]: Internally, we've built a tool called Central Station that aggregates all the context from our users. Every piece of feedback, every customer support item, everything gets aggregated into clusters. If an incident is brewing, we can determine how many users are affected and break off a discussion based on that.Jake [00:35:40]: That is more helpful than long-running channels where you're trying to decide which channel to put something in. If you can dynamically aggregate information and dynamically route it to the right person based on context, it works better. We know internally that these four people are close to networking. If we see a networking thing, we can drill it down to those four people. If it's with this part, we can look at the commits. This is no longer a manual process internally.Jake [00:36:13]: If you go to station or help.railway.com, that's why we built it. We wanted to scale with a massive amount of leverage by aggregating feedback.Swyx [00:36:27]: This is built in-house?Jake [00:36:28]: Yep.Swyx [00:36:29]: I remember helping out on this one with Angelo in 2023. You scale a lot with a very small team.Jake [00:36:38]: Yeah. We're about 10 times bigger now.Swyx [00:36:40]: You have your full developer code here? Very cool.Jake [00:36:44]: If you go to railway.com/stats, we expose this as a pub-sub-able thing. It's all real-time metrics. There's a way to get it as JSON somewhere if you care.Jake [00:37:01]: We're big on trying to build everything in public and talk about what we're working on. We've had issues in the past, and we'll say, “Here's how we're fixing these things.” We've gotten compliments and flak for incident reports. We're always trying to make them better and talk with people.Incidents, Disclosure, and Progressive RolloutsSwyx [00:37:20]: You had a big one recently. I liked that it was scoped to 3,000. You presumably used Central Station. Talk through what happened and how you address it internally as a team.Jake [00:37:38]: Internally, this one really sucked. It had to do with an upstream provider that didn't do the behavior it said it documented, which is unfortunate given they wrote the RFC for how the behavior should work. We rolled those things out, and Central Station caught it initially when a couple users said caches weren't invalidating. We turned it off immediately.Jake [00:38:03]: When you roll out to a large user base of three million people, you get a lot of disparate behaviors. We tested in staging and had tests, but we hit an edge case. We've hardened those systems, and now we can make that better. But it was a tough one.Swyx [00:38:39]: I always wonder how private disclosure is supposed to work if people find an issue. Are they supposed to contact you first? When you run a platform, these things will happen. What channels should people pursue to quietly resolve it before it becomes a bigger incident?Jake [00:38:59]: There's responsible disclosure. We err on the side of over-disclosing and letting you know something is wrong versus having your provider gaslight you. We've erred on sharing those things more publicly, even if they impact a small subset of users. That's a decision we've made internally. We have four values. One is honor. The honorable thing is to notify people to the widest degree at which they may have been affected or there was an issue, and then confront it head-on: why did it happen, what can we do better?Swyx [00:39:45]: Not the whole user base. That's because of incremental rollouts and other things?Jake [00:39:50]: Yeah. Progressive rollouts.Swyx [00:39:54]: That should be the norm at all large platforms.Jake [00:39:58]: It should. A variety of companies do this. There's the quote that Meta runs 10,000 different versions of Meta. To our earlier point about agents, they need the same thing. They need shadow traffic and all these other things. We've built so much ceremony around production being sacred that we need to make it trivially easy to test different behaviors in a safe environment. Then you can make mistakes in a safe environment.Safe AI SRE: Customer Agents, Forked Environments, and Production ParityAlessio [00:40:30]: Do you see a world where these things get automatically caught, not necessarily by your agent, but by your customer's agent? The cache invalidation issue seems easy to check if you know to look for it.Jake [00:40:44]: It's hard because to determine it, we almost need to hook into your observability infrastructure. That's why we have the template loop on the platform: so you can roll things out progressively. You can roll out to Johnny Vibe Coder initially, or push a shard that someone consumes at their own leisure. Or you can roll it out over weeks: 0.1% of people, 1% of people, early adopters, then all the way up. That's the non-deterministic version control we talked about earlier.Jake [00:41:30]: I believe that's where most things should go, because most companies end up building staged rollout systems in-house. It's the same thing built again and again at every company. There's a massive opportunity to consolidate developer debt.Alessio [00:41:45]: You should have a free tier. Model providers give free tokens if you let them use the data. You could give free compute if someone is the number-one shard that goes out and lets you plug into their observability.Jake [00:41:55]: We do that. That's why we talked about the impact on 3,000 people. We start with lower-impact people. Larger companies on the platform are last to receive those rollouts so they have a version of the platform that's deeply stable.Alessio [00:42:16]: I have three services, so I'm sure I get the first rollout. You can nuke my thing at any time. There are all these SRE agent companies. Observability people also want agents that fix upstream problems. You have your own agent in the canvas now. How do you see that playing out?Jake [00:42:39]: It's the stacking entropy problem. If you don't have primitives to make iteration in production safe, it becomes difficult. If you're an observability provider saying, “Here's the fix to this error,” assume 80% are good and make sense. But in the last 20% long tail of complex issues, if you let somebody stamp it, you create an opportunity for an incident.Jake [00:43:08]: That's why forked environments are important. People have staging, but it always drifts from production. You need primitives, workflows, and experience built first-party on the platform so you can fork any service at any point in time.Jake [00:43:33]: I think of the canvas as a sheet of transparency paper. The agent is a little guy you push up into the canvas. It should say, “I need to copy that service and that service so I can test these two things.” It gets a read-only copy of production. Anything that's PII gets marked as a transform when we clone the database, create a copy-on-write version, or read from it. Then the agent makes changes and asks, “Does this actually work?” as close to production as possible.Jake [00:44:22]: That's how close you have to be, or you get massive drift. The system becomes unstable. You see this with massive systems built on Docker for local, Kubernetes for production, and a specific thing for something else. That complexity slows developers and becomes unstable at scale, making it hard to iterate. We want to compress that way down and say, “As close to prod as possible is where we want to be.”From AISRE Skeptic to Agent BelieverSwyx [00:45:00]: I was texting Erica for questions, and she says you were originally not a believer in AISRE. Have you come around on it?Jake [00:45:10]: I flipped, but I'm still not a believer in AISRE if you don't have the primitives to make it safe. If you unleash AISRE on production infrastructure without safe primitives for copying volumes and making sure things are fine, it's going to nuke your production database. It's not a matter of if, but when. I'm a big believer in making those loops safe.Jake [00:45:33]: I was a deep AI skeptic until 2023. In 2024, I thought, “Maybe I can roughly make this thing do it.” In 2025, I thought, “Now I can hold this.” Over winter break, everybody came back saying, “It's almost impossible to hold this.”Swyx [00:46:01]: Did you see this on the Claude docs? CloudBot? OpenCloud?Jake [00:46:06]: It's gotten to a point where it's harder to hold it wrong than to hold it right. There's a scene in Avengers where Vision picks up Thor's hammer and says it's terribly well-balanced. It self-balances and works well. I'm a deep believer at this point that this will be the dominant species: assembly, C, C++, JavaScript, words.Swyx [00:46:35]: It feels like a big jump.Jake [00:46:37]: It is. But it's not like you abandon CPU-based discrete logic and move straight to fuzzy logic. You need both. Your skills should call code or applications or some static structure. You can use skills to distill what the procedure should be or how the code should act.Jake [00:47:02]: I'm coming to a thesis: you need three points. You need a clear spec defining the system, the code, and the tests. When you say it out loud, if you've been in engineering long enough, you're like, “Of course. That's an RFC, tests, and code.” But they all matter. Having them together lets them reinforce each other: the spec and tests match, but the code doesn't, so reconcile it. Or the tests and code match but the spec doesn't, so reconcile that. That's the iteration loop.Jake [00:47:41]: That's why you're seeing people talk about software factories, docs, and reconciliation. Some of that is architectural astronomy if you don't implement it, but that loop is where most things will end up.Swyx [00:48:07]: For listeners, we've been talking about this on the pod for three years: the holy trinity of specs and tests. Itamar Friedman from Qodo is the reference if people want to look it up.Self-Modifying Infrastructure and the End of Push-Pull-RebuildSwyx [00:48:18]: One thing I want to mention on the OpenCloud idea is self-modification. I don't know how Railway would support it, but I have my OpenClaw, and I just tell it it has the Railway CLI and can do whatever. In theory, whatever capabilities or new infra it needs, it can call the Railway CLI, provision it, and add it to itself. The agent can modify its own infra.Jake [00:48:45]: It's nuts. I have a loop set up where you put the Railway CLI on top of something that runs on Railway. You're authenticated as whatever the current box is, and you can make any changes to it. Then you call Railway deploy, and it deploys itself.Jake [00:49:04]: It's like: “I need to spin up this instance of this environment. I already exist in this environment. Excellent, I have access to a Postgres instance now.” That's where we want to go with agentic, self-replicating infrastructure. That's your loop: iterate in production. You continue making changes. If it works, merge it upstream. If it doesn't, throw it away.Jake [00:49:37]: How do you make throwaway copies trivial to spin up and super cheap? The era of “I have an AWS instance with four vCPU and 16 gigs of RAM” is going to get destroyed. If you do that for agents, you need a thousand of those machines. It's prohibitively expensive compared with what we've spent a ton of time figuring out: the atomic unit of deploy, whether you call it isolates, sandboxes, or something else. Only pay for what you use, spin up instantaneously, and close the loop as quickly as possible.Jake [00:50:15]: If the system can self-replicate safely and say, “This is my environment, I'm making these changes,” it can come back with, “Does this look good? This is a new state of infrastructure given this prompt. I think I've solved it.” Then you go back and say, “Actually, it looks different.” It does the loop again. Then you say, “Cool. Apply.”Swyx [00:50:38]: That's retroactively obvious, which is the most useful kind. Any other comments on agent deployment on Railway?Jake [00:50:51]: It's getting better every day. I'm on X or Twitter. You can always yell at me about the parts not working as well as they should, because plenty of things should work way better.The New Serverless: Stateful, Long-Running, Pay-for-What-You-Use LinuxSwyx [00:51:04]: At this stage, when people want massively or embarrassingly parallel compute, they usually talk serverless. I feel like there's a new serverless compared to the previous five years of serverless. You're in that new bucket. Do you have comparisons or philosophical differences you want to call out?Jake [00:51:31]: It's somewhere in between. It's the ability to run stateful, long-running workflows or executions.Swyx [00:51:42]: Vercel has Fluid Compute, Cloudflare has some container thing, Google has App Runner and others.Jake [00:51:55]: That's where everything is roughly going, and it's why we've been working on this for six years. We believe users need access to a computer: a box that speaks Linux. They need to deploy what they want. Other systems change the surface area of what you can build. For us, users need a computer and need to deploy anything they truly want. That's why we've focused on the primitives: network, compute, storage. If we give you those and expose them so you can run things indefinitely, that's where we believe it's going.Jake [00:52:43]: Twitter has no nuance, so everyone says “servers” or “serverless.” It's always somewhere in the middle: I want to run it for a long time, but I don't want to provision the resource statically or pay for things I'm not using. That's been our thesis from day one: pay only for what you use, run it indefinitely, and it is full Linux.Swyx [00:53:12]: That's why I like the naming of Fluid. It's fluid. Flexible.Heroku, Focus, and Carrying the Torch Without Becoming the PastSwyx [00:53:18]: Another milestone is the Heroku official deprecation. You're one of the presumptive new Herokus. “New Heroku” has been a category for as long as I've been in developer tooling. It's finally happening. What was that like? Any behind-the-scenes of, “This is the moment”?Jake [00:53:42]: You have people where you're like, “You were running stuff on here? You, as this company?” It's crazy that names you would know are running on it and now coming to us saying, “We want to move a lot of this off.”Swyx [00:54:00]: Any behind-the-scenes on why Salesforce let Heroku stagnate?Jake [00:54:05]: I can only guess. It's hard when it's not your business. Salesforce's business is to build a great CRM. That's their focus. Then you acquire a compute business as an offshoot. A lot of early Meta people talk about focus. Boz has a write-up about how in the early days of Meta they had no money, so they were forced to focus. Then they turned on the money tree and had no reason not to split their focus.Jake [00:54:52]: But that dilutes your product. You get offshoots where you ask, “Is this the focus of the business?” If it's not core, it languishes. A lot of companies get in trouble when they split focus because they're fighting a multi-front war, not just externally but internally for alignment. Where are we going? What are we doing? What is our purpose?Jake [00:55:24]: If you're Salesforce-built and mission-driven, you want to work on Salesforce. Heroku is off to the side. It's not core to the business. Getting resources, budget, focus, and alignment internally becomes hard. It was a matter of time.Swyx [00:56:06]: Kudos for them to call it out instead of leaving it unknown.Jake [00:56:12]: Their release was a little odd. They called it out, but they didn't say they were shutting it down. Behind the scenes, I think they issued messages to people saying they should close accounts and that they were going to deprecate and remove things over time.Jake [00:56:30]: It's crazy because some of my first deployment experiences were on Heroku. You start with dragging things into an FTP server, then you try to get a deploy working, and then it's Heroku. It was the on-ramp for us. But the wheel turns. New things emerge. We're happy to carry the torch for a lot of that. But we don't want to be the new Heroku. We want to be the way people build and deploy software, and ultimately the way people monetize software over time.Swyx [00:57:19]: It's still a big crown to be the new Heroku. There are 50 companies that fought for that.Jake [00:57:23]: Everybody is holding some portion of it. We're happy to support people and companies. The platform works differently. The game loop is similar, but we've been dogmatic about where these things are going: primitives, agents, fan-out. Some things fit; some workflows need to change. We have an approximation of Heroku pipelines with the environment system. It's exciting. We've got a ton of people we can support, and it's growing a lot.Temporal, Workflow Engines, and State MachinesSwyx [00:58:12]: I have one more technical question about Temporal. I've sold my shares. You're a power user and one of our earliest customers. I met you through Temporal. You built on Temporal. You have complaints. This may be the most neutral and informed conversation anyone will hear about Temporal without someone working at the company.Jake [00:58:39]: That's fair. I've used Temporal for almost 10 years because of Cadence at Uber.Swyx [00:58:52]: Give people a sense of what Cadence was at Uber.Jake [00:58:57]: Cadence was the precursor to Temporal. It powers trip actions, rides, when you rent a Jump bike or scooter or car. You're running workflows for a period of time and saying, “This ride will run indefinitely until it finishes.” You attach information: you paused in this zone, so add this charge to the bill. When you end the trip, the workflow is done. That experience was powered by Cadence at the time.Swyx [00:59:34]: I used to say it's like programming the entire user journey top-down as one function.Jake [00:59:39]: It's a powerful idea and important. It's also important for the next phase of the agentic journey. You want an agent to do a specific task, be complete or incomplete on that task, and move on to the next thing. You need a way to manage workflows dynamically.Jake [00:59:59]: Temporal was always great in theory, and great when you got it working the way you wanted in production. But it required you to model the entire journey in your head. If you didn't, you could cause issues where replaying the state of the workflow causes non-determinism.Swyx [01:00:25]: Because it works on deterministic workflow history.Jake [01:00:28]: Exactly. I describe it as a jet engine. If you know how to operate it and run it, it's great. But you can't hand it to people trying to build complicated things if they don't have the whole state in their head.Jake [01:00:48]: We run our whole deployment pipeline on top of it. That's a reasonably complicated workflow: pre-commit hooks, signaling, queuing, and all the rest. We ran into the same thing at Uber. As you express a large workflow, it gets more complicated, with more states in the state machine that you have to map back to the workflow.Swyx [01:01:15]: It's a lot of ifs.Jake [01:01:16]: Exactly. At Uber, we built a system for doing the state machine and testing it. We've started to build some of those things here because it's grown heavily. It's not quite love-hate. When it works well, it works super well. But if someone who doesn't have full context puts something into the system that invalidates state or causes non-determinism, or spins off a ton of activities, you have to keep track of underlying SRE knobs like activity slots. Those should scale with memory, vCPU, and so on. It becomes a bear to scale.Swyx [01:02:10]: You need a capable sysadmin running things behind the scenes. If you moved off, what would you do?Jake [01:02:19]: We'd build our own workflow engine. We have a few internally that we've worked on.Swyx [01:02:27]: This is one of those classes of things you typically wouldn't vibe code, but I'm wondering if you can.Jake [01:02:33]: I still don't think you should vibe code it. You still want to run decent tests to make sure it works.Swyx [01:02:39]: Timo didn't invent that from scratch either. There are libraries you can run. On top of that, it's just a state machine that you have to map out. Ultimately, you define the instructions you want and run them through a state machine.Jake [01:03:00]: It's very doable. Workflow stuff is interesting. Restate is doing neat stuff here.Swyx [01:03:10]: You're tied into JavaScript. Are you a JavaScript maxi?Jake [01:03:13]: Internally, we have TypeScript, Rust, and Go. We don't add more languages. Actually, we have a little C because we write BPF code and hooks. But those are the languages.Swyx [01:03:28]: Is this for sidecars?Jake [01:03:32]: No. It's for the networking stack, volumes, and things like that. We use TypeScript a lot because it powers the dashboard, but we're moving a lot of workflow stuff off the dashboard stack and into the infrastructure stack.Railpack, Nixpacks, and Content-Addressable FilesystemsSwyx [01:04:00]: Cool. Any other technical infrastructure stuff? Railpacks?Jake [01:04:07]: We built an engine for determining dependencies based on source code. It's called Railpack. We built the first version, Nixpacks, on top of Nix, and then we moved.Swyx [01:04:17]: People have been trying to get me to adopt Nix and NixOS for four years. Is it ever going to be a thing?Jake [01:04:23]: I don't know. We're excited about it, but it has pain points. Think of it as a stack of versioned binaries at specific slices in time. If you want version X and version Y, you bloat the package space, which blows up image size and makes real-world workloads difficult.Swyx [01:04:53]: But you content-address it and cache it. In theory, there are optimizations.Jake [01:05:00]: In theory, yes. But with a large enough user base and disparate enough machines, you run into a problem Meta described in the XFAAS paper, their internal serverless system. It becomes difficult at scale unless you break out specific runtimes.Jake [01:05:24]: We didn't want to do that because we wanted to truly allow you to deploy anything. That was our initial thing with Nix. But we've moved toward interesting work around content-addressable file systems that can lazy-load anything from any point and page it into memory.Swyx [01:05:48]: Amazing.Jake [01:05:49]: The future is very bright. It's crazy, and it's going to be nuts.Coding Agent Spend, Roadmaps, and Token ROISwyx [01:05:54]: Founder journey stuff?Alessio [01:05:56]: Your cloud usage: you tweeted you're going to spend $300K this month?Jake [01:06:01]: I think we got to $200K.Alessio [01:06:02]: Coding agents?Jake [01:06:03]: Yeah.Swyx [01:06:04]: Across the company?Alessio [01:06:05]: You only have 35 people, so I'm sure they're not all spending $10K a month. What's the distribution?Jake [01:06:10]: I think I'm at about $25K. We have power users all the way down. We came back from winter break, and I basically said, “If you're writing code by hand, you're doing this wrong.” The tools are good enough now that you can move extremely quickly. There are issues and pain points, but you should be reviewing the code you are writing instead of writing it by hand.Jake [01:06:40]: Architectural patterns matter more now than ever, but you shouldn't spend your time generating code you would write. If you know how to write it, ask the agent to write it and reconcile it until it looks like you would have written it yourself.Jake [01:06:58]: People misconstrue my propensity to push people toward agents as connected to our growth and some reliability bumps. They're not necessarily related. The tools are good enough to move extremely quickly and build things way larger than you could before.Jake [01:07:19]: To the earlier point about cooling data centers in space: I don't know. But with software, you can ask, “How would I build block storage from scratch? How would I do these things?” I have ideas because I have history and have read papers. Let me work them out and build massive test benches with thousands of tests, because those are now free to author. If you're not using AI systems to speed-run your roadmap and reconcile your existing system onto the future, you're missing a large point of what's happening.Alessio [01:08:12]: What's the path to spending $3 million a month? Is it bound by ideas and things customers can absorb?Jake [01:08:19]: For most companies, it's bound by deployment at this point. That's why we've seen a massive boom in users and companies, from Fortune 50s down, asking how to get developers to move faster. You'll probably hit your CFO before any technical limits because they'll look at the eye-watering amount of money spent on tokens. Inference costs have to come down, but we're inference constrained now. There will be price discovery around what makes sense for an org to adopt.Jake [01:09:06]: I think you'll end up with the F1 driver concept. If someone is really adept at these things, it makes sense to put them in a $3 million car. If they're not, it probably doesn't make sense. You'll take a few people and say, “You can drive the F1 car. We need to go in this direction. Figure out if it works and prototype it.”Jake [01:09:33]: We've done some of that and vastly accelerated our roadmap. We thought we'd ship something in a few years; now we can probably ship it in a few months because we validated it and don't have to build it incrementally. We can skip steps and move toward our vision.Alessio [01:09:58]: A lot of people are realizing the roadmap doesn't always have a business impact, so they say tokens are too expensive. But if your roadmap were built to make more money by the time you built it, you'd have token pricing for it, the same way you do with sales. You'd spend a billion dollars on sales if you knew you would get $2 billion of revenue.Jake [01:10:19]: Exactly. A naive way to measure this is the percentage of tokens that end up in production. If you can measure impact because those tokens end up in production, that's awesome. But the burden of proof will rise. Internally, we have a growing number of pull requests that haven't merged. The question becomes: how do you get this into production? It's about how quickly you can build and deploy software, which is exciting because that's our whole thing.The SDLC Shift: Prompt Requests, Feature Flags, and Safe RolloutsSwyx [01:10:56]: The SDLC is changing. One thesis is that the pull request is dying. It's going to be the prompt request. Beyond that, code review is also kind of dying if you have all the other systems in place. What else is changing about the SDLC?Jake [01:11:19]: The AISRE and the tools to make it happen. AISRE is pie-in-the-sky aspirational. What does it take to get an AISRE? What tools do you need to build?Swyx [01:11:32]: You should expose your tooling to customers at some point. The Central Station command center.Jake [01:11:39]: We have it for template maintainers. Template maintainers can deploy and maintain templates, and they get feedback. We're going to expose those things incrementally.Swyx [01:11:51]: Clustering around incidents. Everyone has a version of that, but I don't think anyone has solved it.Jake [01:11:56]: I won't say we've solved it internally, but it's gotten so good that we can see incidents forming pretty quickly. At some point, those will be things either someone else builds or we build. We've always built things purpose-built for us. If it makes sense to make it useful for users, monetize it, or turn that loop into a profit center instead of a cost center, we want to do that.Jake [01:12:28]: Pull request is definitely dying.Swyx [01:12:29]: Do you do first-party feature flagging and incremental rollout stuff?Jake [01:12:34]: We have a feature-flagging engine we built internally and will eventually roll out.Swyx [01:12:38]: I don't see it as a user. How come you didn't give us what you have?Jake [01:12:43]: We have to beta test it. We care a lot about the quality of the things. There's plenty we've used internally that doesn't make it all the way through the journey because it fails. It works for one service but not multiple services. We'd have to build it for multiple services and know that if we released it, we'd rebuild it again and again. Some things are worth that, but many inform the roadmap.Jake [01:13:18]: We don't want to dilute the experience by saying, “This works, but only for this service,” unless it's a core initiative. Over the next few months, we'll roll out things that work for a single service, then multiple services, then multiple services across the environment. You have to be deliberate. Otherwise you create broken disparate experiences and support load because people ask how to use the feature.Jake [01:13:52]: It's the earlier expansion and compaction pattern. You expand the company to get features, then compact and smooth them out so the experience is stellar. You told me in the hallway, “It's gotten so much better.” Internally we're saying, “This part really sucks. We need to make it significantly better.”Swyx [01:14:11]: I can attest to that over the last three years watching you build Railway. For listeners, feature flagging is a huge part of Uber culture. So much so that they have too many feature flags and another thing to remove feature flags. Facebook has Gatekeeper. Agents are going to need this. It's fundamental to incremental rollouts. OpenAI acquired Statsig. GPT-5 is routing and flagging through different models.Jake [01:14:56]: It's super important. If the software development lifecycle is going to change because we're doing things 1,000 times faster and 1,000 times more concurrently, what becomes important at scale?Jake [01:15:16]: Before I started Railway, I built a feature-flagging product and tried to sell it. It was an easier version of LaunchDarkly. I ran into a problem: anyone small enough to adopt your technology doesn't care about feature flags, and anyone large enough to need feature flags needs so much scale that you have to build out all the infrastructure. I scrapped it.Jake [01:15:42]: But what is old is new again. Companies are trying to move quickly, but you can't YOLO a vibe-coded thing straight into production. You need to say, “Here's my blast radius, my impact, and I want to shadow it for these users.” Feature flags. You're going to need the tools larger companies built to maintain their structures. Everything gets compressed by 1,000x so everybody can build those structures quickly.Jake [01:16:07]: That's exactly where we are: compressing the software development lifecycle, then expanding it and adding more new things.Cattle, Pets, and Clonable InfrastructureSwyx [01:16:15]: Another term that comes to mind for newer developers is “cattle, not pets.” People treat production like a pet. It has a name. You baby it and keep it alive. With cattle, you can mass farm, roll out, portion parts out, and kill them.Jake [01:16:37]: I think that might change. You can move toward having pets as long as you have a cloning machine for your pets.Swyx [01:16:52]: Yeah.Jake [01:16:52]: If you can snapshot every single thing at every frame, it doesn't matter if something gets obliterated because you have a snapshot of it. The things we've built right now are designed to block changes from the hermetically sealed DevOps line. You have to write a Dockerfile because you nee

Ardan Labs Podcast
AI, Open Source, and Accessibility with Eugene Cheah

Ardan Labs Podcast

Play Episode Listen Later May 20, 2026 80:57


In this episode of the Ardan Labs Podcast, Ale Kennedy talks with Eugene Cheah, founder of Featherless, about his journey from physics to building globally accessible AI systems. Eugene shares his vision for making AI more affordable, multilingual, and open to communities around the world through efficient architectures and open-source collaboration.The conversation explores GPU optimization, evolving AI infrastructure, the importance of multilingual support, and the balance between innovation and regulation. Eugene also reflects on speaking at the United Nations, the future of open-source AI, and why accessibility and transparency are essential for the next generation of AI technology.00:00 Introduction and Featherless02:25 Education and Early Interests10:24 University and Military Service15:19 Entering the AI Industry22:33 Startups and AI Development30:42 AI as a Force for Good34:28 AI, Culture, and Automation42:13 Fundraising and Building a Startup50:10 AI Architecture and Optimization58:23 The Evolution of Featherless01:02:37 Building a Global AI Vision01:06:57 Open Source and AI Accessibility01:12:35 AI Risks and Real-World Concerns01:18:20 Lessons Learned and Final ThoughtsConnect with Eugene: LinkedIn: https://www.linkedin.com/in/eugene-cheah-a47791126/Mentioned in this Episode:Featherless AI: https://featherless.ai/Want more from Ardan Labs? You can learn Go, Kubernetes, Docker & more through our video training, live events, or through our blog!Online Courses : https://ardanlabs.com/education/ Live Events : https://www.ardanlabs.com/live-training-events/ Blog : https://www.ardanlabs.com/blog Github : https://github.com/ardanlabs

Web3 with Sam Kamani
385: Proving You're Human in the Age of AI with Guest speaker Ian Dilick, developer relations at World Foundation

Web3 with Sam Kamani

Play Episode Listen Later May 18, 2026 16:59


I sat down with Ian from the World Foundation to dig into one of the most pressing problems of our time, how do you prove you're a real human online without sacrificing your privacy? As AI floods the internet with bots and agents, the gap between human and machine interaction is closing fast. Ian walks me through how World ID and the Orb device let anyone verify their humanity using advanced cryptography, completely anonymously, no passport scans, no email addresses, no data sitting on some server you don't control. We also get into Ian's wild journey from GPU mining and Constitution DAO to building at World, and why the current KYC and AML model is a problem for both users and platforms. This is a conversation about identity, privacy, and what it means to be human in a world where most internet traffic won't be. Connect: World Foundation Website: https://world.org Twitter/X, World: https://x.com/worldnetwork Web3 with Sam Kamani: https://www.web3pod.xyz/ Key points:• [00:00] Sam introduces Ian from the World Foundation and the episode's focus on digital identity in the AI age• [02:30] Ian's background: crypto-adjacent upbringing, GPU mining with family, selling a Bitcoin at $600• [05:00] Ian drops out of college during COVID, starts a startup, gets pulled into crypto through Constitution DAO• [08:00] What World is: a way to prove you're a real human online, completely anonymously, using the Orb device• [11:00] Why this matters now: bots and agents already make up 60–70% of crypto trading traffic, and it's growing• [14:00] World ID vs KYC/AML: not a replacement for regulated compliance, but a privacy-first alternative for situations where KYC isn't legally required• [17:00] Why both users and platforms suffer under current KYC models, GDPR compliance burden, data exposure, trust issues• [20:00] How World ID solves the same human-verification problem more privately and with a better user experienceDisclaimer:Nothing mentioned in this podcast is investment advice and please do your own research. It would mean a lot if you can leave a review of this podcast on Apple Podcasts or Spotify and share this podcast with a friend. Be a guest on the podcast or contact us - https://www.web3pod.xyz/

聽天下:天下雜誌Podcast
【天下零時差05.19.26】輝達連7紅 專家:還很便宜,漲勢才剛開始

聽天下:天下雜誌Podcast

Play Episode Listen Later May 18, 2026 8:09


股價沉寂多時的美國科技公司輝達(Nvidia)最近連續7個交易日收紅,專家表示相對其他半導體公司,輝達現在依然便宜,有機會再漲50%。 文:樂羽嘉 製作團隊:錢玉紘、張雅媛、鄭子鴻 *閱讀零時差,點這看全文

聽天下:天下雜誌Podcast
【天下零時差05.18.26】輝達將公布財報,股價繼續飆?聯準會內部分歧加劇;Google I/O大會端哪些新菜?

聽天下:天下雜誌Podcast

Play Episode Listen Later May 17, 2026 8:30


週一天下零時差關注以下財經大事: 一、美國聯準會內部分歧加劇,會如何影響利率走向? 二、雲端資本支出續增且CPU短缺,給輝達什麼機會? 三、Google母公司Alphabet、逼近輝達,I/O開發者大會將端出哪些新菜? 文:郭家宏、蔡娪嫣 製作團隊:錢玉紘、張雅媛、鄭子鴻 *閱讀零時差,點這看全文

Late Confirmation by CoinDesk
Blockspace: IREN's $3B Note, CME Compute Futures, Mike Alfred's Stock Picks, Trump's Q1 Bitcoin Equities

Late Confirmation by CoinDesk

Play Episode Listen Later May 16, 2026 86:33


AI compute futures are now live on the CME, and IREN has raised $3B in a new convertible note offering. Welcome back to The Blockspace Podcast! Today for news, we cover IREN's new $3B convertible note – the largest convert ever for a public bitcoin miner – Trump's Q1 bitcoin equity buys, and the 90-day pause on zoning discussions for Hut 8's proposed 500 MW data center in Logan County, Illinois. Plus, Mike Alfred of Alpine Fox Hedge Fund joins us to discuss his top stock picks for AI, and Kush Bavaria of Ornn jumps on to discuss how Ornn is providing an H100 index for the CME's new AI compute futures – and his thoughts on the future of these incipient compute futures markets. Mike San Miguel of Luxor also joins us to discuss the latest in GPU markets and AI ASICs, and pseudonymous user Soup explains how he used Claude and $15 in tokens to spin up 3.5 trillion passwords to crack his long-lost bitcoin wallet.  

FView Friday
既然显卡像老酒,越老越香,那不如大家都换成 GTX690

FView Friday

Play Episode Listen Later May 16, 2026 136:36


本期嘉宾:彭林、十天、蓝白、恺伦本期节目的主要内容有:· 00:00:40 -- iPhone 17 Pro 官降 1000 元· 00:05:45 -- ChatGPT 进 Siri 未达预期,曝 OpenAI 拟对苹果采取法律措施· 00:19:38 -- Mark Gurman:苹果 Vision Pro 团队已被拆分重组· 00:29:35 -- 黄仁勋称 5 年前的 GPU 就像老酒:它们越陈越香,还会涨价· 00:33:53 -- 5499 元起,大疆发布 ROMO 2 系列扫拖机器人· 00:45:55 -- 马斯克点赞宇树载人变形机甲· 00:55:33 -- 《GTA6》将于北京时间 11 月 19 日零点解锁我们的二手线下店位置在深圳·坂田北·吉华路·展誉公馆,离地铁站很近目前已经开业了,感谢大家的支持~还有众多观众朋友的热心提问~每周五晚 8 点,爱否直播间,我们一起开心聊天

DLN Xtend
223: Linux on the Road | Linux Out Loud 125

DLN Xtend

Play Episode Listen Later May 16, 2026 62:21


Wendy is back from hauling robots to Texas and getting ready to drive another one to California, so the crew leans hard into life on the road with Linux. Bill talks about moving his systems over to Bazzite, tells the story of an overworked NVIDIA 1080 that literally ate into another GPU, and explains how HomeBridge 2.0 keeps his smart‑home world humming. Nate shares his first impressions of Tux Manager, a Linux clone of the classic Windows Task Manager, and walks through the Framework‑plus‑Flip‑Go combo that makes his roaming setup feel like CubicleLabs away from home. From Steam Decks and One X Players to UniFi travel routers and noise‑canceling headphones, everyone opens their travel bags and talks about the gear they actually trust when Wi‑Fi is sketchy and power outlets are rare. Wendy also geeks out over her new MOVA V50 robot vacuum, complete with a dedicated “Sentinels” Wi‑Fi SSID, and how little self‑hosted comforts make a hotel room feel just a bit more like a homelab. Along the way, there are jokes about Ethernet‑cable hair, data having weight, and why the best layover is the one where your SSH tunnel actually connects. If you're curious about the recent Linux vulnerabilities and the ABCs of CVEs, don't miss SUDO Show 76, where they break it all down in a fun and informative way. Connect with the Hosts on Discord: Matt – @Dark1ltg Wendy – @Wendy.sh Nate – CubicleNate.com @CubicleNate Bill – @ctlinux on Mastodon Special Guest: Bill.

Algorithms + Data Structures = Programs
Episode 286: GPU Profiling with NVIDIA Nsight Compute (NCU)

Algorithms + Data Structures = Programs

Play Episode Listen Later May 15, 2026 36:44 Transcription Available


In this episode, Conor and Bryce chat with Marco Franzreb Salgado about profiling GPU code with NVIDIA Nsight Compute (NCU).Link to Episode 286 on WebsiteDiscuss this episode, leave a comment, or ask a question (on GitHub)SocialsADSP: The Podcast: TwitterConor Hoekstra: LinkTree / BioBryce Adelstein Lelbach: TwitterAbout the Guest:Marco is a software engineer at NVIDIA, where he works on improving the nvCOMP library, which offers fast GPU implementations of multiple data compression formats. For the past couple of months he has been working on a GPU implementation of the rotate algorithm.Show NotesDate Recorded: 2026-05-05Date Released: 2026-05-15ADSP Episode 237: Thrust with Jared HoberockADSP Episode 284: GPU RotateADSP Episode 285: GPU Rotate (Part 2)NVIDIA CCCLNVIDIA nvCOMPNVIDIA Nsight SystemsNVIDIA Nsight ComputeNVIDIA CuTe DSLNVIDIA CUDA TilecudaMemCopyAsyncPERF WARS: EPISODE IHoogle Translate partitionSingeliADSP Episode 97: C++ vs Carbon vs Circle vs CppFront with Sean BaxterIntro Song InfoMiss You by Sarah Jansen https://soundcloud.com/sarahjansenmusicCreative Commons — Attribution 3.0 Unported — CC BY 3.0Free Download / Stream: http://bit.ly/l-miss-youMusic promoted by Audio Library https://youtu.be/iYYxnasvfx8

Lend Academy Podcast
Fixing the Broken Appraisal Model in Asset-Backed Lending With Thomas Galbraith, CEO of Barkr

Lend Academy Podcast

Play Episode Listen Later May 14, 2026 27:55


Thomas Galbraith is the CEO and co-founder of Barkr, an AI-driven valuation platform for asset-backed lending. He spent his early career in high net worth insurance at AIG and AXA, where he grew comfortable with the challenge of pricing hard-to-value assets. That thread ran through every role he held until it crystallized into a company built around a simple but structural problem: in asset-backed lending, appraisers give you a price and then spend the rest of their report telling you they're not responsible for it. Barkr is built to change that.What We CoveredThomas's background in high net worth insurance at AIG and AXAHow a common thread across luxury assets led to founding BarkrStarting with fine art and private jets before expanding to other asset classesThe two-part failure in traditional appraisals: accuracy and absence of liabilityHow Barkr pairs an AI valuation with a contractual performance warrantyThe progression from Lloyd's of London to AXA to Munich Re$2 billion in covered valuations and what patience actually means in this businessGPUs as a surprisingly durable and long-lived collateral asset classHow Barkr finds clients, from pavement pounding to Nvidia referralsMonthly mark-to-market on hard assets throughout a loan's lifeBuilding a domain-specific LLM with human review in the loopPlans to build an in-house insurance vehicle to unlock capacityKey TakeawaysTraditional appraisal firms hedge their liability by design. Page one is the price; the rest of the report is the disclaimer. Barkr's contractual warranty flips that model by standing behind the number.Barkr's data on GPU durability challenges the conventional narrative. Chips five and seven years old are still generating revenue and still have meaningful resale value, which changes the risk calculus for lenders considering AI infrastructure as collateral.Augmenting, not replacing, is the right positioning for valuation technology. Barkr actively encourages clients to keep using their existing appraisers and treats third-party appraisals as additional data inputs that improve their own accuracy.Building a reinsurance relationship takes years. Barkr worked through Lloyd's, then AXA, before landing Munich Re, and each step required demonstrating proof of concept at the prior level first.About Thomas GalbraithThomas Galbraith is the CEO and co-founder of Barkr. He began his career in high net worth insurance at AIG and AXA before founding Barkr to bring accountability and AI-driven accuracy to asset valuation in the lending market. Barkr has covered approximately $2 billion in valuations across art, private jets, vehicles, and GPUs.Connect with Fintech One-on-One:Tweet me @PeterRentonConnect with me on LinkedInFind previous Fintech One-on-One episodes

The Information's 411
Apple Explores Ways to Welcome AI Agents in App Store, Cerebras IPO Winners, Modals' Raise

The Information's 411

Play Episode Listen Later May 14, 2026 38:13


Apple reporter Aaron Tilley joins TITV Host Akash Pasricha to discuss Apple's internal efforts to safely incorporate AI agents into the App Store while balancing security risks from "vibe coding" apps. We also look at the big winners of the Cerebras IPO with Julia Hornstein, including OpenAI and Foundation Capital, and get an update from Rocket Drew on the closing arguments in Elon Musk's trial against OpenAI. Finally, Stephanie Palazzolo explores Modal's $4.5 billion valuation in the GPU reselling market and we get into Marathon Management Partners' Gokul Rajaram's prediction that OpenAI's advertising business could rival Google and Meta.Articles discussed on this episode: https://www.theinformation.com/articles/apple-explores-ways-welcome-ai-agents-app-storehttps://www.theinformation.com/articles/startup-modal-talks-raise-4-5-billion-valuation-revenue-surgeshttps://www.theinformation.com/articles/cerebras-ipo-winners-include-foundation-benchmark-openaiSubscribe: YouTube: https://www.youtube.com/@theinformation The Information: https://www.theinformation.com/subscribe_hSign up for the AI Agenda newsletter: https://www.theinformation.com/features/ai-agendaTITV airs weekdays on YouTube, X and LinkedIn at 10AM PT / 1PM ET. Or check us out wherever you get your podcasts.Follow us:X: https://x.com/theinformationIG: https://www.instagram.com/theinformation/TikTok: https://www.tiktok.com/@titv.theinformationLinkedIn: https://www.linkedin.com/company/theinformation/Chapters:00:00 - Introduction01:13 - Apple Exploring AI Agents for App Store Safety09:40 - Cerebras IPO: Benchmark and Foundation Capital's Big Wins15:17 - Musk v. OpenAI: The "Jackass Trophy" and Closing Arguments18:56 - Modal Raising at $4.5B Valuation Amid GPU Crunch25:37 - Gokul Rajaram on OpenAI's $100B Advertising Opportunity

Forbes Daily Briefing
Sometimes You Don't Want A GPU: Groq Cofounder Explains Whirlwind Deal With Nvidia

Forbes Daily Briefing

Play Episode Listen Later May 14, 2026 7:10


Last winter, Groq cofounder and CEO Jonathan Ross walked into a meeting with Nvidia CEO Jensen Huang with a pitch for the companies' tech to work together. He now describes the synergy with a logistics analogy: stop building AI data centers as if every workload wants the same hardware. Training is bulk hauling; inference is last-mile delivery. GPUs can do both, but using the 18-wheeler even when you just need a van can be a lot slower. So: Nvidia's general-purpose GPUs are the big trucks. Groq's specialized chips—LPUs, or language processing units, designed to run models fast—are the smaller vans. “If you were building out a logistics network for the entire United States, and I told you your two options were all 18-wheelers or just delivery vans, which one would you pick?” Ross said. “The best answer is both.”  Ross wasn't just pitching a worldview. He wanted Nvidia's permission to buy around 100,000 Blackwell chips, likely worth billions. Huang grilled him on the technical details, and then the meeting ended.  When Huang called back three days later, Ross expected a discussion about his GPU purchase order. Instead, the Nvidia CEO cut to the chase. “We should probably move really fast,” Ross recalled him saying. Learn more about your ad choices. Visit megaphone.fm/adchoices

The AI with Maribel Lopez (AI with ML)
Four Types of AI Agents With Dell's John Roese. Most Enterprises Are Only Building One

The AI with Maribel Lopez (AI with ML)

Play Episode Listen Later May 13, 2026 24:00


Dell's CTO built a 4-category agent framework from real production deployments. Most enterprises are ignoring two of the categories that matter most.Full Show NotesEnterprise leaders are mapping AI agents to org charts — building digital employees, agentic teams, AI workers — and then wondering why the results fall short. Dell's Global CTO John Roese has been running agents in production long enough to know exactly why that framing fails, and what to do instead.In this episode, Roese shares a framework Dell developed from actual production deployments, not pilots. It identifies four categories of AI agents defined by two dimensions: how much autonomy you grant the agent, and how complex the underlying process is. Most enterprises are focused on one category. Two of the four are widely overlooked — and they may represent the fastest path to measurable ROI.This is a practical, grounded conversation about where agents are actually delivering value today, how to think about infrastructure cost in the context of agent economics, and why the sequence in which you deploy agents matters as much as which agents you build. If your organization is trying to move from AI experimentation to production, this episode is required listening.3. Chapter titles:[00:00] — Introduction: Dell's dual role as tech vendor and enterprise AI user[01:38] — Why the org chart model for agents fails[03:12] — Decoupling human capacity from work capacity for the first time[04:23] — The two-by-two framework: autonomy vs. process complexity[06:14] — Productivity agents: what most enterprises already have[07:00] — Hygiene agents: the overlooked category that fixes foundational data problems[08:01] — The CRM data example: why every CRM is inaccurate and how agents fix it[10:05] — Latent infrastructure capacity: running agents in GPU white space to cut costs to cents[13:53] — Facilitation agents: removing entropy from complex cross-functional workflows[17:30] — The sequencing insight: hygiene and facilitation as the path to expert agents[19:24] — Why coordination agents aren't agentic bosses — and where human control actually lives[22:21] — Roese's closing advice: become literate, pick a few, get them into production4. Guest BioJohn Roese is the Global Chief Technology Officer and Chief AI Officer at Dell Technologies, where he is responsible for technology strategy, AI deployment, and research and development across the company. He has held senior technology leadership roles at Nortel, Enterasys Networks, Broadcom, and EMC. At Dell, he operates at a rare intersection: leading AI strategy for a major technology vendor while also deploying AI internally at enterprise scale — which means his frameworks are tested against real production constraints, not just market positioning.LinkedIn: linkedin.com/in/johnroeseDell Technologies: dell.comAbout This PodcastAI with Maribel Lopez is a podcast for enterprise technology leaders navigating AI adoption, agentic systems, AI infrastructure, and AI governance. Host Maribel Lopez covers enterprise technology and advises CIOs, CDOs, CMOs, and technology vendors on how to move from AI experimentation to measurable business outcomes. New episodes published bi-weekly.Subscribe on your platform of choice: buzzsprout.com/1947446

The Rundown
Trump Brings Corporate Entourage to China, Amazon Pivots AI Shopping Strategy

The Rundown

Play Episode Listen Later May 13, 2026 10:52


Market update for Wednesday May 13, 2026Check out the Public app for incredible investing tools and to support the show (LINK)Follow us on Instagram (@TheRundownDaily) for bonus content and instant reactions.In today's episode, Zaid covers:PPI inflation surges 6%, blowing past expectationsTrump lands in Beijing with Jensen Huang, Elon Musk, Tim Cook and moreAmazon kills Rufus chatbot, launches Alexa for ShoppingWolfspeed surges 23% on power chip hypeBirkenstock misses earnings as Iran war hits sandal salesYou'll soon be able to trade futures tied to AI computing power and GPU rental prices

The Six Five with Patrick Moorhead and Daniel Newman
Anthropic at $1.2 Trillion, AMD's Blowout Quarter, and the PE-Backed AI Enterprise Play | Ep. 304

The Six Five with Patrick Moorhead and Daniel Newman

Play Episode Listen Later May 11, 2026 65:08


Patrick Moorhead and Daniel Newman dig into the week's biggest moves in enterprise AI: Anthropic and OpenAI launching PE-backed enterprise JVs on the same day, Anthropic filling its compute gap with SpaceX's Colossus, Cerebris filing for a $3.5 billion IPO, NVIDIA going deep on co-packaged optics with Corning, and a full IBM Think and ServiceNow recap. Plus, for The Flip, hosts debate whether Anthropic, at $1.2 trillion, is the most important company in enterprise tech. The handpicked topics for this week are: 1. Anthropic and OpenAI Launch PE-Backed Enterprise JVs on the Same Day — Both companies announced private equity joint ventures, with OpenAI backed by Bain, Brookfield, and Advent, and Anthropic partnering with Blackstone, Goldman Sachs, Apollo, and General Atlantic. Daniel's read is that this is fundamentally a distribution play, using private equity portfolio companies as a deployment channel for AI at scale. Pat sees it as the clearest admission yet that enterprise AI cannot be self-implemented at scale without specialized consulting support, and flags that mid-tier systems integrators (SIs) could get cut out of the middle. (The Decode) 2. Anthropic Signs Massive Compute Deal with SpaceX Colossus — Anthropic urgently needed compute and SpaceX had 300 megawatts and 220,000 GPUs sitting at Colossus One in Memphis without enough business to fill them. Pat's take is blunt: this move is pragmatic. Anthropic needs it, xAI has it. Daniel adds that Dario himself said they planned for 10x growth and got 80x, and this deal is the fast backfill that reality demanded. The side note both hosts flag: Anthropic is running on H100s, H200s, and B200s, which puts the whole "Anthropic only runs on Trainium and TPUs" narrative to rest. (The Decode) 3. Cerebris Files for a $3.5 Billion IPO at $26.6 Billion Valuation — This marks their second attempt at an IPO after pulling the first filing. The architecture is genuinely unique, a complete wafer with massive on-chip SRAM and interconnects built directly onto the wafer rather than copper or photonics. Pat calls it the first credible Western alternative for AI inference. Daniel's framing cuts through: you do not have to beat NVIDIA to sell right now. You just need to have availability. The more interesting headline, both hosts agree, is that Sam Altman and Greg Brockman are angel investors, which adds fuel to the ongoing OpenAI lawsuit. (The Decode) 4. NVIDIA and Corning Announce $500 Million Optical Partnership — Three new US factories, co-packaged optics for Vera Rubin, and a supply chain strategy that mirrors what NVIDIA did with Coherent. Pat's context: this is vertical integration through investment rather than acquisition. Daniel's observation is that the pace of movement toward co-packaged optics is accelerating faster than anyone expected, and his "rule of and" applies here too. Copper is not going away. Optics are being added on top because the data volumes moving across these racks are outrunning what copper alone can handle. US manufacturing in North Carolina and Texas is a strategic bonus. (The Decode) 5. IBM Think 2026: Day Zero, Sovereign Core, and the Quantum Plus AI Bet — Pat moderated on stage with CEO Arvind Krishna and calls this IBM's best showing in five years. Arvind opened with the AI divide, the gap between companies still running POCs and companies already in production, and framed where IBM sits as day zero, not because nothing has happened, but because enterprise AI deployment at scale is still so early. Daniel's biggest takeaways: watsonX Orchestrate updates, Sovereign Core going GA with policy at runtime, and the Confluent acquisition potentially being IBM's most important asset since Red Hat, given that 40% of Fortune 500 companies run on it and real-time streaming data is foundational to agentic systems. Both hosts land on quantum plus AI as IBM's next inflection moment. (The Decode) 6. ServiceNow Knowledge 2026: Enterprise SaaS 2.0 is Emerging — Daniel got there on day three of the event and noted the conference was densely packed. His observation: enterprises have not gotten the memo from Wall Street that SaaS is supposedly dead. His emerging thesis is that middleware could make a comeback for AI, with companies needing a layer that lets agents work across any infrastructure, any app, and within the rules of their specific business. Pat agrees and adds that the growth question is about mix, not survival. (The Decode) 7. The Flip: Is Anthropic at $1.2 Trillion the Most Important Company in Enterprise Tech? — Daniel took the affirmative citing that Claude Code is deeply entrenched in developer workflows. Anthropic went from $9 billion to $45 billion ARR in months. Every major hyperscaler is both a customer and an investor. The PE JVs are turning verticals into Anthropic engines. Dario said they planned for 10x and got 80x. Pat's counter: the enterprise trust gap is real after what Anthropic pulled on pricing and performance. Microsoft has 2 billion users across 365, Azure, and Copilot. NVIDIA is the infrastructure Anthropic runs on. And workforce replacement, which is how Anthropic extracts its terminal value, is not arriving as fast as the valuation suggests. In reality, both hosts admit their notes looked almost identical. (The Flip) 8. AMD — Lisa Su guided AI data center growth up from 60% to 80%. With OpEx growing 83%, net income up 95%, free cash flow ripping, and CPUs growing at nearly 40% without price increases, Pat reads this as unit market share gains coming soon. Daniel's framing: AMD is now a two-headed juggernaut with CPUs and GPUs for the data center. And Helios has not even started shipping yet. Both hosts take a victory lap for previously calling this one. (Bulls and Bears) 9. Palantir — Triple beat on revenue, EPS, and forward guidance. Rule of 40 at 145%. Government revenue up 84%, 47 deals over $10 million, and the largest guidance raise in the company's history. Daniel's take: Palantir is redefining the category entirely. It's not a software company in the Salesforce or ServiceNow sense. It's technology, plus ontology, plus people, deployed at the deepest layers inside governments and enterprises. Pat adds that the four deployed FTE model lets them stand up AIP POCs within a week, which is why they are winning business at this pace. (Bulls and Bears) 10. ARM — AGI processor demand doubled from $1 billion to $2 billion within 45 days. Record revenue, strong pipeline, royalty growth at 21% for the full year. The stock ripped after hours, then sold the next day when management confirmed only enough supply for $1 billion of that $2 billion demand. Pat's read: 50% CPU market share with hyperscalers at the core level is the most underdiscussed signal on the call. Daniel adds that the worry about ARM competing with its own customer base in custom silicon has been quietly swept away by the sheer volume of compute demand. (Bulls and Bears) 11. Supermicro — A board member allegedly used a hairdryer to remove labels from GPU boxes being shipped to China. Approximately 20% of their revenue has reportedly been illegally shipped to China. They beat on EPS and Q4 guide but missed Q3 revenue versus consensus. Stock still ripped 18%. Daniel's take: if you are selling picks and shovels during a gold rush and you are this messed up, he cannot imagine owning it with the overhang that is building. (Bulls and Bears) 12. Lattice Semi and Coherent — Lattice revenue up 42%, back into growth, guiding to 50% year-on-year at midpoint. The AMI acquisition at $1.65 billion doubles their serviceable market from $6 billion to $12 billion and puts them inside every AI server on the planet at the BIOS and platform firmware layer. Pat calls the timing right: core financials crushing it, time to make a move. Coherent printed 21% year-on-year growth, 55% EPS growth, margins expanding, debt coming down, entered the S&P 500, and sits at the center of the co-packaged optics trend that is accelerating. Pat's choke point note: Indium phosphide capacity is the constraint. Six-inch fabs are doubling capacity in 2026, a quarter ahead of plan, and competitors are still ramping their transitions. (Bulls and Bears) Want the full breakdown from IBM Think and ServiceNow Knowledge, and check out our on-the-ground coverage linked in the show notes. Be part of our community. Hit that subscribe button and let us know what you want us to cover next week in the comments. Intro Pat on Stage at IBM Think https://x.com/PatrickMoorhead/status/2051381046537601101?s=20 The Decode OpenAI and Anthropic Both Launch PE-Backed Enterprise Services JVs on the Same Day — The Palantir FDE Model Goes Mainstream https://www.bloomberg.com/news/articles/2026-05-04/openai-finalizes-10-billion-joint-venture-with-pe-firms-to-deploy-ai https://techcrunch.com/2026/05/04/anthropic-and-openai-are-both-launching-joint-ventures-for-enterprise-ai-services/ https://www.semafor.com/article/05/04/2026/openai-anthropic-ramp-up-enterprise-push Anthropic and SpaceX Sign Massive Compute Deal — Full 300MW / 220,000 GPU Colossus 1 Memphis Data Center Plus Exploration of Multi-Gigawatt Orbital AI Compute https://www.cnbc.com/2026/05/06/anthropic-spacex-data-center-capacity.html https://www.bloomberg.com/news/articles/2026-05-06/anthropic-inks-computing-deal-with-spacex-to-meet-ai-demand https://www.tomshardware.com/tech-industry/artificial-intelligence/musks-spacex-has-rented-out-access-to-its-supercomputers-220-000-nvidia-gpus-and-300-megawatts-of-ai-compute-power-to-rival-anthropic Cerebras Files for $3.5B IPO at $26.6B Valuation — The First Major AI Chip IPO of 2026 https://www.cnbc.com/2026/05/04/cerebras-ipo-ai-chipmaker.html https://theaiinsider.tech/2026/05/06/cerebras-systems-eyes-3-5b-in-largest-tech-ipo-of-2026-on-strength-of-ai-chip-demand/ https://www.briefs.co/news/ai-chipmaker-cerebras-just-filed-for-a-3-5-billion-ipo/ NVIDIA and Corning Announce Game-Changing Optical Partnership — $500M Investment, 3 New U.S. Factories, and Co-Packaged Optics for Vera Rubin and Beyond https://www.corning.com/worldwide/en/about-us/news-events/news-releases/2026/05/nvidia-and-corning-announce-long-term-partnership-to-strengthen-us-manufacturing-for-ai-infrastructure.html https://www.cnbc.com/2026/05/06/nvidia-corning-optical-factories-nc-texas-ai.html https://www.wsj.com/tech/nvidia-corning-form-partnership-to-expand-fiber-optic-manufacturing-17f525de https://kfgo.com/2026/05/06/corning-partners-with-nvidia-to-expand-us-fiber-optic-output-for-ai-growth/ IBM Think 2026 Boston — Watsonx Orchestrate Next-Gen, Confluent Real-Time Data, IBM Concert, and Sovereign Core Define IBM's Agentic Operating Model https://newsroom.ibm.com/2026-05-05-think-2026-ibm-delivers-the-blueprint-for-the-ai-operating-model-as-the-ai-divide-widens https://www.ibm.com/new/announcements/ibm-announcements-at-think-2026 https://www.instagram.com/reel/DX42DlrglOs/ ServiceNow Knowledge 2026 Las Vegas https://www.servicenow.com/events/knowledge.html https://newsroom.servicenow.com/press-releases/details/2026/Cohesity-and-ServiceNow-Deliver-Real-Time-Recovery-for-Enterprise-AI-Agents/default.aspx https://www.cnbc.com/2025/09/04/nvidia-backed-cohesity-eyes-2026-ipo-with-valuation-rivaling-17-billion-rubrik.html   The Flip: Anthropic at $1.2T Now the Most Important Company in Enterprise Tech — More Important Than NVIDIA, Microsoft, or OpenAI FOR: Dual-hyperscaler compute anchor (Amazon $33B + Google $40B = $73B) is structural — unmatched https://futurumgroup.com/insights/anthropics-gigawatt-scale-tpu-deal-with-broadcom-creates-a-structural-advantage/ Constitutional AI safety positioning wins regulated industries https://www.anthropic.com/news/anthropic-nec-japan-ai-engineering-workforce $900B valuation surpasses OpenAI ($852B) at faster revenue growth and lower burn rate https://techcrunch.com/2026/04/30/anthropic-potential-900b-valuation-round-could-happen-within-two-weeks/   AGAINST: NVIDIA still controls the substrate — every Anthropic dollar of revenue requires NVIDIA inference at some layer https://www.cnbc.com/2026/04/27/nvidia-just-hit-an-all-time-high-why-some-think-a-rally-is-just-getting-started.html Microsoft has the enterprise distribution — 365 + Azure + Copilot reach >2 billion users https://www.marketbeat.com/originals/microsofts-maia-200-the-profit-engine-ai-needs/ $900B valuation is venture marketing — the IPO will reset the number https://www.semafor.com/article/05/04/2026/openai-anthropic-ramp-up-enterprise-push   Bulls & Bears: AMD Q1 2026 — Revenue $10.3B (+38% YoY), MI300X Data Center GPU Demand Drives Stock +20% on the Print https://ir.amd.com/news-events/press-releases/detail/1284/amd-reports-first-quarter-2026-financial-results https://www.cnbc.com/2026/05/05/amd-q1-2026-earnings-report.html https://finance.yahoo.com/markets/stocks/articles/amd-q1-2026-earnings-revenue-203331768.html Palantir Q1 2026 — Revenue +85% YoY, US Commercial +133%, Rule of 40 Score Hits 145%; Largest Guidance Raise in Company History https://investors.palantir.com/files/Palantir%20-%20Q1%202026%20Business%20Update.pdf https://www.reddit.com/r/PLTR/comments/1t3t0me/palantir_reports_q1_2026_us_revenue_growth_of_104/ https://finance.yahoo.com/markets/stocks/articles/palantir-technologies-inc-q1-2026-002218719.html https://semiconalpha.substack.com/p/palantir-q1-2026-rewriting-the-rule Arm Holdings Q4 FY2026 — Record $1.49B Quarter, Full-Year Revenue Crosses $4.92B, $2B AGI CPU Pipeline; Stock +16% After Hours https://finance.yahoo.com/markets/stocks/articles/arm-q4-earnings-call-highlights-225942093.html https://www.stocktitan.net/sec-filings/ARM/6-k-arm-holdings-plc-uk-current-report-foreign-issuer-7e9ca9ac7dda.html https://semiconalpha.substack.com/p/arm-q4-fy2026-record-quarter-2-billion Super Micro Computer Q3 FY2026 — Revenue $10.2B (+123% YoY), Strong Q4 Guide; Stock +18% AH on First Earnings Call Since Co-Founder Indictment Drama https://www.cnbc.com/2026/05/05/super-micro-smci-q3-earnings-report-2026.html https://www.stocktitan.net/sec-filings/SMCI/8-k-super-micro-computer-inc-reports-material-event-e70b2f8b3cb7.html https://www.instagram.com/reel/DX42DlrglOs/ Lattice Semiconductor Q1 2026 — Beat-and-Raise Quarter ($170.9M, +42% YoY) Paired With $1.65B AMI Acquisition That Doubles Lattice's SAM to $12B https://www.stocktitan.net/sec-filings/LSCC/8-k-lattice-semiconductor-corp-reports-material-event-642a862b2bf9.html https://www.ami.com/resources/ami-announces-agreement-to-be-acquired-by-lattice-semiconductor/ https://www.linkedin.com/posts/patmoorhead_lattice-semiconductor-posts-beat-and-raise-activity-7457411226944425984-xA8T Coherent Q3 2026 Earnings https://www.msn.com/en-us/money/companies/coherent-cohr-tops-revenue-expectations-in-q3-as-ai-demand-accelerates-shares-decline/ar-AA22Bz24?ocid=finance-verthp-feeds  

The Cloudcast
AI, Data Centers, and the Power Crunch

The Cloudcast

Play Episode Listen Later May 10, 2026 33:39


SUMMARY: We  explore one of the most overlooked bottlenecks in the AI boom: energy and infrastructure and  why power availability is becoming the limiting factor.GUEST: Wannie Park, Founder/CEO of PADO AISHOW: 1026SHOW TRANSCRIPT: The Reasoning Show #1026 TranscriptSHOW VIDEO: https://youtu.be/satMQRxKQC8SHOW SPONSORS:ShareGate - ShareGate Protect. Microsoft 365 Governance, we got this!Nasuni - Activate your data for AI and request a demoSHOW NOTES:1. AI's Hidden Constraint: PowerAI growth is no longer limited only by GPUs and computePower generation, cooling, and grid interconnects are emerging as major bottlenecksData centers could account for 10–12% of North American power demand in coming years2. Why Data Centers Are Being ReimaginedTraditional data centers were built for enterprise IT, not AI-scale workloadsAI infrastructure introduces:Massive power density needsAdvanced cooling challenges3. The Grid Wasn't Built for AIUtilities are designed around peak demand scenariosMost grids run well below peak capacity most of the timeAI workloads create volatile and unpredictable consumption patternsLong interconnection timelines are pushing companies toward alternative infrastructure models4. GPU Utilization Is Surprisingly LowGPU clusters are often underutilized because of:Scheduling inefficiencies, Cooling limitations, SLA constraintsEffective GPU utilization may be as low as 12–13% in some environments5. Cooling as a Major Optimization LayerLegacy data centers often cool entire zones inefficientlyPado AI alignsAI workloads, Cooling systems, Power allocationWorkload-aware orchestration helps optimize cooling and compute efficiency6. The Rise of “Compute Forecasting”Pado forecasts compute demand instead of energy demandThe platform models:GPU workloads, Power consumption, Cooling requirements, SLA prioritiesGoal: maximize “compute per megawatt”7. AI Workloads Become Time-AwareAI providers may increasingly:Shift workloads to off-peak periodsIncentivize delayed non-urgent jobsDynamically balance compute demandUsers are already seeing variable inference latency in real-world AI systems8. Sustainability vs Reliability vs ProfitabilityOperators must balance:Uptime expectations, Infrastructure costs, Sustainability goalsRenewable adoption is growing, but reliability still drives investment in natural gas and battery-backed systems9. Brownfield vs Greenfield OpportunitiesPado AI is focused primarily on existing (“brownfield”) data centersExisting enterprise infrastructure can often be extended and optimized instead of rebuiltEnterprises may gain significant AI capability without hyperscale GPU deploymentsFEEDBACK?Email: show @ reasoning dot showBluesky: @reasoningshow.bsky.socialTwitter/X: @ReasoningShowInstagram: @reasoningshowTikTok: @reasoningshow

PC Perspective Podcast
Podcast #867 - AMD Stock Jump, Steam Controller, Don't Worry About 32GB for Windows 11, HOF lives!, DDR6, PCIe8, and MORE

PC Perspective Podcast

Play Episode Listen Later May 8, 2026 85:09


Let me start off by saying that we should have made a "5309" reference, a miss on our part.AMD has a very good financial report, let the Josh take wash over you.Steam controller launches, sells out, and you probably don't have one.32GB was the minimum.  Then it wasn't.  Microsoft, so trustworthy.Also plain text never hurt anyway.And so much more!Thanks to our sponsor this week: Zapier!Bring the power an AI to any workflow - don't just talk about it:  ACT! Check out Zapier and their orchestration platform.Timestamps:0:00 Intro1:19 Patreon3:09 Food with Josh5:11 Josh presents AMD financials12:19 Steam Controller news and Machine rumor14:46 Microsoft wants you to forget not to worry18:16 DDR6 development starts19:06 PCI Express 8.023:54 be quiet! Dark Rock 6 coolers announced26:09 We apologize for being wrong about GALAX29:10 Lisuan 7G100 is a GPU launching this month31:50 Jeremy discovers the cheap turntable market35:10 Ad 37:03 (In)Security Corner52:02 Gaming Quick Hits1:03:16 Josh put his racing corner segment here1:13:14 Picks of the week1:24:02 Outro ★ Support this podcast on Patreon ★

The MacRumors Show
193: Is Apple DOWNGRADING the iPhone 18 Due to Memory Shortage?

The MacRumors Show

Play Episode Listen Later May 8, 2026 43:19


On this week's episode of The MacRumors Show, we talk through how the global memory shortage is forcing Apple's hand across multiple key products, killing configurations, delaying launches, and prompting spec decisions that would have seemed unlikely a year ago.The pressure originates outside Apple's control. JPMorgan analysis cited by the Financial Timesfound that memory could account for as much as 45% of an iPhone's component costs by 2027, up from around 10% today. Companies like Nvidia are reportedly outbidding consumer electronics makers for limited DRAM supply from Samsung, SK Hynix, and Micron, while cloud firms are locking in capacity with multi-billion-dollar upfront commitments. Apple, which buys memory for roughly 250 million iPhones per year, has shifted from a position where it could dictate terms to one where it must compete for supply, and component prices are being driven up as a result.The consequences are already visible in the Mac lineup. Apple last week removed the Mac mini's 256GB storage option, pushing its starting price from $599 to $799. Days later, it eliminated Mac mini models with 32GB and 64GB of RAMand stripped the M3 Ultra Mac Studio to a single 96GB configuration, with delivery estimates for remaining Studio models at 9 to 10 weeks. The ‌Mac Studio‌ had already lost its 512GB memory option in March, and multiple configurations became entirely unavailable in April. On Apple's April 30 earnings call, CEO Tim Cook acknowledged that both machines would be "hard to get for months to come" and said Apple expects "significantly higher memory costs" in the current quarter. The MacBook Neo was sold out through April and Cook described demand on the earnings call as “off the charts." The ‌MacBook Neo‌ uses binned A18 Pro chips, adopting manufacturing rejects from the iPhone 16 lineup with one GPU core disabled, repurposed rather than discarded to keep costs low enough to hit the $599 price point.Apple's initial production target is believed to be about five to six million units, but demand has since pushed the company to instruct suppliers to prepare for at least 10 million. TSMC's N3E production lines, where the A18 Pro was made, are now running at maximum capacity, with AI-related orders consuming much of the available output. A fresh manufacturing run for the A18 Pro would yield fully functional chips rather than defective ones, raising the per-unit cost before any expedited manufacturing premium is applied.Apple is now said to be weighing up its options for the ‌MacBook Neo‌. The company is purportedly considering cutting the 256GB entry-level model, which would push the effective starting price up by $100 without changing any existing configuration's price, the same mechanism used with the ‌Mac mini‌. Separately, Apple may be considering new color options to soften any price increase.Upcoming products are apparently being reshaped too. Weibo leaker "Fixed Focus Digital" has claimed in a series of posts that the standard iPhone 18 is being downgraded as a cost-cutting measure, with both display and chip specifications affected. Most recently, the leaker said certain parts are interchangeable between the ‌iPhone 18‌ and the lower-cost iPhone 18e. For context, iPhone 17 and iPhone 17e differ meaningfully: the standard model has a larger ProMotion display, Dynamic Island, Ultra Wide camera, five-core GPU, and significantly better battery life, but it looks like there could be fewer differences with the next generation.A follow-up post framed the new split launch strategy, under which the ‌iPhone 18‌ ships in spring 2027 rather than alongside the Pro models in the fall, as a deliberate commercial mechanism to smooth out demand. By extending the ‌iPhone 17‌'s flagship run, Apple is also said to be creating conditions under which a lower-specced successor will be more palatable. The split launch itself has been widely reported since last year, with Ming-Chi Kuo and Nikkei among those to have corroborated it.The launch of the rumored all-new high-end MacBook Pro or "MacBook Ultra" with an OLED display and touchscreen has also apparently slipped. Bloomberg's Mark Gurman has said early 2027 is now looking more likely than late 2026 due to Apple's constrained memory supply cited as a factor.

Lance Roberts' Real Investment Hour
5-7-26 Semiconductors - Bubble or Bust?

Lance Roberts' Real Investment Hour

Play Episode Listen Later May 7, 2026 42:55


Semiconductor stocks continue to lead the market higher as the AI boom drives massive capital spending, soaring valuations, and relentless momentum in names like Nvidia, AMD, Broadcom, and TSMC. But are investors witnessing the next great technology revolution, or the early stages of another market bubble? Lance Roberts & Michael Lebowitz examine the semiconductor trade from both sides: Explosive AI demand, data center growth, and earnings momentum versus stretched valuations, narrowing market breadth, and growing concentration risk inside the S&P 500 and NASDAQ. Key topics include: 0:00 - INTRO 0:56 - Earnings Revisions are Ratcheting Up 4:32 - The Importance and Use of Technical Indicators 11:05 - Tools for Separating Narratives & Emotions from Market Reality 13:07 - Likes for Lebo Merch 14:10 - Semiconductors, AI, and Bubbles 16:58 - GPU's, CPU's, & LLM's 18:38 - The AMD Story & Supple-Demand Imbalances 24:10 - Portfolio Management Keeps You Out of Trouble 25:05 - Taking Profits is Important 26:45 - Where Will Money Go Next? 29:22 - Dissention in the Fed - Cuts or No Cuts? 32:15 - AI is Disinflationary in Nature 34:06 - The Curb Appeal of the Economy 36:18 - Guessing the Outcome - The K-Shape Continues 38:29 -The Technology Displacement Cycle Hosted by RIA Advisors Chief Investment Strategist, Lance Roberts, CIO, w Portfolio Manager, Michael Lebowitz, CFA Produced by Brent Clanton, Executive Producer ------- Do you enjoy our content? Rate us on Google: https://bit.ly/4b9JtEo ------- Watch Today's Full Video on our YouTube Channel: https://youtube.com/live/X0moIzRXOHg ------- Articles Mentioned in Today's Show: "A Robot Economy: Who Gets Rich, Who Gets Left Behind" https://realinvestmentadvice.com/resources/blog/a-robot-economy-who-gets-rich-who-gets-left-behind/ ------- Watch today's "Before the Bell" feature, "How Markets Ignore the Fear" here: https://youtu.be/1BrnxwNgzeY ------- Watch our previous show, "Q&A Wednesday - What's Your Biggest Concern?" https://youtube.com/live/TFn61TpR-Fc ------- * REGISTER for our next Candid Coffee, Saturday, May 16: "Financial Organization Made Simple:" https://streamyard.com/watch/SA6aj2aMdMhf -------- Download Lance's Latest e-book, "Laws of Money & Wealth:"https://realinvestmentadvice.com/ria-e-guide-library/ -------- SUBSCRIBE to The Real Investment Show here: http://www.youtube.com/c/TheRealInvestmentShow -------- Visit our Site: https://www.realinvestmentadvice.com Contact Us: 1-855-RIA-PLAN -------- Subscribe to SimpleVisor: https://www.simplevisor.com/register-new -------- Connect with us on social: https://twitter.com/RealInvAdvice https://twitter.com/LanceRoberts https://www.facebook.com/RealInvestmentAdvice/ https://www.linkedin.com/in/realinvestmentadvice/ #StockMarket #Investing #SP500 #MarketOutlook #TechnicalAnalysis #Semiconductors #AIStocks #Nvidia #TechnologyStocks

The Real Investment Show Podcast
5-7-26 Semiconductors: Bubble or Bust?

The Real Investment Show Podcast

Play Episode Listen Later May 7, 2026 42:56


Semiconductor stocks continue to lead the market higher as the AI boom drives massive capital spending, soaring valuations, and relentless momentum in names like Nvidia, AMD, Broadcom, and TSMC. But are investors witnessing the next great technology revolution, or the early stages of another market bubble? Lance Roberts & Michael Lebowitz examine the semiconductor trade from both sides: Explosive AI demand, data center growth, and earnings momentum versus stretched valuations, narrowing market breadth, and growing concentration risk inside the S&P 500 and NASDAQ. Key topics include: 0:00 - INTRO 0:56 - Earnings Revisions are Ratcheting Up 4:32 - The Importance and Use of Technical Indicators 11:05 - Tools for Separating Narratives & Emotions from Market Reality 13:07 - Likes for Lebo Merch 14:10 - Semiconductors, AI, and Bubbles 16:58 - GPU's, CPU's, & LLM's 18:38 - The AMD Story & Supple-Demand Imbalances 24:10 - Portfolio Management Keeps You Out of Trouble 25:05 - Taking Profits is Important 26:45 - Where Will Money Go Next? 29:22 - Dissention in the Fed - Cuts or No Cuts? 32:15 - AI is Disinflationary in Nature 34:06 - The Curb Appeal of the Economy 36:18 - Guessing the Outcome - The K-Shape Continues 38:29 -The Technology Displacement Cycle Hosted by RIA Advisors Chief Investment Strategist, Lance Roberts, CIO, w Portfolio Manager, Michael Lebowitz, CFA Produced by Brent Clanton, Executive Producer ------- Do you enjoy our content? Rate us on Google: https://bit.ly/4b9JtEo ------- Watch Today's Full Video on our YouTube Channel: https://youtube.com/live/X0moIzRXOHg ------- Articles Mentioned in Today's Show: "A Robot Economy: Who Gets Rich, Who Gets Left Behind" https://realinvestmentadvice.com/resources/blog/a-robot-economy-who-gets-rich-who-gets-left-behind/ ------- Watch today's "Before the Bell" feature, "How Markets Ignore the Fear" here: https://youtu.be/1BrnxwNgzeY ------- Watch our previous show, "Q&A Wednesday - What's Your Biggest Concern?" https://youtube.com/live/TFn61TpR-Fc ------- * REGISTER for our next Candid Coffee, Saturday, May 16: "Financial Organization Made Simple:" https://streamyard.com/watch/SA6aj2aMdMhf -------- Download Lance's Latest e-book, "Laws of Money & Wealth:"https://realinvestmentadvice.com/ria-e-guide-library/ -------- SUBSCRIBE to The Real Investment Show here: http://www.youtube.com/c/TheRealInvestmentShow -------- Visit our Site: https://www.realinvestmentadvice.com Contact Us: 1-855-RIA-PLAN -------- Subscribe to SimpleVisor: https://www.simplevisor.com/register-new -------- Connect with us on social: https://twitter.com/RealInvAdvice https://twitter.com/LanceRoberts https://www.facebook.com/RealInvestmentAdvice/ https://www.linkedin.com/in/realinvestmentadvice/ #StockMarket #Investing #SP500 #MarketOutlook #TechnicalAnalysis #Semiconductors #AIStocks #Nvidia #TechnologyStocks

This Week in Startups
Naval's GP, Ankur Nagpal, Breaks Down The Viral “USVC” Fund | E2284

This Week in Startups

Play Episode Listen Later May 5, 2026 98:10


This Week In Startups is made possible by:Render - render.com/twistVanta - vanta.com/twistNorthwest Registered Agent - northwestregisteredagent.com/twistHave you wanted to invest in venture capital alongside your 401k contributions, but struggled to find a way to place a bet? Search no more, for the AngelList team has created USVC, a new fund that will accept investments of as little as $500 from folks who lack accreditation. USVC's Ankur Napgal swung by the show to chat about investment strategies, access, fees, and just how illiquid the venture-like vehicle will prove to be.Jason and Alex were then joined by Jon Durbin, core contributor at Chutes, and Yash Goenka, co-founder and CEO of Humwork. Chutes is the most valuable Bittensor subnet, focused on aggregating GPUs to offer trustless AI compute. Humwork wants to help bring a human into your agentic workflow to unstick your agent when it runs into a hitch. The show closes with a news lightning round — enjoy!Timestamps:0:00 Intro + Plaud AI sponsor read (Jason demos the NotePin S)2:06 Plaud: If your work depends on conversations — interviews, meetings, calls — you need a Plaud NotePin. You can check it out at https://Plaud.ai/twist and use code TWIST for 10% off!3:50 Episode overview: USVC, Chutes, Humwork7:41 USVC structure: $500 min, no accreditation, quarterly redemptions10:28 Northwest Registered Agent - Get more when you start your business with Northwest. In 10 clicks and 10 minutes, you can form your company and walk away with a real business identity — Learn more at https://northwestregisteredagent.com/twist11:39 Fee structure breakdown: 1% mgmt fee, ~2.5% net expense ratio15:41 USVC portfolio: xAI, Anthropic, OpenAI, Crusoe, Vercel16:17 Strategy: 1/3 emerging managers, 1/3 growth, 1/3 secondaries19:00 $1B cap and path to expanding the fund20:33 Vanta - Compliance and security shouldn't be a deal-breaker for startups to win new business. Vanta makes it easy for companies to get a SOC 2 reports fast. Get $1,000 off for a limited time at https://www.vanta.com/twist28:00 What is sovereign compute? Permissionless GPU networks explained30:10 Render - Find out why 5 million developers are already using the all-in-one cloud platform, Render. Go to https://render.com/twist and apply for the Render Startup Program to get $500-$100,000 in free credits, depending on your stage and backers.44:53 Usage history: 160B tokens/day peak, free-to-paid transition46:41 GPU pricing: from $0.77/hr to $3.50/hr as shortage bites49:13 DeepSeek censorship demo: Taiwan test on DeepSeek chat57:27 Permissionless supply curves: Uber/Airbnb analogy for induced demand59:20 Yash Goenka, co-founder of Humwork (YC S26)1:00:49 Lightning Round: Ryan Cohen wants to buy eBay — Jason's hot take1:12:36 Lightning Round: Cerebras IPO update ($27–36B valuation range)1:14:05 Lightning Round: Spirit Airlines / JetBlue / M&A regulation debate1:19:10 OFF-DUTY: Star Wars: Maul - Shadow Lord1:32:57 Alex's OFF-DUTY: Captain of IndustrySubscribe to the TWiST500 newsletter: https://ticker.thisweekinstartups.comCheck out the TWIST500: https://www.twist500.comSubscribe to This Week in Startups on Apple: https://rb.gy/v19fcpFollow Lon:X: https://x.com/lonsFollow Alex:X: https://x.com/alexLinkedIn: ⁠https://www.linkedin.com/in/alexwilhelmFollow Jason:X: https://twitter.com/JasonLinkedIn: https://www.linkedin.com/in/jasoncalacanisCheck out all our partner offers: https://partners.launch.co/Great TWIST interviews: Will Guidara, Eoghan McCabe, Steve Huffman, Brian Chesky, Bob Moesta, Aaron Levie, Sophia Amoruso, Reid Hoffman, Frank Slootman, Billy McFarlandCheck out Jason's suite of newsletters: https://substack.com/@calacanisFollow TWiST:Twitter: https://twitter.com/TWiStartupsYouTube: https://www.youtube.com/thisweekinInstagram: https://www.instagram.com/thisweekinstartupsTikTok: https://www.tiktok.com/@thisweekinstartupsSubstack: https://twistartups.substack.com

This Week in Startups
Can an AI Agent Legally Own a Company? Christian van der Henst's Wild Experiment| E2283

This Week in Startups

Play Episode Listen Later May 1, 2026 70:05


This Week In Startups is made possible by:Pilot - https://pilot.com/twistShopify - https://shopify.com/twistGrasshopper Bank - https://grasshopper.bank/twistToday's show:An AI agent named Valerie is running a real vending machine in San Francisco — setting prices, ordering inventory, managing a bank account, and posting to Instagram. And it's not just a stunt. We're getting an early look at the future of one-agent companies. There's still work to do to help agents ease into the economy, potentially opening up new startup opportunities.Jason Calacanis and Alex Wilhelm cover a stacked docket: Christian van der Henst demos the Valerie AI vending machine powered by OpenClaw; Robert Myers, CEO of Manifold Labs, breaks down Targon, a confidential GPU compute marketplace running on Bittensor Subnet 4; Jason calls Bitcoin "played out;" Alex is impressed by Anthropic's stunning $900 billion upcoming valuation; and the guys discuss Big Tech's accelerating CapEx spend, Chinese AI models in Congress crosshairs, and the NBA Playoffs.Timestamps:0:00 Intro & sponsor reads (Pilot, Shopify, Grasshopper Bank)1:06 Christian van der Henst: Valerie the AI vending machine demo4:28 Legal structure: Giving an AI agent business ownership via trust7:23 Where agents can and can't operate today10:10 Grasshopper Bank: Time is money. Don't waste either. Go to https://grasshopper.bank/twist and get an exclusive $500 cash bonus just for opening an account.11:48 AI café in Stockholm running on agents18:21 Plaud: If your work depends on conversations — interviews, meetings, calls — you need a Plaud NotePin. You can check it out at https://Plaud.ai/twist and use code TWIST for 10% off!19:05 Robert Myers, Manifold Labs: Targon & Bittensor Subnet 4 interview20:21 What is Bittensor? An "incubator with 128 subnets"21:08 Shopify: Turn those What If's into sales with the ecommerce platform powering millions of businesses. Sign up for your $1-per-month trial today at https://shopify.com/twist25:21 Pricing, utilization caps, and why GPUs are sold out26:38 Who's using Targon? Customers, use cases, and the mom-and-pop data center argument30:06 Pilot: Focus on your product, let Pilot handle your bookkeeping. Pilot provides the most reliable accounting, CFO, and tax services for startups and small businesses. Head to https://pilot.com/twist and get $1,200 off your first year.35:07 Jason explains the annotated.com vision — 15 years in the making39:11 Polymarket: Will Anthropic flip Bitcoin by Dec 31?40:29 Jason's Bitcoin bear case: "It's played out. No incremental buyers."46:05 MicroStrategy / Strategy updates52:37 AI compute demand vs. the fiber overbuild analogy55:55 Congress pressuring startups over Chinese AI models (DeepSeek, Moonshot)57:11 A16z on the geopolitical risk of Chinese AI models1:00:08 Reflection AI — America's open source AI champion (or lack thereof)1:01:26 Off Duty: Knicks blow out Atlanta Hawks 140–89, Jason goes road-trippingSubscribe to the TWiST500 newsletter: https://ticker.thisweekinstartups.comCheck out the TWIST500: https://www.twist500.comSubscribe to This Week in Startups on Apple: https://rb.gy/v19fcpFollow Lon:X: https://x.com/lonsFollow Alex:X: https://x.com/alexLinkedIn: ⁠https://www.linkedin.com/in/alexwilhelmFollow Jason:X: https://twitter.com/JasonLinkedIn: https://www.linkedin.com/in/jasoncalacanisCheck out all our partner offers: https://partners.launch.co/Great TWIST interviews: Will Guidara, Eoghan McCabe, Steve Huffman, Brian Chesky, Bob Moesta, Aaron Levie, Sophia Amoruso, Reid Hoffman, Frank Slootman, Billy McFarlandCheck out Jason's suite of newsletters: https://substack.com/@calacanisFollow TWiST:Twitter: https://twitter.com/TWiStartupsYouTube: https://www.youtube.com/thisweekinInstagram: https://www.instagram.com/thisweekinstartupsTikTok: https://www.tiktok.com/@thisweekinstartupsSubstack: https://twistartups.substack.com