The New Stack Podcast

Share on

The New Stack Podcast is all about the developers, software engineers and operations people who build at-scale architectures that change the way we develop and deploy software. Subscribe to TNS on YouTube at: https://www.youtube.com/c/TheNewStack

The New Stack

Jun 18, 2026 LATEST EPISODE
weekdays NEW EPISODES
29m AVG DURATION
668 EPISODES

Search for episodes from The New Stack Podcast with a specific topic:

Latest episodes from The New Stack Podcast

Gusto Cofounder: An AI agent that runs payroll, HR, and benefits without waiting to be asked

Play Episode Listen Later Jun 18, 2026 28:58

Gusto is betting that small businesses need more than another AI assistant. The company's new product, Gusto Cofounder, is designed to act as a proactive business partner that helps owners manage and grow their companies, drawing inspiration from the traditional mom-and-pop partnership that co-founder and CTOEddie Kimwitnessed growing up. Unlike reactive chatbots, Cofounder can take action across payroll, HR, benefits, scheduling, insurance, and accounting workflows by leveraging data already stored within Gusto. Users interact with the platform through text messages or Slack, while a consent framework ensures access to sensitive payroll and employee data remains tightly controlled. Businesses can grant explicit permissions and gradually increase autonomy as trust is established. The platform also integrates with third-party tools such as Google Workspace, enabling it to gather data, perform calculations, run payroll, and communicate results automatically. Kim said the product was built by a five-person team in just eight weeks using Claude Code, which he believes demonstrates how AI is expanding software creation beyond traditional engineering roles. Looking ahead, Gusto plans to add more integrations and eventually enable customers and developers to share reusable, industry-specific business automations. Learn more from The New Stack around how AI is expanding software creation beyond traditional engineering roles: How AI Is Reshaping Software Engineering: Key Takeaways From DeveloperWeek 2025 AI and the Future of Code: Developers Are Key The Engineer in the AI Age: The Orchestrator and Architect Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

WeAreDevelopers is coming to the US to give unsung developers a bigger voice

Play Episode Listen Later Jun 11, 2026 50:10

WeAreDevelopers, the Berlin-based developer conference founded in 2015, has grown into a major global event, attracting 15,000 developers from over 70 countries each year. In 2026, it expands beyond Europe with new editions in San Jose, California, and Bengaluru, India. Co-founder and CEO Sead Ahmetovic says the conference was created to give developers a stronger voice in an industry where marketers, salespeople, and entrepreneurs often receive more recognition. He believes developers, despite being less vocal, build the products that power the modern world. The event began as a small meetup that quickly gained popularity, filling a gap between highly specialized technical gatherings and broader business-focused conferences. Former GitHub CEO Thomas Dohmke highlights another benefit: giving developers a platform to share the stories behind their work and inspire peers. Discussing the future of software development, Dohmke predicts AI agents will handle much of the coding, while developers focus on managing ideas, prompts, and workflows. Ahmetovic agrees, arguing that developers will remain essential, spending less time typing code and more time thinking, orchestrating, and creating new solutions. Learn more from The New Stack around the latest in developer community growth: How Community Helps Developers Grow Empowering Developers Is Critical to Drive AI Innovation 3 Ways Organizations Can Redefine the Developer Experience Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

california ai europe voice tech berlin conference developers entire san jose software engineers unsung software developers bengaluru tech podcast tech conferences ai engineer new stack

Why MotherDuck refuses to fork DuckDB

Play Episode Listen Later May 27, 2026 27:43

At a recent MCP developer summit, The New Stack spoke with Till Döhmen, AI lead atMotherDuck, about the company's growing role in the evolving DuckDB ecosystem. Backed by investors includingTomasz Tunguz, MotherDuck is commercializing the open-source analytical databaseDuckDBwhile also expanding how employees interact with data through AI agents rather than traditional dashboards. Döhmen emphasized the company's close collaboration withDuckDB FoundationandDuckDB Labs. Because MotherDuck operates what he described as the world's largest fleet of DuckDB databases, the startup regularly pushes the database to its limits and feeds insights back to the core maintainers. Rather than forking DuckDB to create proprietary advantages, MotherDuck instead extends the platform through its existing architecture while contributing core improvements upstream when needed. The conversation highlighted the delicate but productive relationship between venture-backed companies and the open-source projects they commercialize, positioning MotherDuck as another example of startups driving both OSS adoption and strong business growth simultaneously. Learn more from The New Stack around the latest in DuckDB: DuckDB: Query Processing Is King DuckDB: In-Process Python Analytics for Not-Quite-Big Data Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech analytics backed open source fork refuses software engineers oss software developers mcp tech podcast new stack duckdb new stack makers

JetBrains is selling independence as the rest of AI coding picks sides

Play Episode Listen Later May 21, 2026 26:04

JetBrains is positioning itself as the last major independent AI coding-tool vendor in a market increasingly tied to hyperscalers and foundation model labs. Speaking at Google Cloud Next, JetBrains VP of business developmentMikhail Vink argued that competitors such as Microsoft Copilot, Anysphere Cursor, and Windsurfare all tied to either AI labs or cloud providers. By contrast, JetBrains says its independence allows customers to switch freely between models fromOpenAI,Anthropic, andGoogle Cloudwithout being locked into one ecosystem. That flexibility underpins JetBrains' broader AI strategy. Rather than building its own foundation model, the company is focusing on orchestration and governance through JetBrains Central, announced in March as a management layer for AI agents, usage controls, analytics, and consumption-based billing. Vink said the company's profitability, 16 million users, and 300,000 commercial customers from its long-running IDE business have allowed it to remain venture-free and model-neutral. JetBrains argues that as developers increasingly swap between AI models, neutrality may become more valuable than owning the models themselves. Learn more from The New Stack around the latest in AI coding-tools: JetBrains ‘Agentic' AI Agent Helps Automate Coding Tasks JetBrains: AI agents are about to repeat the cloud ROI crisis JetBrains names the debt AI agents leave behind Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

Why Block handed Goose to the Linux Foundation

Play Episode Listen Later May 15, 2026 19:30

What began as an internal developer tool atBlockhas evolved into a broader open-source initiative with industry backing. Goose, Block's AI coding agent, followed a path similar to Amazon's transformation of internal infrastructure intoAmazon Web Services. After deploying Goose companywide, Block open-sourced the tool under a permissive license, leading to rapid adoption across the developer community. But according to Manik Surtani, Office of the CTO, Block and Co Founder of Agentic AI Foundation, early momentum exposed governance challenges. Although Goose was technically open source, Block retained trademark ownership, creating concerns for enterprises seeking truly independent governance. To address this, the team partnered with the creators ofAnthropicand the Model Context Protocol community to establish theAgentic AI Foundationunder the umbrella of theLinux Foundation. Goose, MCP, and Agents.MD became the foundation's initial projects, chosen largely to accelerate the launch of the new organization and create a collaborative ecosystem around agentic AI development. Learn more from The New Stack around the latest in open-source AI: Anthropic extends MCP with a UI framework Why the Linux Foundation adopted MCP, with Jim Zemlin and Mazin Gilbert Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

amazon ai co founders office md cto goose open source handed software engineers software developers mcp linux foundation ai engineer new stack alex wilhelm new stack makers

Fivetran's CPO: closed data stacks won't survive the agent era

Play Episode Listen Later May 13, 2026 22:55

At Google Cloud Next 2026, Fivetran Chief Product Officer Anjan Kundavaram argued that enterprise data systems are unprepared for the scale of AI-driven analytics. Unlike humans, AI agents can generate exponentially more queries, often routing them through the same expensive compute infrastructure. Kundavaram compared it to “using a Lamborghini to mow the lawn.” To address this, Fivetran introduced its “Open Data Infrastructure” vision and a benchmark designed to expose hidden AI workload costs in closed ecosystems. Kundavaram said agents can optimize for cost instead of speed, choosing cheaper compute engines when appropriate — but only in open architectures with multiple options. Closed systems force every query through high-cost paths. He also warned that fragmented data and weak context create a “triple whammy” of poor AI responses, soaring analytics bills, and wasted compute. While many organizations respond by tightening controls, Kundavaram argued the better path is investing in open infrastructure, interoperability, and strong semantic data practices before AI costs spiral further. Learn more from The New Stack around the latest in enterprise data systems: Enterprise AI Success Demands Real-Time Data Platforms AI Agents Are Morphing Into the 'Enterprise Operating System' Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech data survive agent closed lamborghini stacks software engineers tech podcast google cloud next ai engineer matt burns new stack new stack makers

The new FinOps problem isn't cloud bills

Play Episode Listen Later May 12, 2026 28:03

At Google Cloud Next 2026, Finout co-founder and CEO Roi Ravhon and Google Cloud FinOps lead Pathik Sharma discussed how FinOps is rapidly evolving for the AI era. Ravhon argued that while cloud FinOps had a decade to mature, AI economics are forcing the industry to adapt within a year. Unlike traditional cloud workloads, AI costs are unpredictable because token usage varies even for identical prompts, while advanced reasoning models consume significantly more tokens despite falling prices. Both emphasized that effective AI FinOps requires intelligent orchestration, routing workloads to the cheapest capable models instead of defaulting to expensive frontier models. Sharma noted that AI costs extend beyond APIs to GPUs, storage, training, and organizational adoption. They also cautioned against relying solely on LLMs for operational automation. Deterministic systems, observability metrics, and human approvals remain essential guardrails. Ultimately, both stressed that FinOps is primarily an organizational and cultural discipline, recommending newcomers start with the FinOps Foundation before investing in tools. Learn more from The New Stack around the latest in FinOps: Why FinOps Isn't About Saving Money FinOps Foundation's FOCUS 1.2 Expands to SaaS, PaaS Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech focus cloud bills automation saas apis sharma expands software engineers gpus software developers tech podcast finops deterministic google cloud next ai engineer new stack finops foundation

How Microsoft is governing thousands of Kubernetes clusters without manual intervention

Play Episode Listen Later May 7, 2026 25:28

Managing Kubernetes at fleet scale introduces significant complexity, especially as organizations expand from a few clusters to hundreds or thousands across cloud, on-premises, and edge environments. While GitOps remains the dominant model for declarative management, its traditional one-to-one repository-to-cluster approach struggles to handle multi-cluster realities such as global traffic routing, shared secrets, and unified observability. AsStephane Erbrech, Principal Software Engineer at Microsoftexplains, the challenge shifts from deployment to governance—maintaining consistency, security, and compliance across a vast distributed system without manual intervention. This need is amplified by the rise of AI workloads at the edge, where inference is increasingly decentralized. To address these challenges,Microsoft Azure Kubernetes Fleet Managerenables coordinated, staged rollouts across clusters, allowing teams to validate updates in lower-risk environments before production. Supporting this,Cilium Cluster Meshprovides seamless cross-cluster connectivity, enabling workload mobility and efficient resource use, especially for scarce GPU capacity. Together, these tools help modern platform teams manage lifecycle, networking, and orchestration at scale. Learn more from The New Stack around managing Kubernetes at fleet scale: KubeFleet: The Future of Multicluster Kubernetes App Management Why Microsoft is betting on temporary identities to stop autonomous agents from going rogue Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech microsoft thousands manual intervention deployment gpu software engineers governing kubernetes software developers clusters tech podcast principal software engineer ai engineer new stack new stack makers

Why long-running AI agents break on HTTP and how Ably is fixing it

Play Episode Listen Later May 6, 2026 31:31

In this episode ofThe New Stack Makers, Matthew O'Riordan, CEO of Ably, explains how infrastructure originally built for human collaboration is now well-suited for long-running AI agents. While Ably initially resisted positioning itself as an AI company, the rise of agents that reason, call tools, and operate over extended periods revealed a natural fit for its real-time communication platform. O'Riordan highlights the limitations of HTTP for these use cases. While effective for short, request-response interactions, HTTP struggles with persistent, stateful experiences—such as handling dropped connections, multi-device usage, or mid-task interruptions. To address this, a new “durable session” layer is emerging, enabling continuous synchronization between agents and users through shared state, presence, and recovery mechanisms. Ably's solution, AI Transport, augments existing architectures by keeping HTTP for requests while shifting responses to durable sessions. Features like mutable message streams and “live objects” allow seamless reconnection and collaboration. The goal is to provide a drop-in layer that developers can adopt without rethinking their stack—moving beyond traditional pub/sub models. Learn more from The New Stack around Ably and AI Transport: How MCP Uses Streamable HTTP for Real-Time AI Tool Interaction Ably Touts Real-Time Starter Kits for Vercel and Netlify AI Agents Need Help. Here's 4 Ways To Ship Software Reliably Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ceo ai running tech fixing software engineers software developers tech podcast ai engineer new stack ably new stack makers

Why the Linux Foundation adopted MCP, with Jim Zemlin and Mazin Gilbert

Play Episode Listen Later May 6, 2026 32:32

Agentic AI is advancing rapidly, with open-source projects racing to keep pace with real-world deployment. To accelerate progress, the Linux Foundation consolidated key technologies—Model Context Protocol (MCP), Goose, and AGENTS.md—under the newly formed Agentic AI Foundation (AAIF) in late 2025. At the MCP Dev Summit in New York City, Linux Foundation CEO Jim Zemlin and newly appointed AAIF executive director Mazin Gilbert discussed this transition. Zemlin explained that leading both organizations was unsustainable, prompting a careful search for a leader with both technical expertise and collaborative leadership skills. Gilbert now takes on the challenge of guiding AAIF as it shapes the emerging agentic AI ecosystem. While the foundation currently oversees three projects, its broader mission involves defining the future architecture of agent-driven systems—deciding what to build, when, and why. These decisions will influence the trajectory of open-source AI development. The conversation also highlights the importance of open collaboration, funding dynamics, and early adopters in shaping the agentic stack's evolution. Learn more from The New Stack around the latest in open-source projects and The Linux Foundation: Anthropic Donates the MCP Protocol to the Agentic AI Foundation SAFE-MCP, a Community-Built Framework for AI Agent Security Google Donates the Agent2Agent Protocol to the Linux Foundation Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai new york city leadership tech goose adopted software engineers software developers tech podcast linux foundation cncf mazin ai engineer cloud native computing foundation new stack new stack makers

Fresh data has us asking, does AI demand Kubernetes?

Play Episode Listen Later May 1, 2026 23:01

Kubernetes is rapidly emerging as the de facto operating system for AI, with two-thirds of organizations using it for generative AI inference and 82% adopting it in production. Its ecosystem — including tools like Kubeflow — enables organizations to build, scale, and retain control of AI systems through open, community-driven infrastructure. Bob Killen of CNCF and Liam Bollmann-Dodd of SlashData shared insights from recent reports showing that AI success still hinges on strong engineering fundamentals—especially internal developer platforms and overall developer experience. While AI-generated code accelerates development, it shifts bottlenecks to DevOps, reliability, and security, increasing operational complexity. As a result, operator experience and well-defined guardrails have become critical to safely scaling AI. These controls help constrain both human and AI developers, reducing risk while enabling speed. At the same time, organizations are evolving team structures, expanding platform engineering groups to support internal users more effectively. Despite growing complexity, the core lesson remains consistent: open source innovation thrives on people, processes, and collaboration as much as on technology itself. Learn more from The New Stack around the latest in Kubernetes and its emergence as an operating system for AI: Kubernetes and AI: Are They a Fit? How AI Is Pushing Kubernetes Storage Beyond Its Limits Kubernetes and AI Are Shaping the Next Generation of Platforms Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai research tech data fresh next generation open source devops software engineers containers kubernetes software developers tech podcast microservices cloud native cncf ai engineer new stack new stack makers

How SUSE positions itself as the infrastructure layer for the AI era

Play Episode Listen Later Apr 30, 2026 26:53

In this episode ofThe New Stack Makers,Pete Smailsoutlines howSUSEis evolving from its Linux roots into an AI-native infrastructure platform. Speaking atKubeCon + CloudNativeCon Europe 2026, Smails explains the company's strategy to unify AI, containers and virtual machines on a single open, enterprise-ready foundation. Central to this isSUSE Rancher Prime, which enables consistent orchestration across hybrid and multi-cloud environments, alongsideSUSE Virtualizationfor modernizing legacy systems. A key innovation is “Liz,” a context-aware AI agent embedded in Rancher Prime that helps engineers identify vulnerabilities, troubleshoot deployments and interact with infrastructure using natural language. Unlike generic AI tools, Liz understands real-time cluster states and uses Model Context Protocol to deliver actionable insights. Smails emphasizes developer experience as critical to adoption, highlighting Rancher Developer Access for simplified local Kubernetes workflows. Overall, SUSE aims to deliver secure, automated infrastructure that reduces complexity while accelerating cloud-native and AI adoption. Learn more from The New Stack around the latest around SUSE: SUSE Displays Enhanced Enterprise Linux at SECESSION SUSE Launches a Sovereign Premium Support Service for EU Customers Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai infrastructure open source positions linux layer operating systems kubernetes software developers suse ai engineer open platform new stack smails new stack makers

Cut AI token usage by 96%? Here's how AWS Strands Agents does it.

Play Episode Listen Later Apr 29, 2026 28:06

In this episode of The New Stack Makers, AWS developer advocate Morgan Willis demonstrates Strands Agents, an open source agentic framework with rapid adoption since its launch. Using a simple accounting API, she walks through three approaches to retrieving a customer's latest invoice, highlighting how design choices dramatically impact efficiency. The initial method maps each API endpoint to a separate tool, requiring five chained calls and consuming about 52,000 tokens. By shifting to intent-based tools—focused on outcomes rather than individual data operations—the same task is completed in a single call using just 2,000 tokens, improving both efficiency and reasoning. In a third iteration, tools are hosted on a remote MCP server via AWS Agent Core Gateway, with semantic search limiting the agent's toolset to only what's relevant per query, further reducing token usage. Willis emphasizes that narrowly scoped agents outperform general-purpose ones, delivering better speed, accuracy, and context efficiency. Designing smaller, specialized agents with tailored tools is key as tool ecosystems expand. Learn more from The New Stack around the latest with Strands and MCP: AWS Launches Its Take on an Open Source AI Agents SDK What Is MCP? Game Changer or Just More Hype? MCP's biggest growing pains for production use will soon be solved Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

tech designing game changers stands api willis open source aws token usage software engineers software developers mcp tech podcast strands ai engineer new stack morgan willis new stack makers

Why Broadcom is betting on a private cloud comeback

Play Episode Listen Later Apr 28, 2026 23:40

Broadcom's VMware Cloud Foundation (VCF) is evolving from a turnkey infrastructure stack into a modern application platform, balancing simplicity with the flexibility demanded by Kubernetes-driven environments. AtKubeCon + CloudNativeCon Europe 2026, Broadcom leaders highlighted how VCF is adapting to support platform engineering teams, cloud-native workloads, and large-scale operations. A key industry shift is the return to private cloud, driven by data sovereignty concerns and the growing impact of AI. Enterprises are bringing workloads back on-premises while still expecting a cloud-like operating model. Broadcom is responding by prioritizing on-prem stability and aligning closely with open source, reflecting its strong contributions toKubernetesand related projects. Kubernetes is no longer a bolt-on but the core control plane of VCF, enabling unified management of compute, storage, and networking through declarative APIs. At the same time, the distinction between virtual machines and containers is fading. The focus is shifting toward application-centric platforms, where developers interact through consistent abstractions, allowing infrastructure to be provisioned seamlessly behind the scenes. Learn more from The New Stack around the latest around Broadcom: Broadcom ‘Doubles Down' on Open Source, Donates Kubernetes Tool to CNCF Why Broadcom gave Velero to the CNCF Sandbox — and what it means for Kubernetes data protection Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech comeback betting open source apis enterprises software engineers kubernetes software developers broadcom tech podcast platform engineering private cloud ai engineer vcf new stack vmware cloud foundation new stack makers

Why Broadcom gave Velero to the CNCF Sandbox — and what it means for Kubernetes data protection

Play Episode Listen Later Apr 25, 2026 22:59

Broadcom continues to expand its role as a major contributor to cloud-native open source, particularly within the Cloud Native Computing Foundation (CNCF) ecosystem. Its recent donation of Velero—originally developed by VMware—to the CNCF Sandbox reflects a strategic move to foster broader community trust and collaboration. By shifting governance away from vendor control, Broadcom aims to position Velero as a truly community-driven data protection standard for Kubernetes environments, encouraging wider adoption and contribution. At the same time, the company is reinforcing its position as a full-stack Kubernetes provider across both cloud-native and private cloud environments. Despite Kubernetes' dominance, many organizations still struggle with its complexity. Broadcom is addressing this by focusing on lifecycle management, long-term support, and deep integration with existing infrastructure like vSphere. In a podcast recorded at KubeCon + CloudNativeCon Europe 2026, Dilpreet Bindra emphasized that open source success comes not just from code contributions, but also from relinquishing control to empower the broader ecosystem and drive sustainable innovation. Learn more from The New Stack about the latest developments around Velero: Broadcom donates Velero to CNCF — and it could reshape how Kubernetes users handle backup and disaster recovery How AI Search Is Supporting Artistic Freedom Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

tech open source sandbox vmware kubernetes data protection broadcom tech podcast cncf vsphere new stack new stack makers

Why AI engineering needs old-school discipline

Play Episode Listen Later Apr 24, 2026 24:26

In this episode of The New Stack Makers, Nimisha Asthagiri of Thoughtworks explores why many AI initiatives stall between proof of concept and production. A key issue is that organizations focus on speed—asking how to move faster—rather than rethinking what new capabilities AI actually enables. Successful companies take a systems-thinking approach, investing in organizational literacy and aligning teams around meaningful use cases instead of retrofitting AI into existing workflows. Asthagiri highlights that core engineering practices are ফিরে to prominence. As AI-generated code increases, so does the risk of “cognitive debt,” where developers lose understanding of their own systems. To counter this, teams are reviving fundamentals like test-driven development, mutation testing, observability, and zero-trust security, especially as autonomous agents contribute to production code. She also introduces the concept of “dark code”—AI-generated code that may never be used—and argues for more intentional lifecycle management, including ephemeral code. Ultimately, the focus shifts from code itself to specifications, context management, and disciplined engineering practices. Learn more from The New Stack around the latest about system-thinking approaches: System Two AI: The Dawn of Reasoning Agents in Business A practical systems engineering guide: Architecting AI-ready infrastructure for the agentic era Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech engineering old school software engineers software developers tech podcast thoughtworks school discipline ai engineer new stack new stack makers

Jim Bugwadia on why finding a Kubernetes problem is only half the battle for Kyverno users

Play Episode Listen Later Apr 23, 2026 23:06

Graduating within the CNCF marks a major milestone for an open source project, signaling not just technical maturity but strong governance, security practices, and widespread adoption. Kyverno, a Kubernetes policy engine, reached this stage after five years — becoming only the 35th project to progress from sandbox to graduation. As co-founder Jim Bugwadia explains, incubation reflects production readiness and adoption, while graduation validates the project's long-term sustainability and governance rigor. Originally built to help teams manage Kubernetes complexity through declarative policies, Kyverno has evolved alongside the ecosystem. Its shift to the Kubernetes-native Common Expression Language (CEL) and rising demand driven by AI workloads have expanded its user base beyond regulated industries to mainstream enterprises. With over three billion downloads, it underscores the growing need for automated policy enforcement across development, security, and operations teams. Commercially, Nirmata maintains a clear boundary between open source and enterprise offerings, focusing on remediation and advanced management. While only 2–5% of users convert, that small percentage becomes meaningful at Kyverno's scale. Learn more from The New Stack around the latest about Kyverno: Simplify Kubernetes Security With Kyverno and OPA Gatekeeper Using the Kyverno CLI to Write Policy Test Cases Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech users open source graduating software engineers guardrails kubernetes software developers tech podcast commercially half the battle cncf ai engineer policy management new stack new stack makers

How AWS Bedrock is shaping Model Context Protocol

Play Episode Listen Later Apr 22, 2026 31:15

At the MCP Summit in New York City, AWS's Luca Chang, a Bedrock team member and MCP specification maintainer, discussed the rapid rise of the Model Context Protocol (MCP) as a standard for connecting AI models and agents to tools and data. He explained that MCP's development is shaped by a diverse group of maintainers who collaboratively prioritize features, balancing major challenges with smaller enhancements that can unlock creative new capabilities. This breadth of perspectives prevents groupthink but makes prioritization difficult, as many ideas compete for limited bandwidth. Chang highlighted the role of large organizations like Amazon in advancing open source projects. AWS contributions such as Tasks and Elicitations emerged from internal efforts to map cloud services to MCP, revealing gaps in the protocol. Rather than contributing for speed, AWS focuses on real customer use cases, contributing only when clear needs arise. Chang also noted growing demand for MCP servers, while expressing caution about overly specialized, agent-specific implementations that could limit broader interoperability. Learn more from The New Stack around the latest in Model Context Protocol (MCP) becoming a standard for connecting AI models and agents to tools and data: Model Context Protocol: A Primer for the Developers Beyond the vibe code: The steep mountain MCP must climb to reach production https://thenewstack.io/model-context-protocol-evolution/ Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

amazon ai new york city tech model context tasks shaping protocol chang open source aws bedrock software developers mcp tech podcast ai engineer new stack alex wilhelm new stack makers

Why Microsoft is betting on temporary identities to stop autonomous agents from going rogue

Play Episode Listen Later Apr 21, 2026 24:37

AtKubeCon Europe 2026,Jorge Palmaoutlined how Microsoft is advancing AI operations across cloud and edge environments. He demonstrated an agent capable of diagnosing, mitigating, and explaining application issues in minutes, highlighting the growing role of agentic operations in Kubernetes. Palma emphasized that recent progress in tools likeAzure Kubernetes ServiceandAzure Archas made edge AI more practical by bridging cloud and on-prem systems. Kubernetes now acts as the unifying layer, while fleet management automates deployments that previously required manual GitOps workflows. To address fragmentation in inference engines, Microsoft introducedAI Runway, a standardized API that allows teams to swap underlying engines without changing workflows. Security remains a core challenge. Palma advocates for tightly scoped, temporary permissions and policy validation for agents, enforced through tools like the Agent Governance Toolkit. This reflects a broader shift: applying cloud-native principles—portability, abstraction, and policy control—to manage the unpredictable nature of AI workloads. Learn more from The New Stack about the latest around advancing AI operations across cloud and edge environments The Future of AI: Hybrid Edge Deployments Are Indispensable AI Is Coming to the Edge, but It Will Look Different Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai microsoft security betting api temporary palma autonomous identities kubernetes going rogue gitops new stack

As agentic AI explodes, Amazon doubles down on MCP

Play Episode Listen Later Apr 16, 2026 24:20

At the MCP Summit inNew York City,Clare LiguoriofAmazon Web Servicesdiscussed the rapid rise of theModel Context Protocol(MCP), now a leading way to connect AI agents with tools and data. Originally developed byAnthropicand later transferred to theLinux Foundation, MCP has seen surging enterprise adoption as agentic AI expands. Liguori highlighted her dual role shaping MCP's evolving specification, including work on integrating webhooks, events, and notifications to support always-on AI agents. AWS has actively contributed features like Tasks and Elicitations and offers managed MCP servers, positioning itself as both contributor and experimental platform for emerging capabilities. This collaboration illustrates how corporate involvement can accelerate open-source innovation and adoption. Looking ahead, MCP's role as connective infrastructure for AI agents is expected to grow, especially as tools become more accessible. With broader adoption of AI development platforms across non-engineering roles, MCP could help extend automation beyond tech teams to businesses of all sizes. Learn more from The New Stack about the latest around Model Context Protocol(MCP): MCP: The Missing Link Between AI Agents and APIs Beyond the vibe code: The steep mountain MCP must climb to reach production MCP is everywhere, but don't panic. Here's why your existing APIs still matter. Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

amazon ai tech tasks aws apis doubles software engineers explodes agentic software developers mcp tech podcast liguori ai engineer new stack alex wilhelm new stack makers

A year in, Google wants its Axion processors to feel like a scheduling decision

Play Episode Listen Later Apr 15, 2026 22:18

At KubeCon Europe, Google Cloud's Jago Macleod and Abdel Sghiouar argued that adopting Arm for Kubernetes workloads has shifted from a complex migration to a practical, low-friction choice. After a year of production use, Google's custom Arm-based Axion processors—powering C4A and N4A instances—are positioned as broadly viable for most containerized applications, offering strong gains in performance, cost efficiency, and energy usage compared to x86. Rather than requiring a full overhaul, moving to Arm typically involves recompiling containers for a multi-architecture target and gradually rolling out via Kubernetes practices like canary deployments. While edge cases exist, they are relatively uncommon. A key enabler is GKE's compute classes, which allow workloads to express preferences across VM types, turning infrastructure decisions into automated scheduling choices rather than manual provisioning. Ultimately, the conversation points to a larger constraint: energy. As AI workloads grow, efficiency—measured in “tokens per watt”—is emerging as the defining metric, with cost savings translating directly into greater compute capacity. Learn more from The New Stack about the latest developments around Google's work with Axion: Arm: See a Demo About Migrating a x86-Based App to ARM64 Do All Your AI Workloads Actually Require Expensive GPUs? Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai google tech decision arm scheduling open source vm software engineers google cloud kubernetes gpus tech podcast processors axion ai engineer google wants new stack gke

Can you make Kubernetes invisible? Here's why AWS is on a mission to do it.

Play Episode Listen Later Apr 14, 2026 23:14

In this episode ofThe New Stack Makers, Jesse Butler, principal product manager for AWS Elastic Kubernetes Service, shares his vision for simplifying cloud-native computing. Since joining AWS in 2020, Butler has focused on making Kubernetes easier to use, emphasizing open-source as a democratizing force. He highlights the role of the Cloud Native Computing Foundation (CNCF) in standardizing and governing open ecosystems while balancing community-driven innovation with commercial contributions. Butler describes Kubernetes as widely adopted—used in production by around 80% of enterprises—yet still overly complex. His goal is to make it “invisible,” much like Linux, by abstracting and consolidating services. He points to projects like Karpenter, which enables real-time node provisioning for efficient scaling; Kro, which simplifies resource orchestration; and Cedar, a flexible policy engine for fine-grained authorization. He underscores the importance of open-source contributors, noting their critical yet often underappreciated role. Looking ahead, Butler envisions a future where automation and human collaboration further enhance usability and innovation in open-source software. Learn more from The New Stack about the latest around AWS Elastic Kubernetes Service 2026 Will Be the Year of Agentic Workloads in Production on Amazon EKS Amazon EKS Auto Mode wants to end Kubernetes toil — one node at a time Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

amazon mission tech production invisible butler open source aws linux software engineers cedar kubernetes software developers tech podcast kro kubecon ai engineer new stack jesse butler

The next stages of AI conformance in the cloud-native, open-source world

Play Episode Listen Later Apr 9, 2026 25:00

Running AI models on Kubernetes has historically been inconsistent, with workloads behaving differently across cloud providers due to variations in GPUs, networking, and autoscaling. As organizations move AI from experimentation to production, standardization has become critical. In this episode of The New Stack Makers, Jonathan Bryce, Executive Director of The Cloud Native Computing Foundation shared that the Foundation's Kubernetes AI conformance program aims to solve this by ensuring portability, predictability, and production readiness for AI workloads across environments. The initiative reflects a broader industry shift: AI is moving from training-heavy workloads to inference at scale, with inference expected to dominate compute usage by the end of the decade. Unlike batch-based training, inference requires real-time, always-on performance, making Kubernetes an attractive platform due to its elasticity, GPU-aware autoscaling, and observability. The conformance program establishes baseline standards for handling accelerators like GPUs and TPUs, reducing vendor lock-in and simplifying deployment. Early adopters include major cloud providers and ecosystem players, while new projects like llm-d aim to bridge orchestration and inference. As requirements evolve, ongoing collaboration and recertification will ensure the standards stay aligned with real-world needs. Learn more from The New Stack about the latest developments around The Cloud Native Computing Foundation's Kubernetes AI conformance program: CNCF: Kubernetes is ‘foundational' infrastructure for AI Kubernetes Gets an AI Conformance Program — and VMware Is Already On Board Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech executive director foundation stages open source gpu software engineers kubernetes gpus software developers tech podcast cloud native cncf ai engineer cloud native computing foundation new stack new stack makers

Microsoft wants to make service mesh invisible

Play Episode Listen Later Apr 8, 2026 21:21

At KubeCon EU 2026, Mitch Connors of Microsoft outlined a vision to make service meshes effectively invisible to users. Now working on Azure Kubernetes Application Network, a fully managed service built on Istio's ambient mode, Connors aims to deliver core capabilities like mTLS without requiring users to engage with the complexity traditionally associated with service meshes. Ambient mode eliminates sidecar upgrade challenges by shifting functionality to node-level and waypoint proxies, though adoption still faces hurdles, including lagging CVE patching. Connors emphasized that AI workloads are reshaping network demands, as request variability in large language models requires smarter routing and resource management. Istio is addressing this through a two-speed model: stable APIs for reliability and experimental integrations like Agent Gateway for emerging AI protocols. Features such as inference-aware routing and policy enforcement for approved LLM endpoints highlight the mesh's growing role in AI governance. With multi-cluster support and GPU scarcity driving workload mobility, Microsoft's approach bets that simplifying and abstracting the mesh will broaden adoption while meeting the evolving needs of AI-driven systems. Learn more from The New Stack about service meshes: The Hidden Costs of Service Meshes All the Things a Service Mesh Can Do Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

Amazon EKS Auto Mode wants to end Kubernetes toil — one node at a time

Play Episode Listen Later Apr 7, 2026 22:31

At KubeCon + CloudNativeCon Europe 2026 in Amsterdam, Alex Kestner, principal product manager for Amazon Elastic Kubernetes Service (EKS), discussed how Amazon EKS Auto Mode aims to reduce the operational burden of running Kubernetes at scale. While Kubernetes delivers significant power, it also introduces complexity—particularly through repetitive, day-to-day tasks like managing node lifecycles, ensuring security updates, and selecting optimal infrastructure. Kestner emphasized that much of this “undifferentiated heavy lifting” distracts platform teams from delivering business value. Amazon EKS Auto Mode addresses this by automating infrastructure operations across the full node lifecycle, shifting responsibility for key operational components outside the cluster and into AWS-managed services. Built in collaboration with the EC2 team and leveraging technologies like Karpenter, Auto Mode dynamically provisions right-sized compute resources based on workload requirements. While it doesn't eliminate all challenges—such as unpredictable workloads or diverse deployment needs—it provides a more application-focused approach to scaling and cost optimization. Ultimately, Auto Mode represents a meaningful step toward simplifying Kubernetes operations in increasingly complex cloud-native environments. Learn more from The New Stack about the latest developments around the latest with Amazon Elastic Kubernetes Service (EKS): 2026 Will Be the Year of Agentic Workloads in Production on Amazon EKS How Amazon EKS Auto Mode Simplifies Kubernetes Cluster Management (Part 1) A Deep Dive Into Amazon EKS Auto (Part 2) Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

Edge-forward: Akamai eyes sweet spot between centralized & decentralized AI inference

Play Episode Listen Later Apr 1, 2026 22:02

At KubeCon + CloudNativeCon Europe 2026, Lena Hall and Thorsten Hans of Akamai outlined how the company is evolving from a CDN provider into a developer-focused cloud platform for AI. Akamai's strategy centers on low-latency, distributed computing, combining managed Kubernetes, serverless functions, and a distributed AI inference platform to support modern workloads. With a global footprint of core and “distributed reach” datacenters, Akamai aims to bring compute closer to users while still leveraging centralized infrastructure for heavier processing. This hybrid model enables faster feedback loops critical for applications like fraud detection, robotics, and conversational AI. To address concerns about complexity, Akamai emphasizes managed infrastructure and self-service tools that abstract away integration challenges. Its platform supports open source through managed Kubernetes and pre-packaged tools, simplifying deployment. Akamai also invests in serverless technologies like WebAssembly-based functions, enabling developers to build and deploy globally distributed applications quickly. Overall, the company prioritizes developer experience, allowing teams to focus on application logic rather than infrastructure management. Learn more from The New Stack about the latest developments around how Akamai is transforming to a developer-focused cloud platform for AI. Akamai Picks Up Hosting for Kernel.org Should You Care About Fermyon Wasm Functions on Akamai? Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai forward open source sweet spot decentralized kubernetes cdn kernel inference akamai webassembly kubecon lena hall new stack new stack makers

Kubernetes co-founder Brendan Burns: AI-generated code will become as invisible as assembly

Play Episode Listen Later Mar 24, 2026 43:42

In this episode of The New Stack Makers, Microsoft Corporate Vice President and Technical Fellow, Brendan Burns discusses how AI is reshaping Kubernetes and modern infrastructure. Originally designed for stateless applications, Kubernetes is evolving to support AI workloads that require complex GPU scheduling, co-location, and failure sensitivity. Features like Dynamic Resource Allocation and projects such as KAITO introduce AI-specific capabilities, while maintaining Kubernetes' core strength: vendor-neutral extensibility. Burns highlights that AI also changes how systems are monitored. Success is no longer binary; it depends on answer quality, user feedback, and large-scale testing using thousands of prompts and even AI evaluators. On software development, Burns argues that the industry's focus on reviewing AI-generated code is temporary. Just as developers stopped inspecting compiler output, AI-generated code will become a disposable artifact validated by tests and specifications. This shift will redefine engineering roles and may lead to programming languages designed for machines rather than humans, signaling a fundamental transformation in how software is built and maintained. Learn more from The New Stack about the latest developments around how AI is reshaping Kubernetes and modern infrastructure: How To Use AI To Design Intelligent, Adaptable Infrastructure The AI Infrastructure crisis: When ambition meets ancient systems Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

AI can write your infrastructure code. There's a reason most teams won't let it.

Play Episode Listen Later Mar 20, 2026 29:21

In this episode ofThe New Stack Agents, Marcin Wyszynski, co-founder of Spacelift and OpenTofu, explains how AI is transforming infrastructure as code (IaC). Originally built for individual operators, tools like Terraform struggled to scale across teams, prompting Wyszynski to help launch OpenTofu after HashiCorp's 2023 license change. Now, the bigger shift is AI: engineers no longer write configuration languages like HCL manually, as AI tools generate it, dramatically lowering the barrier to entry. However, this creates a dangerous gap between generating infrastructure and truly understanding it—like using a phrasebook to ask questions in a foreign language but not understanding the response. In infrastructure, that lack of comprehension can lead to serious risks. To address this, Spacelift introduced Intent, which allows AI to directly interact with cloud systems in real time while enforcing deterministic guardrails through policy controls. The broader challenge remains balancing speed with control—enabling faster experimentation without sacrificing safety. Wyszynski argues that, like humans, AI can be trusted when constrained by strong guardrails. Learn more from The New Stack about the latest developments around how AI is transforming infrastructure as code (IaC). The Maturing State of Infrastructure as Code in 2025 Generative AI Tools for Infrastructure as Code Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech write code infrastructure intent software developers tech podcast iac terraform hcl hashicorp ai engineer new stack opentofu

OutSystems CEO on how enterprises can successfully adopt vibe coding

Play Episode Listen Later Mar 6, 2026 43:53

Woodson Martin, CEO ofOutSystems, argues that successful enterprise AI deployments rarely rely on standalone agents. Instead, production systems combine AI agents with data, workflows, APIs, applications, and human oversight. While claims that “95% of agent pilots fail” are common, Martin suggests many of those pilots were simply low-commitment experiments made possible by the low cost of testing AI. Enterprises that succeed typically keep humans in the loop, at least initially, to review recommendations and maintain control over decisions. Current enterprise use cases for agents include document processing, decision support, and personalized outputs. When integrated into broader systems, these applications can deliver measurable productivity gains. For example,Travel Essencebuilt an agentic system that reduced a two-hour customer planning process to three minutes, allowing staff to focus more on sales and helping drive 20% top-line growth. Martin also believes AI will pressure traditional SaaS seat-based pricing and accelerate custom software development. In this environment, governed platforms like OutSystems can help enterprises adopt “vibe coding” while maintaining compliance, security, and lifecycle management. Learn more from The New Stack about the latest developments around enterprise adoption of vibe coding: How To Use Vibe Coding Safely in the Enterprise 5 Challenges With Vibe Coding for Enterprises Vibe Coding: The Shadow IT Problem No One Saw Coming Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech current saas vibe successfully adopt coding apis enterprises software engineers tech podcast outsystems ai engineer new stack developer podcast

Inception Labs says its diffusion LLM is 10x faster than Claude, ChatGPT, Gemini

Play Episode Listen Later Mar 2, 2026 43:41

On a recent episode of the The New Stack Agents, Inception Labs CEO Stefano Ermon introduced Mercury 2, a large language model built on diffusion rather than the standard autoregressive approach. Traditional LLMs generate text token by token from left to right, which Ermon describes as “fancy autocomplete.” In contrast, diffusion models begin with a rough draft and refine it in parallel, similar to image systems like Stable Diffusion. This parallel process allows Mercury 2 to produce over 1,000 tokens per second—five to ten times faster than optimized models from labs such as OpenAI, Anthropic, and Google, according to company tests. Ermon argues diffusion models better leverage GPUs, with support from investor Nvidia to optimize performance. While Mercury 2 matches mid-tier models like Claude Haiku and Google Flash rather than top systems such as Claude Opus or GPT-4, Ermon believes diffusion's speed and economic advantages will become increasingly compelling as AI applications scale. Learn more from The New Stack about the latest developments around around large language model built on diffusion: How Diffusion-Based LLM AI Speeds Up Reasoning Get Ready for Faster Text Generation With Diffusion LLMs Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai google chatgpt mercury gemini openai nvidia labs gpt inception anthropic gpus diffusion stable diffusion new stack

NanoClaw's answer to OpenClaw is minimal code, maximum isolation

Play Episode Listen Later Feb 20, 2026 51:54

OnThe New Stack Agents, Gavriel Cohen discusses why he built NanoClaw, a minimalist alternative to OpenClaw, after discovering security and architectural flaws in the rapidly growing agentic framework. Cohen, co-founder of AI marketing agencyQwibit, had been running agents across operations, sales, and research usingClaude Code. When Clawdbot (laterOpenClaw) launched, it initially seemed ideal. But Cohen grew concerned after noticing questionable dependencies—including his own outdated GitHub package—excessive WhatsApp data storage, a massive AI-generated codebase nearing 400,000 lines, and a lack of OS-level isolation between agents. In response, he createdNanoClawwith radical minimalism: only a few hundred core lines, minimal dependencies, and containerized agents. Built around Claude Code “skills,” NanoClaw enables modular, build-time integrations while keeping the runtime small enough to audit easily. Cohen argues AI changes coding norms—favoring duplication over DRY, relaxing strict file limits, and treating code as disposable. His goal is simple, secure infrastructure that enterprises can fully understand and trust. Learn more from The New Stack about the latest around personal AI agents Anthropic: You can still use your Claude accounts to run OpenClaw, NanoClaw and Co. It took a researcher fewer than 2 hours to hijack OpenClaw OpenClaw is being called a security “Dumpster fire,” but there is a way to stay safe Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech built code os whatsapp isolation maximum dry github minimal dumpsters software engineers software developers tech podcast ai engineer new stack

The developer as conductor: Leading an orchestra of AI agents with the feature flag baton

Play Episode Listen Later Feb 19, 2026 19:32

A few weeks after Dynatrace acquired DevCycle, Michael Beemer and Andrew Norris discussed on The New Stack Makers podcast how feature flagging is becoming a critical safeguard in the AI era. By integrating DevCycle's feature flagging into the Dynatrace observability platform, the combined solution delivers a “360-degree view” of software performance at the feature level. This closes a key visibility gap, enabling teams to see exactly how individual features affect systems in production. As “agentic development” accelerates—where AI agents rapidly generate code—feature flags act as a safety net. They allow teams to test, control, and roll back AI-generated changes in live environments, keeping a human in the loop before full releases. This reduces risk while speeding enterprise adoption of AI tools. The discussion also highlighted support for the Cloud Native Computing Foundation's OpenFeature standard to avoid vendor lock-in. Ultimately, developers are evolving into “conductors,” orchestrating AI agents with feature flags as their baton. Learn more from The New Stack about the latest around AI enterprise development: Why You Can't Build AI Without Progressive Delivery Beyond automation: Dynatrace unveils agentic AI that fixes problems on its own Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

ai tech developers feature flag acquisition orchestras conductor software engineers baton tech podcast dynatrace ai engineer matt burns cloud native computing foundation feature flags software delivery new stack new stack makers

The reason AI agents shouldn't touch your source code — and what they should do instead

Play Episode Listen Later Feb 13, 2026 22:41

Dynatrace is at a pivotal point, expanding beyond traditional observability into a platform designed for autonomous operations and security powered by agentic AI. In an interview on *The New Stack Makers*, recorded at the Dynatrace Perform conference, Chief Technology Strategist Alois Reitbauer discussed his vision for AI-managed production environments. The conversation followed Dynatrace's acquisition of DevCycle, a feature-management platform. Reitbauer highlighted feature flags—long used in software development—as a critical safety mechanism in the age of agentic AI. Rather than allowing AI agents to rewrite and deploy code, Dynatrace envisions them operating within guardrails by adjusting configuration settings through feature flags. This approach limits risk while enabling faster, automated decision-making. Customers, Reitbauer noted, are increasingly comfortable with AI handling defined tasks under constraints, but not with agents making sweeping, unsupervised changes. By combining AI with controlled configuration tools, Dynatrace aims to create a safer path toward truly autonomous operations. Learn more from The New Stack about the latest in progressive delivery: Why You Can't Build AI Without Progressive Delivery Continuous Delivery: Gold Standard for Software Development Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai tech touch customers software engineers simplecast software developers source code tech podcast application development dynatrace matt burns feature flags software delivery new stack new stack makers

You can't fire a bot: The blunt truth about AI slop and your job

Play Episode Listen Later Feb 11, 2026 57:18

Matan-Paul Shetrit, Director of Product Management at Writer, argues that people must take responsibility for how they use AI. If someone produces poor-quality output, he says, the blame lies with the user—not the tool. He believes many misunderstand AI's role, confusing its ability to accelerate work with an abdication of accountability. Speaking on The New Stack Agents podcast, Shetrit emphasized that “we're all becoming editors,” meaning professionals increasingly review and refine AI-generated content rather than create everything from scratch. However, ultimate responsibility remains human. If an AI-generated presentation contains errors, the presenter—not the AI—is accountable. Shetrit also discussed the evolving AI landscape, contrasting massive general-purpose models from companies like OpenAI and Google with smaller, specialized models. At Writer, the focus is on enabling enterprise-scale AI adoption by reducing costs, improving accuracy, and increasing speed. He argues that bespoke, narrowly focused models tailored to specific use cases are essential for delivering reliable, cost-effective AI solutions at scale. Learn more from The New Stack about the latest around enterprise development: Why Pure AI Coding Won't Work for Enterprise Software How To Use Vibe Coding Safely in the Enterprise Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

director ai google work speaking tech writer bots openai product management blunt software engineers simplecast slop software developers tech podcast ai engineer new stack

GitLab CEO on why AI isn't helping enterprise ship code faster

Play Episode Listen Later Feb 10, 2026 57:18

AI coding assistants are boosting developer productivity, but most enterprises aren't shipping software any faster. GitLab CEO Bill Staples says the reason is simple: coding was never the main bottleneck. After speaking with more than 60 customers, Staples found that developers spend only 10–20% of their time writing code. The remaining 80–90% is consumed by reviews, CI/CD pipelines, security scans, compliance checks, and deployment—areas that remain largely unautomated. Faster code generation only worsens downstream queues.GitLab's response is its newly GA'ed Duo Agent Platform, designed to automate the full software development lifecycle. The platform introduces “agent flows,” multi-step orchestrations that can take work from issue creation through merge requests, testing, and validation. Staples argues that context is the key differentiator. Unlike standalone coding tools that only see local code, GitLab's all-in-one platform gives agents access to issues, epics, pipeline history, security data, and more through a unified knowledge graph.Staples believes this platform approach, rather than fragmented point solutions, is what will finally unlock enterprise software delivery at scale. Learn more from The New Stack about the latest around GitLab and AI: GitLab Launches Its AI Agent Platform in Public BetaGitLab's Field CTO Predicts: When DevSecOps Meets AIJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai tech code ga ship enterprise staples software engineers simplecast software developers gitlab ci cd tech podcast ai engineer new stack new stack makers

The enterprise is not ready for "the rise of the developer"

Play Episode Listen Later Feb 5, 2026 25:50

Sean O'Dell of Dynatrace argues that enterprises are unprepared for a major shift brought on by AI: the rise of the developer. Speaking at Dynatrace Perform in Las Vegas, O'Dell explains that AI-assisted and “vibe” coding are collapsing traditional boundaries in software development. Developers, once insulated from production by layers of operations and governance, are now regaining end-to-end ownership of the entire software lifecycle — from development and testing to deployment and security. This shift challenges long-standing enterprise structures built around separation of duties and risk mitigation. At the same time, the definition of “developer” is expanding. With AI lowering technical barriers, software creation is becoming more about creative intent than mastery of specialized tools, opening the door to nontraditional developers. Experimentation is also moving into production environments, a change that would have seemed reckless just 18 months ago. According to O'Dell, enterprises now understand AI well enough to experiment confidently, but many are not ready for the cultural, operational, and security implications of developers — broadly defined — taking full control again.Learn more from The New Stack about the latest around enterprise developers and AI: Retool's New AI-Powered App Builder Lets Non-Developers Build Enterprise AppsSolving 3 Enterprise AI Problems Developers FaceEnterprise Platform Teams Are Stuck in Day 2 HellJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai las vegas speaking tech developers enterprise experimentation software engineers simplecast software development software developers tech podcast dynatrace ai engineer matt burns new stack

Meet Gravitino, a geo-distributed, federated metadata lake

Play Episode Listen Later Jan 29, 2026 29:27

In the era of agentic AI, attention has largely focused on data itself, while metadata has remained a neglected concern. Junping (JP) Du, founder and CEO of Datastrato, argues that this must change as AI fundamentally alters how data and metadata are consumed, governed, and understood. To address this gap, Datastrato created Apache Gravitino, an open source, high-performance, geo-distributed, federated metadata lake designed to act as a neutral control plane for metadata and governance across multi-modal, multi-engine AI workloads. Gravitino achieved major milestones in 2025, including graduation as an Apache Top Level Project, a stable 1.1.0 release, and membership in the new Agentic AI Foundation. Du describes Gravitino as a “catalog of catalogs” that unifies metadata across engines like Spark, Trino, Ray, and PyTorch, eliminating silos and inconsistencies. Built to support both structured and unstructured data, Gravitino enables secure, consistent, and AI-friendly data access across clouds and regions, helping enterprises manage governance, access control, and scalability in increasingly complex AI environments.Learn more from The New Stack about how the latest data and metadata are consumed, governed, and understood: Is Agentic Metadata the Next Infrastructure Layer?Why AI Loves Object StorageThe Real Bottleneck in Enterprise AI Isn't the Model, It's ContextJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ceo ai tech model built lake spark open source distributed simplecast metadata software developers tech podcast federated trino pytorch ai engineer new stack new stack makers

CTO Chris Aniszczyk on the CNCF push for AI interoperability

Play Episode Listen Later Jan 22, 2026 23:33

Chris Aniszczyk, co-founder and CTO of the Cloud Native Computing Foundation (CNCF), argues that AI agents resemble microservices at a surface level, though they differ in how they are scaled and managed. In an interview ahead of KubeCon/CloudNativeCon Europe, he emphasized that being “AI native” requires being cloud native by default. Cloud-native technologies such as containers, microservices, Kubernetes, gRPC, Prometheus, and OpenTelemetry provide the scalability, resilience, and observability needed to support AI systems at scale. Aniszczyk noted that major AI platforms like ChatGPT and Claude already rely on Kubernetes and other CNCF projects.To address growing complexity in running generative and agentic AI workloads, the CNCF has launched efforts to extend its conformance programs to AI. New requirements—such as dynamic resource allocation for GPUs and TPUs and specialized networking for inference workloads—are being handled inconsistently across the industry. CNCF aims to establish a baseline of compatibility to ensure vendor neutrality. Aniszczyk also highlighted CNCF incubation projects like Metal³ for bare-metal Kubernetes and OpenYurt for managing edge-based Kubernetes deployments. Learn more from The New Stack about CNCF and what to expect in 2026:Why the CNCF's New Executive Director Is Obsessed With InferenceCNCF Dragonfly Speeds Container, Model Sharing with P2PJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Solving the Problems that Accompany API Sprawl with AI

Play Episode Listen Later Jan 15, 2026 19:19

API sprawl creates hidden security risks and missed revenue opportunities when organizations lose visibility into the APIs they build. According to IBM's Neeraj Nargund, APIs power the core business processes enterprises want to scale, making automated discovery, observability, and governance essential—especially when thousands of APIs exist across teams and environments. Strong governance helps identify endpoints, remediate shadow APIs, and manage risk at scale. At the same time, enterprises increasingly want to monetize the data APIs generate, packaging insights into products and pricing and segmenting usage, a need amplified by the rise of AI.To address these challenges, Nargund highlights “smart APIs,” which are infused with AI to provide context awareness, event-driven behavior, and AI-assisted governance throughout the API lifecycle. These APIs help interpret and act on data, integrate with AI agents, and support real-time, streaming use cases.IBM's latest API Connect release embeds AI across API management and is designed for hybrid and multi-cloud environments, offering centralized governance, observability, and control through a single hybrid control plane.Learn more from The New Stack about smart APIs: Redefining API Management for the AI-Driven Enterprise How To Accelerate Growth With AI-Powered Smart APIs Wrangle Account Sprawl With an AI Gateway Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai tech ibm api apis software engineers simplecast software developers tech podcast accompany sprawl ai engineer new stack new stack makers

CloudBees CEO: Why Migration Is a Mirage Costing You Millions

Play Episode Listen Later Jan 13, 2026 34:08

A CloudBees survey reveals that enterprise migration projects often fail to deliver promised modernization benefits. In 2024, 57% of enterprises spent over $1 million on migrations, with average overruns costing $315,000 per project. In The New Stack Makers podcast, CloudBees CEO Anuj Kapur describes this pattern as “the migration mirage,” where organizations chase modernization through costly migrations that push value further into the future. Findings from the CloudBees 2025 DevOps Migration Index show leaders routinely underestimate the longevity and resilience of existing systems. Kapur notes that applications often outlast CIOs, yet new leadership repeatedly mandates wholesale replacement. The report argues modernization has been mistakenly equated with migration, which diverts resources from customer value to replatforming efforts. Beyond financial strain, migration erodes developer morale by forcing engineers to rework functioning systems instead of building new solutions. CloudBees advocates meeting developers where they are, setting flexible guardrails rather than enforcing rigid platforms. Kapur believes this approach, combined with emerging code assistance tools, could spark a new renaissance in software development by 2026.Learn more from The New Stack about enterprise modernization: Why AI Alone Fails at Large-Scale Code ModernizationHow AI Can Speed up Modernization of Your Legacy IT SystemsJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

tech millions migration cios costing findings mirage software engineers simplecast software development modernization software developers tech podcast kapur ai engineer new stack cloudbees it modernization new stack makers

Human Cognition Can't Keep Up with Modern Networks. What's Next?

Play Episode Listen Later Jan 7, 2026 23:16

IBM's recent acquisitions of Red Hat, HashiCorp, and its planned purchase of Confluent reflect a deliberate strategy to build the infrastructure required for enterprise AI. According to IBM's Sanil Nambiar, AI depends on consistent hybrid cloud runtimes (Red Hat), programmable and automated infrastructure (HashiCorp), and real-time, trustworthy data (Confluent). Without these foundations, AI cannot function effectively. Nambiar argues that modern, software-defined networks have become too complex for humans to manage alone, overwhelmed by fragmented data, escalating tool sophistication, and a widening skills gap that makes veteran “tribal knowledge” hard to transfer. Trust, he says, is the biggest barrier to AI adoption in networking, since errors can cause costly outages. To address this, IBM launched IBM Network Intelligence, a “network-native” AI solution that combines time-series foundation models with reasoning large language models. This architecture enables AI agents to detect subtle warning patterns, collapse incident response times, and deliver accurate, trustworthy insights for real-world network operations.Learn more from The New Stack about AI infrastructure and IBM's approach: AI in Network Observability: The Dawn of Network Intelligence How Agentic AI Is Redefining Campus and Branch Network Needs Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

trust ai tech modern ibm networks keep up software engineers simplecast red hat software developers tech podcast hybrid cloud hashicorp confluent ai engineer human cognition new stack new stack makers

From Group Science Project to Enterprise Service: Rethinking OpenTelemetry

Play Episode Listen Later Dec 30, 2025 17:20

Ari Zilka, founder of MyDecisive.ai and former Hortonworks CPO, argues that most observability vendors now offer essentially identical, reactive dashboards that highlight problems only after systems are already broken. After speaking with all 23 observability vendors at KubeCon + CloudNativeCon North America 2025, Zilka said these tools fail to meaningfully reduce mean time to resolution (MTTR), a long-standing demand he heard repeatedly from thousands of CIOs during his time at New Relic.Zilka believes observability must shift from reactive monitoring to proactive operations, where systems automatically respond to telemetry in real time. MyDecisive.ai is his attempt to solve this, acting as a “bump in the wire” that intercepts telemetry and uses AI-driven logic to trigger actions like rolling back faulty releases.He also criticized the rising cost and complexity of OpenTelemetry adoption, noting that many companies now require large, specialized teams just to maintain OTel stacks. MyDecisive aims to turn OpenTelemetry into an enterprise-ready service that reduces human intervention and operational overhead.Learn more from The New Stack about OpenTelemetry:Observability Is Stuck in the Past. Your Users Aren't. Setting Up OpenTelemetry on the Frontend Because I Hate MyselfHow to Make OpenTelemetry Better in the BrowserJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai service tech rethinking enterprise cios change management software engineers simplecast software developers tech podcast observability new relic science project kubecon ai engineer otel new stack new stack makers

Why You Can't Build AI Without Progressive Delivery

Play Episode Listen Later Dec 23, 2025 27:42

Former GitHub CEO Thomas Dohmke's claim that AI-based development requires progressive delivery frames a conversation between analyst James Governor and The New Stack's Alex Williams about why modern release practices matter more than ever. Governor argues that AI systems behave unpredictably in production: models can hallucinate, outputs vary between versions, and changes are often non-deterministic. Because of this uncertainty, teams must rely on progressive delivery techniques such as feature flags, canary releases, observability, measurement and rollback. These practices, originally developed to improve traditional software releases, now form the foundation for deploying AI safely. Concepts like evaluations, model versioning and controlled rollouts are direct extensions of established delivery disciplines. Beyond AI, Governor's book “Progressive Delivery” challenges DevOps thinking itself. He notes that DevOps focuses on development and operations but often neglects the user feedback loop. Using a framework of four A's — abundance, autonomy, alignment and automation — he argues that progressive delivery reconnects teams with real user outcomes. Ultimately, success isn't just reliability metrics, but whether users are actually satisfied. Learn more from The New Stack about progressive delivery: Mastering Progressive Hydration for Enhanced Web Performance Continuous Delivery: Gold Standard for Software Development Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai tech governor progressive delivery concepts devops simplecast software development tech podcast alex williams software delivery new stack redmonk new stack makers

How Nutanix Is Taming Operational Complexity

Play Episode Listen Later Dec 18, 2025 15:20

Most enterprises today run workloads across multiple IT infrastructures rather than a single platform, creating significant operational challenges. According to Nutanix CTO Deepak Goel, organizations face three major hurdles: managing operational complexity amid a shortage of cloud-native skills, migrating legacy virtual machine (VM) workloads to microservices-based cloud-native platforms, and running VM-based workloads alongside containerized applications. Many engineers have deep infrastructure experience but lack Kubernetes expertise, making the transition especially difficult and increasing the learning curve for IT administrators. To address these issues, organizations are turning to platform engineering and internal developer platforms that abstract infrastructure complexity and provide standardized “golden paths” for deployment. Integrated development environments (IDEs) further reduce friction by embedding capabilities like observability and security. Nutanix contributes through its hyper converged platform, which unifies compute and storage while supporting both VMs and containers. At KubeCon North America, Nutanix announced version 2.0 of Nutanix Data Services for Kubernetes (NDK), adding advanced data protection, fault-tolerant replication, and enhanced security through a partnership with Canonical to deliver a hardened operating system for Kubernetes environments.Learn more from The New Stack about operational complexity in cloud native environments:Q&A: Nutanix CEO Rajiv Ramaswami on the Cloud Native Enterprise Kubernetes Complexity Realigns Platform Engineering Strategy Platform Engineering on the Brink: Breakthrough or Bust? Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

bust complexity taming integrated operational vm simplecast ides kubernetes canonical vms nutanix new stack

Do All Your AI Workloads Actually Require Expensive GPUs?

Play Episode Listen Later Dec 18, 2025 29:49

GPUs dominate today's AI landscape, but Google argues they are not necessary for every workload. As AI adoption has grown, customers have increasingly demanded compute options that deliver high performance with lower cost and power consumption. Drawing on its long history of custom silicon, Google introduced Axion CPUs in 2024 to meet needs for massive scale, flexibility, and general-purpose computing alongside AI workloads. The Axion-based C4A instance is generally available, while the newer N4A virtual machines promise up to 2x price performance.In this episode, Andrei Gueletii, a technical solutions consultant for Google Cloud joined Gari Singh, a product manager for Google Kubernetes Engine (GKE), and Pranay Bakre, a principal solutions engineer at Arm for this episode, recorded at KubeCon + CloudNativeCon North America, in Atlanta. Built on Arm Neoverse V2 cores, Axion processors emphasize energy efficiency and customization, including flexible machine shapes that let users tailor memory and CPU resources. These features are particularly valuable for platform engineering teams, which must optimize centralized infrastructure for cost, FinOps goals, and price performance as they scale.Importantly, many AI tasks—such as inference for smaller models or batch-oriented jobs—do not require GPUs. CPUs can be more efficient when GPU memory is underutilized or latency demands are low. By decoupling workloads and choosing the right compute for each task, organizations can significantly reduce AI compute costs.Learn more from The New Stack about the Axion-based C4A: Beyond Speed: Why Your Next App Must Be Multi-ArchitectureArm: See a Demo About Migrating a x86-Based App to ARM64Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai google tech drawing built expensive arm open source require gpu cpu software engineers simplecast google cloud workload kubernetes gpus software developers cpus tech podcast finops axion ai engineer new stack gary singh new stack makers

Breaking Data Team Silos Is the Key to Getting AI to Production

Play Episode Listen Later Dec 17, 2025 30:47

Enterprises are racing to deploy AI services, but the teams responsible for running them in production are seeing familiar problems reemerge—most notably, silos between data scientists and operations teams, reminiscent of the old DevOps divide. In a discussion recorded at AWS re:Invent 2025, IBM's Thanos Matzanas and Martin Fuentes argue that the challenge isn't new technology but repeating organizational patterns. As data teams move from internal projects to revenue-critical, customer-facing applications, they face new pressures around reliability, observability, and accountability.The speakers stress that many existing observability and governance practices still apply. Standard metrics, KPIs, SLOs, access controls, and audit logs remain essential foundations, even as AI introduces non-determinism and a heavier reliance on human feedback to assess quality. Tools like OpenTelemetry provide common ground, but culture matters more than tooling.Both emphasize starting with business value and breaking down silos early by involving data teams in production discussions. Rather than replacing observability professionals, AI should augment human expertise, especially in critical systems where trust, safety, and compliance are paramount.Learn more from The New Stack about enabling AI with silos: Are Your AI Co-Pilots Trapping Data in Isolated Silos?Break the AI Gridlock at the Intersection of Velocity and TrustTaming AI Observability: Control Is the Key to SuccessJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Why AI Parallelization Will Be One of the Biggest Challenges of 2026

Play Episode Listen Later Dec 16, 2025 24:05

Rob Whiteley, CEO of Coder, argues that the biggest winners in today's AI boom resemble the “picks and shovels” sellers of the California Gold Rush: companies that provide tools enabling others to build with AI. Speaking onThe New Stack Makersat AWS re:Invent, Whiteley described the current AI moment as the fastest-moving shift he's seen in 25 years of tech. Developers are rapidly adopting AI tools, while platform teams face pressure to approve them, as saying “no” is no longer viable. Whiteley warns of a widening gap between organizations that extract real value from AI and those that don't, driven by skills shortages and insufficient investment in training. He sees parallels with the cloud-native transition and predicts the rise of “AI-native” companies. As agentic AI grows, developers increasingly act as managers overseeing many parallel AI agents, creating new challenges around governance, security, and state management. To address this, Coder introduced Mux, an open source coding agent multiplexer designed to help developers manage and evaluate large volumes of AI-generated code efficiently.Learn more from The New Stack about AI Parallelization The Production Generative AI Stack: Architecture and ComponentsEnable ParallelFrontend/Backend Development to Unlock VelocityJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ceo ai tech developers open source invent simplecast biggest challenges coders software developers tech podcast california gold rush whiteley ai engineer mux new stack aws reinvent

Kubernetes GPU Management Just Got a Major Upgrade

Play Episode Listen Later Dec 11, 2025 35:26

Nvidia Distinguished Engineer Kevin Klues noted that low-level systems work is invisible when done well and highly visible when it fails — a dynamic that frames current Kubernetes innovations for AI. At KubeCon + CloudNativeCon North America 2025, Klues and AWS product manager Jesse Butler discussed two emerging capabilities: dynamic resource allocation (DRA) and a new workload abstraction designed for sophisticated AI scheduling.DRA, now generally available in Kubernetes 1.34, fixes long-standing limitations in GPU requests. Instead of simply asking for a number of GPUs, users can specify types and configurations. Modeled after persistent volumes, DRA allows any specialized hardware to be exposed through standardized interfaces, enabling vendors to deliver custom device drivers cleanly. Butler called it one of the most elegant designs in Kubernetes.Yet complex AI workloads require more coordination. A forthcoming workload abstraction, debuting in Kubernetes 1.35, will let users define pod groups with strict scheduling and topology rules — ensuring multi-node jobs start fully or not at all. Klues emphasized that this abstraction will shape Kubernetes' AI trajectory for the next decade and encouraged community involvement.Learn more from The New Stack about dynamic resource allocation: Kubernetes Primer: Dynamic Resource Allocation (DRA) for GPU WorkloadsKubernetes v1.34 Introduces Benefits but Also New Blind SpotsJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The Rise of the Cognitive Architect

Play Episode Listen Later Dec 10, 2025 22:53

At KubeCon North America 2025, GitLab's Emilio Salvador outlined how developers are shifting from individual coders to leaders of hybrid human–AI teams. He envisions developers evolving into “cognitive architects,” responsible for breaking down large, complex problems and distributing work across both AI agents and humans. Complementing this is the emerging role of the “AI guardian,” reflecting growing skepticism around AI-generated code. Even as AI produces more code, humans remain accountable for reviewing quality, security, and compliance.Salvador also described GitLab's “AI paradox”: developers may code faster with AI, but overall productivity stalls because testing, security, and compliance processes haven't kept pace. To fix this, he argues organizations must apply AI across the entire development lifecycle, not just in coding. GitLab's Duo Agent Platform aims to support that end-to-end transformation.Looking ahead, Salvador predicts the rise of a proactive “meta agent” that functions like a full team member. Still, he warns that enterprise adoption remains slow and advises organizations to start small, build skills, and scale gradually.Learn more from The New Stack about the evolving role of "cognitive architects":The Engineer in the AI Age: The Orchestrator and ArchitectThe New Role of Enterprise Architecture in the AI EraThe Architect's Guide to Understanding Agentic AIJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai guide engineers architects cognitive open source software engineers simplecast kubernetes software developers gitlab complementing enterprise architecture ai engineer new stack new stack makers

Why the CNCF's New Executive Director is Obsessed With Inference

Play Episode Listen Later Dec 9, 2025 25:09

Jonathan Bryce, the new CNCF executive director, argues that inference—not model training—will define the next decade of computing. Speaking at KubeCon North America 2025, he emphasized that while the industry obsesses over massive LLM training runs, the real opportunity lies in efficiently serving these models at scale. Cloud-native infrastructure, he says, is uniquely suited to this shift because inference requires real-time deployment, security, scaling, and observability—strengths of the CNCF ecosystem. Bryce believes Kubernetes is already central to modern inference stacks, with projects like Ray, KServe, and emerging GPU-oriented tooling enabling teams to deploy and operationalize models. To bring consistency to this fast-moving space, the CNCF launched a Kubernetes AI Conformance Program, ensuring environments support GPU workloads and Dynamic Resource Allocation. With AI agents poised to multiply inference demand by executing parallel, multi-step tasks, efficiency becomes essential. Bryce predicts that smaller, task-specific models and cloud-native routing optimizations will drive major performance gains. Ultimately, he sees CNCF technologies forming the foundation for what he calls “the biggest workload mankind will ever have.” Learn more from The New Stack about inference: Confronting AI's Next Big Challenge: Inference Compute Deep Infra Is Building an AI Inference Cloud for Developers Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Kubernetes Gets an AI Conformance Program — and VMware Is Already On Board

Play Episode Listen Later Dec 8, 2025 30:40

The Cloud Native Computing Foundation has introduced the Certified Kubernetes AI Conformance Program to bring consistency to an increasingly fragmented AI ecosystem. Announced at KubeCon + CloudNativeCon North America 2025, the program establishes open, community-driven standards to ensure AI applications run reliably and portably across different Kubernetes platforms. VMware by Broadcom's vSphere Kubernetes Service (VKS) is among the first platforms to achieve certification.In an interview with The New Stack, Broadcom leaders Dilpreet Bindra and Himanshu Singh explained that the program applies lessons from Kubernetes' early evolution, aiming to reduce the “muddiness” in AI tooling and improve cross-platform interoperability. They emphasized portability as a core value: organizations should be able to move AI workloads between public and private clouds with minimal friction.VKS integrates tightly with vSphere, using Kubernetes APIs directly to manage infrastructure components declaratively. This approach, along with new add-on management capabilities, reflects Kubernetes' growing maturity. According to Bindra and Singh, this stability now enables enterprises to trust Kubernetes as a foundation for production-grade AI. Learn more from The New Stack about Broadcom's latest updates with Kubernetes: Has VMware Finally Caught Up with Kubernetes?VMware VCF 9.0 Finally Unifies Container and VM ManagementJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

ai board singh open source simplecast vmware kubernetes broadcom ai engineer vsphere cloud native computing foundation new stack new stack makers

Claim The New Stack Podcast

In order to claim this podcast we'll send an email to with a verification link. Simply click the link and you will be able to edit tags, request a refresh, and other features to take control of your podcast page!

Claim Cancel