Podcasts about Observability

  • 444 podcasts
  • 1,367 episodes
  • 41m average duration
  • 5 new episodes weekly
  • Latest: Jun 23, 2025

POPULARITY (chart by year, 2017 to 2024)

Latest podcast episodes about Observability

The Tech Blog Writer Podcast
3324: How Splunk Helps Businesses Cut Through Digital Noise

Jun 23, 2025 (21:14)


How do you keep complex digital experiences running smoothly when every layer, from networks to cloud infrastructure to applications, can break in ways that frustrate customers and burn out IT teams? This question is at the heart of my conversation recorded live at Cisco Live in San Diego with Patrick Lin, Senior Vice President and General Manager for Observability at Splunk, now part of Cisco. In this episode, Patrick explains how observability has evolved far beyond simple monitoring and is becoming the nerve centre for digital resilience in a world where reactive alerts no longer cut it. We unpack how Splunk and Cisco ThousandEyes are now deeply integrated, giving teams a single source of truth that connects application behaviour, infrastructure health, and network performance, even across systems they do not directly control. Patrick also shares what these two-way integrations mean in practice: faster incident resolution, fewer blame games, and far less time wasted chasing false alerts. We explore how AI is enhancing this vision by cutting through the noise to detect real anomalies, correlate related events, and suggest root causes at a speed no human team could match. If your business depends on staying online and your teams are drowning in disconnected data, this conversation offers a glimpse into the next phase of unified observability and assurance. It might even help quiet the flood of alerts that keep IT professionals awake at night. How is your organisation tackling alert fatigue and rising complexity? Listen in and tell me what strategies you have found that actually work.

A Bootiful Podcast
Micrometer.io lead Tommy Ludwig on the latest-and-greatest in observability for the Spring developer

Jun 19, 2025 (46:40)


Hi, Spring fans! In this episode, I talk to Micrometer.io lead Tommy Ludwig about the latest-and-greatest in observability for the Spring developer.

Smart Software with SmartLogic
LangChain: LLM Integration for Elixir Apps with Mark Ericksen

Jun 12, 2025 (38:18)


Mark Ericksen, creator of the Elixir LangChain framework, joins the Elixir Wizards to talk about LLM integration in Elixir apps. He explains how LangChain abstracts away the quirks of different AI providers (OpenAI, Anthropic's Claude, Google's Gemini) so you can work with any LLM through a single, more consistent API. We dig into core features like conversation chaining, tool execution, automatic retries, and production-grade fallback strategies.

Mark shares his experiences maintaining LangChain in a fast-moving AI world: how it shields developers from API drift, manages token budgets, and handles rate limits and outages. He also reveals testing tactics for non-deterministic AI outputs, configuration tips for custom authentication, and the highlights of the new v0.4 release, including “content parts” support for thinking-style models.

Key topics discussed in this episode:

  • Abstracting LLM APIs behind a unified Elixir interface
  • Building and managing conversation chains across multiple models
  • Exposing application functionality to LLMs through tool integrations
  • Automatic retries and fallback chains for production resilience
  • Supporting a variety of LLM providers
  • Tracking and optimizing token usage for cost control
  • Configuring API keys, authentication, and provider-specific settings
  • Handling rate limits and service outages with degradation
  • Processing multimodal inputs (text, images) in LangChain workflows
  • Extracting structured data from unstructured LLM responses
  • Leveraging “content parts” in v0.4 for advanced thinking-model support
  • Debugging LLM interactions using verbose logging and telemetry
  • Kickstarting experiments in LiveBook notebooks and demos
  • Comparing Elixir LangChain to the original Python implementation
  • Crafting human-in-the-loop workflows for interactive AI features
  • Integrating LangChain with the Ash framework for chat-driven interfaces
  • Contributing to open-source LLM adapters and staying ahead of API changes
  • Building fallback chains (e.g., OpenAI → Azure) for seamless continuity
  • Embedding business logic decisions directly into AI-powered tools
  • Summarization techniques for token efficiency in ongoing conversations
  • Batch processing tactics to leverage lower-cost API rate tiers
  • Real-world lessons on maintaining uptime amid LLM service disruptions

Links mentioned:

  • https://rubyonrails.org/
  • https://fly.io/
  • https://zionnationalpark.com/
  • https://podcast.thinkingelixir.com/
  • https://github.com/brainlid/langchain
  • https://openai.com/
  • https://claude.ai/
  • https://gemini.google.com/
  • https://www.anthropic.com/
  • Vertex AI Studio: https://cloud.google.com/generative-ai-studio
  • https://www.perplexity.ai/
  • https://azure.microsoft.com/
  • https://hexdocs.pm/ecto/Ecto.html
  • https://oban.pro/
  • Chris McCord's ElixirConf EU 2025 Talk: https://www.youtube.com/watch?v=ojL_VHc4gLk
  • Getting started: https://hexdocs.pm/langchain/gettingstarted.html
  • https://ash-hq.org/
  • https://hex.pm/packages/langchain
  • https://hexdocs.pm/igniter/readme.html
  • https://www.youtube.com/watch?v=WM9iQlQSFg
  • @brainlid on Twitter and BlueSky

Special Guest: Mark Ericksen.

Code RED
#27 - The Hard Truths of Modern Observability: Lessons on Cost, Complexity and What Needs Fixing with Andrew Mallaband

Jun 12, 2025 (34:21)


Industry veteran Andrew Mallaband of Breakthrough Moments joins Dash0's Mirko Novakovic to explore critical gaps in modern observability. Drawing on his "Observability 2025" series, Andrew breaks down why cost, data overload and poor strategy are holding teams back. They discuss the rise of AI-powered SRE agents, the challenge of missing telemetry, and how aligning data collection with intention is key to unlocking AI's potential.

O11ycast
Ep. #83, Observability Isn't Just SRE on Steroids with Dan Ravenstone

Jun 11, 2025 (36:15)


In episode 83 of o11ycast, the Honeycomb team chats with Dan Ravenstone, the o11yneer. Dan unpacks the crucial, often underappreciated, role of the observability engineer. He discusses how this position champions the user, bridging the gap between technical performance and real-world customer experience. Learn about the challenges of mobile observability, the importance of clear terminology, and how building alliances across an organization drives successful observability practices.

Heavybit Podcast Network: Master Feed
Ep. #83, Observability Isn't Just SRE on Steroids with Dan Ravenstone

Jun 11, 2025 (36:15)


In episode 83 of o11ycast, the Honeycomb team chats with Dan Ravenstone, the o11yneer. Dan unpacks the crucial, often underappreciated, role of the observability engineer. He discusses how this position champions the user, bridging the gap between technical performance and real-world customer experience. Learn about the challenges of mobile observability, the importance of clear terminology, and how building alliances across an organization drives successful observability practices.

VMware Communities Roundtable
#719 - Beyond Monitoring: How Network Observability Transforms IT Operations

Jun 11, 2025


Bob and Eric discuss Network Observability with VMware tools.

VMware Communities Roundtable
#720 - A Developer's Journey into Observability and Automation with Garvit Kataria

Jun 11, 2025


Enginears
Discovering How PromptLayer Manages Over 80 Million Monthly Prompt Executions! | Enginears Podcast

Jun 11, 2025 (34:58)


If you're keen to share your story, please reach out to us!

Guest:
https://www.linkedin.com/in/imjaredz/
https://www.promptlayer.com/careers/

Powered by Artifeks!
https://www.linkedin.com/company/artifeksrecruitment
https://www.artifeks.co.uk
https://www.linkedin.com/in/agilerecruiter

LinkedIn: https://www.linkedin.com/company/enginearsio
Twitter: https://x.com/Enginearsio
All Podcast Platforms: https://smartlink.ausha.co/enginears

00:00 - Enginears Intro.
00:37 - PromptLayer origin.
02:24 - PromptLayer Intro.
03:52 - PromptLayer & prompt engineering today and the evolution going forward.
08:18 - Challenges building PromptLayer.
11:04 - How is Jared and PromptLayer focusing the team on what matters?
15:17 - What is Vibe coder?
17:00 - What Vibe coding lacks?
18:10 - Prompt engineers don't have to be technical.
21:30 - Taking an idea exploring it through A/B testing.
24:56 - Observability in prompting.
28:26 - How would Jared best advise someone to build an operational LLM?
30:50 - Next 12 months at PromptLayer.
33:38 - Jared & PromptLayer Outro.
34:16 - Enginears Outro.

Hosted by Ausha. See ausha.co/privacy-policy for more information.

PodRocket - A web development podcast from LogRocket
Server functions don't exist with Jack Herrington

Jun 5, 2025 (21:20)


Jack Herrington, podcaster, software engineer, writer and YouTuber, joins the pod to uncover the truth behind server functions and why they don't actually exist in the web platform. We dive into the magic behind frameworks like Next.js, TanStack Start, and Remix, breaking down how server functions work, what they simplify, what they hide, and what developers need to know to build smarter, faster, and more secure web apps.

Links

  • YouTube: https://www.youtube.com/@jherr
  • Twitter: https://x.com/jherr
  • Github: https://github.com/jherr
  • ProNextJS: https://www.pronextjs.dev
  • Discord: https://discord.com/invite/KRVwpJUG6p
  • LinkedIn: https://www.linkedin.com/in/jherr
  • Website: https://jackherrington.com

Resources

  • Server Functions Don't Exist (It Matters) (https://www.youtube.com/watch?v=FPJvlhee04E)

We want to hear from you! How did you find us? Did you see us on Twitter? In a newsletter? Or maybe we were recommended by a friend? Let us know by sending an email to our producer, Em, at emily.kochanek@logrocket.com (mailto:emily.kochanek@logrocket.com), or tweet at us at PodRocketPod (https://twitter.com/PodRocketpod).

Follow us. Get free stickers. Follow us on Apple Podcasts, fill out this form (https://podrocket.logrocket.com/get-podrocket-stickers), and we'll send you free PodRocket stickers!

What does LogRocket do? LogRocket provides AI-first session replay and analytics that surfaces the UX and technical issues impacting user experiences. Start understanding where your users are struggling by trying it for free at LogRocket.com. Try LogRocket for free today. (https://logrocket.com/signup/?pdr)

Special Guest: Jack Herrington.

CaSE: Conversations about Software Engineering
Mirko Novakovic on Waves of Innovation and Observability Product Management

Jun 5, 2025 (106:27), transcription available


In this episode of the CaSE Podcast, Mirko Novakovic, a seasoned entrepreneur and investor, shares his journey through the waves of technological innovation, from the early days of online banking to the rise of AI and OpenTelemetry. We explore with him how the lessons learned in diverse industries, including the food business, can reshape our approach to software development and architecture, emphasizing the importance of curiosity, adaptability, and a solid grasp of the fundamentals.

MLOps.community
Product Metrics are LLM Evals // Raza Habib CEO of Humanloop // #320

Jun 3, 2025 (53:06)


Raza Habib, the CEO of LLM eval platform Humanloop, talks to us about how to make your AI products more accurate and reliable by shortening the feedback loop of your evals. He covers iterating quickly on prompts and testing what works, along with some of his favorite quotes from Dario of Anthropic.

// Bio
Raza is the CEO and Co-founder at Humanloop. He has a PhD in Machine Learning from UCL, was the founding engineer of Monolith AI, and has built speech systems at Google. For the last 4 years, he has led Humanloop and supported leading technology companies such as Duolingo, Vanta, and Gusto to build products with large language models. Raza was featured in the Forbes 30 Under 30 technology list in 2022, and Sifted recently named him one of the most influential Gen AI founders in Europe.

// Related Links
Website: https://humanloop.com

// Connect With Us
Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExplore
MLOps Swag/Merch: https://shop.mlops.community/
Connect with Demetrios on LinkedIn: /dpbrinkm
Connect with Raza on LinkedIn: /humanloop-raza

Timestamps:
[00:00] Cracking Open System Failures and How We Fix Them
[05:44] LLMs in the Wild — First Steps and Growing Pains
[08:28] Building the Backbone of Tracing and Observability
[13:02] Tuning the Dials for Peak Model Performance
[13:51] From Growing Pains to Glowing Gains in AI Systems
[17:26] Where Prompts Meet Psychology and Code
[22:40] Why Data Experts Deserve a Seat at the Table
[24:59] Humanloop and the Art of Configuration Taming
[28:23] What Actually Matters in Customer-Facing AI
[33:43] Starting Fresh with Private Models That Deliver
[34:58] How LLM Agents Are Changing the Way We Talk
[39:23] The Secret Lives of Prompts Inside Frameworks
[42:58] Streaming Showdowns — Creativity vs. Convenience
[46:26] Meet Our Auto-Tuning AI Prototype
[49:25] Building the Blueprint for Smarter AI
[51:24] Feedback Isn't Optional — It's Everything

Engineering Kiosk
#198 RBAC & Co: Who Is Allowed to Do What? Sounds Trivial, but It's Damn Important!

Jun 3, 2025 (67:34)


Who is actually allowed to do what? And should we all really be allowed to do everything?

Every tech project starts with a simple question: who is actually allowed to do what? But at the latest when the startup grows, customers demand compliance, or the first intern touches the production database, Role-Based Access Control (RBAC) suddenly becomes a matter of survival, and anyone who underestimates the topic quickly ends up in permissions hell.

In this episode we take apart the well-known concept of role-based access control. We clarify which problem RBAC actually solves in concrete terms, why so much technical depth and organizational drama hides behind the harmless-looking checkboxes, and why not all RBAC is the same.

Along the way we give you practical insights: how do Grafana, Sentry, Elasticsearch, OpenSearch, or tracing tools like Jaeger implement this permission concept? Where are the pitfalls in complex, multi-tenant systems?

Whether you finally want to understand why RBAC, ABAC (Attribute-Based), ReBAC (Relationship-Based), and policy engines are more than just buzzwords, or want to know how to get policies, edge cases, and constraints under control, that is what this deep dive is about.

Also included: open source highlights such as Casbin, SpiceDB, OpenFGA, and OPA, plus real project and startup tips for a pragmatic start and later scaling.

Bonus: a fairy tale with Kevin and Max, in which the intern sometimes still wins against the admin.

Cup o' Go

May 29, 2025 (31:04), transcription available


This episode was sponsored by Elastic! Elastic is the company behind Elasticsearch; they help teams find, analyze, and act on their data in real time through their Search, Observability, and Security solutions. Thanks Elastic! This episode was recorded at Elastic's offices in San Francisco during a meetup.

Find info about the show, past episodes including transcripts, our swag store, Patreon link, and more at https://cupogo.dev/.

Code RED
#26 - Rethinking Query Standards: Jacek Migdal on SQL Translation, Portability and Observability

May 29, 2025 (32:51)


Quesma CEO Jacek Migdal joins Mirko Novakovic to explore the future of querying and database interoperability. They dive into how Quesma rewrites SQL on the fly for cross-system compatibility, and they tackle the lack of query language standards in observability. Jacek also shares his thoughts on AI-assisted querying and why decoupling applications from databases is key to long-term flexibility.

The Data Stack Show
246: AI, Abstractions, and the Future of Data Engineering with Pete Hunt of Dagster

May 28, 2025 (48:59)


Highlights from this week's conversation include:

  • Pete's Background and Journey in Data (1:36)
  • Evolution of Data Practices (3:02)
  • Integration Challenges with Acquired Companies (5:13)
  • Trust and Safety as a Service (8:12)
  • Transition to Dagster (11:26)
  • Value Creation in Networking (14:42)
  • Observability in Data Pipelines (18:44)
  • The Era of Big Complexity (21:38)
  • Abstraction as a Tool for Complexity (24:41)
  • Composability and Workflow Engines (28:08)
  • The Need for Guardrails (33:13)
  • AI in Development Tools (36:24)
  • Internal Components Marketplace (40:14)
  • Reimagining Data Integration (43:03)
  • Importance of Abstraction in Data Tools (46:17)
  • Parting Advice for Listeners and Closing Thoughts (48:01)

The Data Stack Show is a weekly podcast powered by RudderStack, customer data infrastructure that enables you to deliver real-time customer event data everywhere it's needed to power smarter decisions and better customer experiences. Each week, we'll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

IBM Analytics Insights Podcasts
Part 2: Automation Remix: Observability, IBM Concert & the Next Wave of IT Ops

May 28, 2025 (20:26)


We're back for Part 2 of our Automation deep-dive, and the hits just keep coming! Host Al Martin reunites with IBM automation aces Sarah McAndrew (WW Automation Technical Sales) and Vikram Murali (App Mod & IT Automation Development) to push past the hype and map out the road ahead.

Making Data Simple
Part 2: Automation Remix: Observability, IBM Concert & the Next Wave of IT Ops

May 28, 2025 (20:26)


We're back for Part 2 of our Automation deep-dive, and the hits just keep coming! Host Al Martin reunites with IBM automation aces Sarah McAndrew (WW Automation Technical Sales) and Vikram Murali (App Mod & IT Automation Development) to push past the hype and map out the road ahead.

OpenObservability Talks
ClickHouse: Breaking the Speed Limit for Observability and Analytics - OpenObservability Talks S5E12

May 27, 2025 (58:27)


The ClickHouse® project is a rising star in observability and analytics, challenging performance conventions with its breakneck speed. This open source OLAP column store, originally developed at Yandex to power their web analytics platform at massive scale, has quickly evolved into one of the hottest open source observability data stores around. Its published performance benchmarks have been the topic of conversation, outperforming many legacy databases and setting a new bar for fast queries over large volumes of data.

Our guest for this episode is Robert Hodges, CEO of Altinity, the second largest contributor to the ClickHouse project. With over 30 years of experience in databases, Robert brings deep insights into how ClickHouse is challenging legacy databases at scale. We'll also explore Altinity's just-launched groundbreaking open source project, Project Antalya, which extends ClickHouse with Apache Iceberg shared storage, unlocking dramatic improvements in both performance and cost efficiency. Think 90% reductions in storage costs and 10 to 100x faster queries, all without requiring any changes to your existing applications.

The episode was live-streamed on 20 May 2025 and the video is available at https://www.youtube.com/watch?v=VeyTL2JlWp0

You can read the recap post: https://medium.com/p/2004160b2f5e/

OpenObservability Talks episodes are released monthly, on the last Thursday of each month and are available for listening on your favorite podcast app and on YouTube.

We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and chime in with your comments and questions on the live chat.
https://www.youtube.com/@openobservabilitytalks
https://www.twitch.tv/openobservability

Show Notes:
00:00 - Intro
01:38 - ClickHouse elevator pitch
02:46 - guest intro
04:48 - ClickHouse under the hood
08:15 - SQL and the database evolution path
11:20 - the return of SQL
16:13 - design for speed
17:14 - use cases for ClickHouse
19:18 - ClickHouse ecosystem
22:22 - ClickHouse on Kubernetes
31:45 - know how ClickHouse works inside to get the most out of it
38:59 - ClickHouse for Observability
46:58 - Project Antalya
55:03 - Kubernetes 1.33 release
55:32 - OpenSearch 3.0 release
56:01 - New Permissive License for ML Models Announced by the Linux Foundation
57:08 - Outro

Resources:
  • ClickHouse on GitHub: https://github.com/ClickHouse/ClickHouse
  • Shopify's Journey to Planet-Scale Observability: https://medium.com/p/9c0b299a04dd
  • Project Antalya: https://altinity.com/blog/getting-started-with-altinitys-project-antalya
  • https://cmtops.dev/posts/building-observability-with-clickhouse/
  • Kubernetes 1.33 release highlights: https://www.linkedin.com/feed/update/urn:li:activity:7321054742174924800/
  • New Permissive License for Machine Learning Models Announced by the Linux Foundation: https://www.linkedin.com/feed/update/urn:li:share:7331046183244611584
  • OpenSearch 3.0 major release: https://www.linkedin.com/posts/horovits_opensearch-activity-7325834736008880128-kCqr

Socials:
  • Twitter: https://twitter.com/OpenObserv
  • YouTube: https://www.youtube.com/@openobservabilitytalks

Dotan Horovits
  • X (Twitter): @horovits
  • LinkedIn: www.linkedin.com/in/horovits
  • Mastodon: @horovits@fosstodon
  • BlueSky: @horovits.bsky.social

Robert Hodges
  • LinkedIn: https://www.linkedin.com/in/berkeleybob2105/

ITOps, DevOps, AIOps - All Things Ops
Ep. 53 - From 5,000 Alerts to AI-Ready Ops: Inside Acrisure's Observability Overhaul - with Jordon Peeple

May 27, 2025 (34:52)


Jordon Peeple is Head of IT Infrastructure Operations at Acrisure, the fast-growing fintech powerhouse you've probably used without even knowing it.

In this episode, Jordon shares how his team turned 5,000 ignored alerts into a focused, AI-ready monitoring system. He explains how they cut through the noise, rebuilt escalation chains, and shifted from reactive ops toward proactive, business-aligned observability, supported by a complete IT org restructure.

You'll learn:
1. How to reorganize your ops team for scale and collaboration
2. Why reducing alerts boosts reliability and response time
3. What makes or breaks your escalation chain strategy
4. Why involving business owners early pays dividends in monitoring
5. What it takes to future-proof observability for hybrid infrastructure

Get in touch with Jordon on LinkedIn: https://www.linkedin.com/in/jordonpeeple/

About the host Elias Voelker: Elias is the VP for North America at Checkmk. He comes from a strategy consulting background but has been an entrepreneur for the better part of the last 10 years. In his spare time, he likes to do triathlons. Get in touch with Elias via LinkedIn or email podcast@checkmk.com.

Podcast Music: Music by Ströme, used by permission. 'Panta Rhei' written by Mario Schoenhofer. (c)+p 2022, Compost Medien GmbH & Co KG
https://stroeme.com/
https://compost-rec.com/

Thanks to our friends at SAWOO for producing this episode with us!

PurePerformance
The Research Behind the AI and Observability Innovation with Otmar Ertl and Martin Flechl

May 26, 2025 (50:59)


Scientific research is the foundation of many innovative solutions in any field. Did you know that Dynatrace runs its own Research Lab within the Campus of the Johannes Kepler University (JKU) in Linz, Austria - just 2 kilometers away from our global engineering headquarters? What started in 2020 has grown to 20 full-time researchers and many more students who do research on topics such as GenAI, Agentic AI, Log Analytics, Processing of Large Data Sets, Sampling Strategies, Cloud Native Security, and Memory and Storage Optimizations.

Tune in and hear from Otmar and Martin how they are researching the N+2 generation of Observability and AI, how they are contributing to open source projects such as OpenTelemetry, and what their predictions are for when AI finally takes control of us humans!

To learn more about their work check out these links:
  • Martin's LinkedIn: https://www.linkedin.com/in/mflechl/
  • Otmar's LinkedIn: https://www.linkedin.com/in/otmar-ertl/
  • Dynatrace Research Lab: https://careers.dynatrace.com/locations/linz/#__researchLab

Getup Kubicast
#169 - Talking About Talking - Career and Networking

May 22, 2025 (57:05)


In episode 169 of Kubicast, we chat with Rafael Ferreira about a fundamental but often neglected topic: the art of conversation. Yes, we talked about talking! In a relaxed, good-humored way, we break down how communication affects our careers, our networking, and even how we dress at tech events.

We talk about GIFs in talks, about the nerve ("cara de pau") that helps you break out of bubbles, and about how being the best is no use if nobody knows it. Rafael shared lessons learned from events, behind-the-scenes stories from Low Ops, and his journey to becoming a Microsoft MVP. Spoiler: he used the podcast as a networking strategy. And it worked.

Join our early access program for Zero CVE Images: getup.io/zerocve

Kubicast is produced by Getup, a company specializing in Kubernetes and open source projects for Kubernetes. Episodes of the podcast are available on the main digital audio platforms and at YouTube.com/@getupcloud.

The Data Stack Show
244: Postgres to ClickHouse: Simplifying the Modern Data Stack with Aaron Katz & Sai Krishna Srirampur

May 20, 2025 (34:51)


Highlights from this week's conversation include:

  • Background of ClickHouse (1:14)
  • PostgreSQL Data Replication Tool (3:19)
  • Emerging Technologies Observations (7:25)
  • Observability and Market Dynamics (11:26)
  • Product Development Challenges (12:39)
  • Challenges with PostgreSQL Performance (15:30)
  • Philosophy of Open Source (18:01)
  • Open Source Advantages (22:56)
  • Simplified Stack Vision (24:48)
  • End-to-End Use Cases (28:13)
  • Migration Strategies (30:21)
  • Final Thoughts and Takeaways (33:29)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we'll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Federal Tech Podcast: Listen and learn how successful companies get federal contracts
Ep. 239 Boosting Federal Cybersecurity with Agentless Observability

May 20, 2025 (24:38)


Connect to John Gilroy on LinkedIn: https://www.linkedin.com/in/john-gilroy/

Want to listen to other episodes? www.Federaltechpodcast.com

AFCEA's TechNet Cyber conference, held in Baltimore, Maryland, was the perfect opportunity to sit down with Bryan Rosensteel, Head of Public Sector Marketing at Wiz. Wiz is the “new kid on the block,” and it has had tremendous growth. During the interview, Bryan Rosensteel shows how agentless approaches can improve visibility and assist with compliance.

We all know how complexity has infiltrated federal technology. We have the usual suspects of Cloud Service Providers, hybrid clouds, private clouds, and, if that was not complicated enough, alt-clouds. As a result, it is almost impossible to get the “bird's eye” visibility needed to provide cybersecurity. Two main ways have been proposed to secure this much-desired system view.

Agent. One approach is to put a bit of code on each device, called the “agent” method. It is good for granular control, but it can slow down a scan and must be maintained.

Agentless. Bryan Rosensteel from Wiz describes an “agentless” method to gain visibility into complex systems. This method leverages infrastructure and protocols to accomplish the scanning objective much faster. Bryan Rosensteel states that in a world of constant attacks, this faster method allows for rapid updates to threats.

Beyond better observation, an agentless method, like the one provided by Wiz, allows for compliance automation and continuous monitoring, and sets the groundwork for effective Zero Trust implementation.

Startup Project
How Chronosphere Solved Observability in Containerized Environments to Build $1.6B Company | Uber spin-out, 5x Cheap & Impact of AI in Observability | CEO Martin Mao | Startup Project #101

May 18, 2025 (50:47)


Martin Mao is the co-founder and CEO of Chronosphere, an observability platform built for the modern containerized world. Prior to Chronosphere, Martin led the observability team at Uber, tackling the unique challenges of large-scale distributed systems. With a background as a technical lead at AWS, Martin brings unique experience in building scalable and reliable infrastructure. In this episode, he shares the story behind Chronosphere, its approach to cost-efficient observability, and the future of monitoring in the age of AI.

What you'll learn:

  • The specific observability challenges that arise when transitioning to containerized environments and microservices architectures, including increased data volume and new problem sources.
  • How Chronosphere addresses the issue of wasteful data storage by providing features that identify and optimize useful data, ensuring customers only pay for valuable insights.
  • Chronosphere's strategy for competing with observability solutions offered by major cloud providers like AWS, Azure, and Google Cloud, focusing on a specialized end-to-end product.
  • The innovative ways in which Chronosphere's products, including their observability platform and telemetry pipeline, improve the process of detecting and resolving problems.
  • How Chronosphere is leveraging AI and knowledge graphs to normalize unstructured data, enhance its analytics engine, and provide more effective insights to customers.
  • Why targeting early adopters and tech-forward companies is beneficial for product innovation, providing valuable feedback for further improvements and new features.
  • How observability requirements are changing with the rise of AI and LLM-based applications, and the unique data collection and evaluation criteria needed for GPUs.

Takeaways:

  • Chronosphere originated from the observability challenges faced at Uber, where existing solutions couldn't handle the scale and complexity of a containerized environment.
  • Cost efficiency is a major differentiator for Chronosphere, offering significantly better cost-benefit ratios compared to other solutions, making it attractive for companies operating at scale.
  • The company's telemetry pipeline product can be used with existing observability solutions like Splunk and Elastic to reduce costs without requiring a full platform migration.
  • Chronosphere's architecture is purposely single-tenanted to minimize coupled infrastructures, ensuring reliability and continuous monitoring even when core components go down.
  • AI-driven insights for observability may not benefit from LLMs that are trained on private business data, which can be diverse and may cause models to overfit to a specific case.
  • Many tech-forward companies are using the platform to monitor model training, which involves GPU clusters and evaluation criteria unlike those for general CPU workloads.
  • The company found huge potential in scrubbing the diverse data and building knowledge graphs to be used as a source of useful information when problems are recognized.

Subscribe to Startup Project for more engaging conversations with leading entrepreneurs!
→ Email updates: https://startupproject.substack.com/

#StartupProject #Chronosphere #Observability #Containers #Microservices #Uber #AWS #Monitoring #CloudNative #CostOptimization #AI #ArtificialIntelligence #LLM #MLOps #Entrepreneurship #Podcast #YouTube #Tech #Innovation

O11ycast
Ep. #81, Observability 3.0 and Beyond with Hazel Weakly and Matt Klein

O11ycast

Play Episode Listen Later May 14, 2025 40:36


In episode 81 of o11ycast, Charity Majors and Martin Thwaites dive into a lively discussion with Hazel Weakly and Matt Klein on the evolving landscape of observability. The guests explore the concept of observability versioning, the challenges of cost and ROI, and the future of observability tools, including the potential convergence with AI and business intelligence.

TestGuild Performance Testing and Site Reliability Podcast
Observability at Scale with AI with Jacob Leverich

TestGuild Performance Testing and Site Reliability Podcast

Play Episode Listen Later May 14, 2025 36:47


In this episode of the DevOps Toolchain podcast, Joe Colantonio sits down with Jacob Leverich, cofounder and Chief Product Officer at Observe, to explore how AI and cutting-edge data strategies are transforming the world of observability. With a career spanning heavyweight roles from Splunk to Google and Kuro Labs, Jacob shares his journey from banging out Perl scripts as a Linux sysadmin to building scalable, data-driven solutions that address the complex realities of today's digital infrastructure. Tune in as Joe and Jacob explore why traditional monitoring approaches are struggling with massive data volumes, how knowledge graphs and data lakes are breaking down tool silos, and what engineering leaders often get wrong when scaling visibility across teams. Whether you're a tester, developer, SRE, or team lead, get ready to discover actionable insights on maximizing the value of your data, the true role of AI in troubleshooting, and practical tips for leading your organization into the future of DevOps observability. Don't miss it! Try out Insight Hub free for 14 days now: https://testguild.me/insighthub. No credit card required.

Heavybit Podcast Network: Master Feed
Ep. #81, Observability 3.0 and Beyond with Hazel Weakly and Matt Klein

Heavybit Podcast Network: Master Feed

Play Episode Listen Later May 14, 2025 40:36


In episode 81 of o11ycast, Charity Majors and Martin Thwaites dive into a lively discussion with Hazel Weakly and Matt Klein on the evolving landscape of observability. The guests explore the concept of observability versioning, the challenges of cost and ROI, and the future of observability tools, including the potential convergence with AI and business intelligence.

Everyday AI Podcast – An AI and ChatGPT Podcast
EP 524: Agentic AI Done Right - How to avoid missing out or messing up.

Everyday AI Podcast – An AI and ChatGPT Podcast

Play Episode Listen Later May 13, 2025 18:33


Agentic AI is as daunting as it is dynamic. So…… how do you not screw it up? After all, the more robust and complex agentic AI becomes, the more room there is for error. Luckily, we've got Dr. Maryam Ashoori to guide our agentic ways. Maryam is the Senior Director of Product Management of watsonx at IBM. She joined us at IBM Think 2025 to break down agentic AI done right.
Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion: Have a question? Join the convo here.
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn
Topics Covered in This Episode:
Agentic AI Benefits for Enterprises
Watson X's New Features & Announcements
AI-Powered Enterprise Solutions at IBM
Responsible Implementation of Agentic AI
LLMs in Enterprise Cost Optimization
Deployment and Scalability Enhancements
AI's Impact on Developer Productivity
Problem-Solving with Agentic AI
Timestamps:
00:00 AI Agents: A Business Imperative
06:14 "Optimizing Enterprise Agent Strategy"
09:15 Enterprise Leaders' AI Mindset Shift
09:58 Focus on Problem-Solving with Technology
13:34 "Boost Business with LLMs"
16:48 "Understanding and Managing AI Risks"
Keywords:
Agentic AI, AI agents, Agent lifecycle, LLMs taking actions, WatsonX.ai, Product management, IBM Think conference, Business leaders, Enterprise productivity, WatsonX platform, Custom AI solutions, Environmental Intelligence Suite, Granite Code models, AI-powered code assistant, Customer challenges, Responsible AI implementation, Transparency and traceability, Observability, Optimization, Larger compute, Cost performance optimization, Chain of thought reasoning, Inference time scaling, Deployment service, Scalability of enterprise, Access control, Security requirements, Non-technical users, AI-assisted coding, Developer time-saving, Function calling, Tool calling, Enterprise data integration, Solving enterprise problems, Responsible implementation, Human in the loop, Automation, IBM savings, Risk assessment, Empowering workforce.

Packet Pushers - Full Podcast Feed
TNO028: Move From Monitoring to Full Internet Stack Observability: New Strategies for NetOps (Sponsored)

Packet Pushers - Full Podcast Feed

Play Episode Listen Later May 9, 2025 52:35


Network monitoring, Internet monitoring, and observability are all key components of NetOps. We speak with sponsor Catchpoint to understand how Catchpoint can help network operators proactively identify and resolve issues before they impact customers. We discuss past and current network monitoring strategies and the challenges that operators face with both on-prem and cloud monitoring, along... Read more »

Packet Pushers - Fat Pipe
TNO028: Move From Monitoring to Full Internet Stack Observability: New Strategies for NetOps (Sponsored)

Packet Pushers - Fat Pipe

Play Episode Listen Later May 9, 2025 52:35


Network monitoring, Internet monitoring, and observability are all key components of NetOps. We speak with sponsor Catchpoint to understand how Catchpoint can help network operators proactively identify and resolve issues before they impact customers. We discuss past and current network monitoring strategies and the challenges that operators face with both on-prem and cloud monitoring, along... Read more »

AWS re:Think Podcast
Episode 40: AI Observability and Evaluation with Arize AI

AWS re:Think Podcast

Play Episode Listen Later May 7, 2025 39:04


AI can still sometimes hallucinate and give less than optimal answers. To address this, we are joined by Arize AI's Co-Founder Aparna Dhinakaran for a discussion on Observability and Evaluation for AI. We begin by discussing the challenges of AI Observability and Evaluation. For example, how does “LLM as a Judge” work? We conclude with some valuable advice from Aparna for first-time entrepreneurs.
Begin Observing and Evaluating your AI Applications with Open Source Phoenix: https://phoenix.arize.com/
AWS Hosts: Nolan Chen & Malini Chatterjee
Email Your Feedback: rethinkpodcast@amazon.com

The Cloud Gambit
Seeing Through the Clouds: Observability with Justin Ryburn

The Cloud Gambit

Play Episode Listen Later May 6, 2025 48:30 Transcription Available


Justin Ryburn is the Field CTO at Kentik and works as a Limited Partner (LP) for Stage 2 Capital. Justin has 25 years of experience in network operations, engineering, sales, and marketing with service providers and vendors. In this conversation, we discuss startup funding, the challenges that organizations face with hybrid and multi-cloud visibility, the impact of AI on network monitoring, and explore how companies can build more reliable systems through proper observability practices.
Where to Find Justin
LinkedIn: https://www.linkedin.com/in/justinryburn/
Twitter: https://x.com/JustinRyburn
Blog: http://ryburn.org/
Talks: https://www.youtube.com/playlist?list=PLRrjaaisdWrYaue9KVLRdq5mlGE_2i0RT
Show Links
Kentik: https://www.kentik.com/
Day One: Deploying BGP FlowSpec: https://www.juniper.net/documentation/en_US/day-one-books/DO_BGP_FLowspec.pdf
Stage 2 Capital: https://www.stage2.capital/
Doug Madory's Internet Analysis: https://www.kentik.com/blog/author/doug-madory/
Netflix Tech Blog: https://netflixtechblog.com/
Multi-Region AWS: https://www.pluralsight.com/resources/blog/cloud/why-and-how-do-we-build-a-multi-region-active-active-architecture
AutoCon: https://events.networktocode.com/autocon/
Follow, Like, and Subscribe!
Podcast: https://www.thecloudgambit.com/
YouTube: https://www.youtube.com/@TheCloudGambit
LinkedIn: https://www.linkedin.com/company/thecloudgambit
Twitter: https://twitter.com/TheCloudGambit
TikTok: https://www.tiktok.com/@thecloudgambit

The New Stack Podcast
Prequel: Software Errors Be Gone

The New Stack Podcast

Play Episode Listen Later May 5, 2025 5:13


Prequel is launching a new developer-focused service aimed at democratizing software error detection, an area typically dominated by large cloud providers. Co-founded by Lyndon Brown and Tony Meehan, both former NSA engineers, Prequel introduces a community-driven observability approach centered on Common Reliability Enumerations (CREs). CREs categorize recurring production issues, helping engineers detect, understand, and communicate problems without reinventing solutions or working in isolation. Their open-source tools, cre and prereq, allow teams to build and share detectors that catch bugs and anti-patterns in real time, without exposing sensitive data, thanks to edge processing using WebAssembly.
The urgency behind Prequel's mission stems from the rapid pace of AI-driven development, increased third-party code usage, and rising infrastructure costs. Traditional observability tools may surface symptoms, but Prequel aims to provide precise problem definitions and actionable insights. While observability giants like Datadog and Splunk dominate the market, Brown and Meehan argue that engineers still feel overwhelmed by data and underpowered in diagnostics, something they believe CREs can finally change.
Learn more from The New Stack about the latest Observability insights:
Why Consolidating Observability Tools Is a Smart Move
Building an Observability Culture: Getting Everyone Onboard
Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

UX Research Geeks
Su Milazzo | Building effective operations in design and research | #56

UX Research Geeks

Play Episode Listen Later May 5, 2025 35:13


Su Milazzo talks about the role of operations in UX, especially where design and research operations overlap. She explains that operations work to reduce friction and improve workflows for internal teams, using empathy and change management to help them succeed.

EM360 Podcast
How Do AI and Observability Redefine Application Performance?

EM360 Podcast

Play Episode Listen Later May 2, 2025 29:08


"Having the insight and being able to stitch together your technical resources and business decisions together, is the prime place where observability can add value to you," stated Manesh Tailor, EMEA Field CTO at New Relic.
In this episode of the Tech Transformed podcast, Kevin Petrie, Vice President of Research at BARC, speaks with Manesh Tailor about the intersection of artificial intelligence (AI) and observability, and how this is positively changing business operations.
Tailor emphasises how intelligent observability has moved beyond simple monitoring to provide real-time insights into customer experience and the entire technology stack. This enables informed decisions across engineering, operations, and business domains, directly linking technical performance to strategic business outcomes.
He also discusses the different stages observability has been through and where it is heading now. The current wave, Observability 3.0, takes advantage of AI to predict issues and even enable self-healing systems. New Relic has embraced this two-way street, using AI within its platform to help users and offering "AI monitoring" to track the performance of language models alongside traditional metrics. Such a platform provides a holistic view of system health and the cost implications of AI deployments.
On the management of AI-powered applications, Tailor says collaboration between application and data science teams is key. It not only provides real-time data but also leads to more efficient decision-making.
Looking ahead, the rapid proliferation of AI agents has both pros and cons for observability. This is where New Relic comes in. It addresses the challenges by constructing a platform-centric "AI orchestrator" with a growing library of AI-native agents. In essence, as AI-powered applications become increasingly integral to business operations, intelligent observability is no longer optional.
Takeaways
Observability is crucial for understanding unknowns in systems.
AI enhances observability by providing predictive insights.
The evolution of observability includes intelligent monitoring.
Collaboration between technical and business teams is essential.
Cost efficiency is a key focus in modern observability.
Real-time data is vital for effective decision-making.
Self-healing systems represent the future of observability.
AI and observability must work in tandem for success.
The complexity of systems is increasing, requiring better tools.
Observability is applicable across all organizational levels.
Chapters
00:00 Introduction to AI and Observability
03:10 Defining Observability and Its Evolution
05:49 The Role of AI in Observability
08:46 Navigating AI-Driven Applications
11:52 Target Users and Community for Observability
14:57 Collaboration Across Teams
17:55 Challenges and Opportunities in Observability
20:47 The Future of Observability and AI
23:54 Key Takeaways for CIOs and IT Leaders
About New Relic
The New Relic Intelligent Observability Platform empowers businesses to proactively eliminate disruptions in their digital experiences. As the only AI-enhanced platform that unifies and correlates telemetry data, New...

Catalog & Cocktails
TAKEAWAYS - What is Data + AI Observability and Why It's Part of Your Competitive Moat with Barr Moses

Catalog & Cocktails

Play Episode Listen Later May 1, 2025 4:10


Barr Moses, CEO & Co-Founder of Monte Carlo, challenges the notion that models alone create competitive advantage, arguing instead that the real moat lies in how organizations manage their proprietary data and ensure end-to-end reliability. Tim and Juan chat with Barr to get the Honest, No-BS scoop of what AI observability is (hint, it's really data + AI) and how organizations can build resilient AI applications.

Catalog & Cocktails
What is Data + AI Observability and Why It's Part of Your Competitive Moat with Barr Moses

Catalog & Cocktails

Play Episode Listen Later May 1, 2025 53:09


Barr Moses, CEO & Co-Founder of Monte Carlo, challenges the notion that models alone create competitive advantage, arguing instead that the real moat lies in how organizations manage their proprietary data and ensure end-to-end reliability. Tim and Juan chat with Barr to get the Honest, No-BS scoop of what AI observability is (hint, it's really data + AI) and how organizations can build resilient AI applications.

AWS for Software Companies Podcast
Ep097: Specialized Agents & Agentic Orchestration - New Relic and the Future of Observability

AWS for Software Companies Podcast

Play Episode Listen Later Apr 28, 2025 29:04


New Relic's Head of AI and ML Innovation, Camden Swita, discusses their four-cornered AI strategy and envisions a future of "agentic orchestration" with specialized agents.
Topics Include:
Introduction of Camden Swita, Head of AI at New Relic.
New Relic invented the observability space for monitoring applications.
Started with Java workloads monitoring and APM.
Evolved into full-stack observability with infrastructure and browser monitoring.
Uses advanced query language (NRQL) with time series database.
AI strategy focuses on AI ops for automation.
First cornerstone: Intelligent detection capabilities with machine learning.
Second cornerstone: Incident response with generative AI assistance.
Third cornerstone: Problem management with root cause analysis.
Fourth cornerstone: Knowledge management to improve future detection.
Initially overwhelmed by "ocean of possibilities" with LLMs.
Needed narrow scope and guardrails for measurable progress.
Natural language to NRQL translation proved immensely complex.
Selecting from thousands of possible events caused accuracy issues.
Shifted from "one tool" approach to many specialized tools.
Created routing layer to select right tool for each job.
Evaluation of NRQL is challenging even when syntactically correct.
Implemented multi-stage validation with user confirmation step.
AWS partnership involves fine-tuning models for NRQL translation.
Using Bedrock to select appropriate models for different tasks.
Initially advised prototyping on biggest, best available models.
Now recommends considering specialized, targeted models from start.
Agent development platforms have improved significantly since beginning.
Future focus: "Agentic orchestration" with specialized agents.
Envisions agents communicating through APIs without human prompts.
Integration with AWS tools like Amazon Q.
Industry possibly plateauing in large language model improvements.
Increasing focus on inference-time compute in newer models.
Context and quality prompts remain crucial despite model advances.
Potential pros and cons to inference-time compute approach.
Participants:
Camden Swita – Head of AI & ML Innovation, Product Management, New Relic
See how Amazon Web Services gives you the freedom to migrate, innovate, and scale your software company at https://aws.amazon/isv/

OpenObservability Talks
CNCF Ambassadors Share the Best of KubeCon EU 2025 - OpenObservability Talks S5E11

OpenObservability Talks

Play Episode Listen Later Apr 28, 2025 62:54


KubeCon Europe 2025 in London has wrapped up, and we're bringing you all the highlights, trends, and behind-the-scenes insights straight from the show floor!
In this special recap episode, I'm joined by two CNCF Ambassadors and community powerhouses: Kasper Borg Nissen, the Co-Chair of this KubeCon as well as of the KubeCon 2024 editions, and a Developer Relations Engineer at Dash0; and William Rizzo, Consulting Architect at Mirantis and Linkerd Ambassador.
Together, we unpack the major themes from the event: platform engineering and internal developer platforms, open source observability, and where Kubernetes is headed next. We also chat about the vibe of the community, emerging projects to watch, and important trends in the European tech sphere.
Whether you missed the conference or want to catch up on important updates you might have missed, this episode gives you a curated take straight from the experts who know the cloud-native space inside out.
The episode was live-streamed on 22 April 2025 and the video is available at https://www.youtube.com/watch?v=JyxJOmOEBvQ
You can read the recap post: https://medium.com/p/740258a5fa46
OpenObservability Talks episodes are released monthly, on the last Thursday of each month, and are available for listening on your favorite podcast app and on YouTube.
We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and chime in with your comments and questions on the live chat.
https://www.youtube.com/@openobservabilitytalks
https://www.twitch.tv/openobservability
Show Notes:
00:00 - intro
03:28 - KubeCon impressions
09:59 - Backstage turns 5
18:56 - CNCF turns 10 and CNCF annual survey
27:22 - Sovereign cloud in Europe and the NeoNephos initiative
33:55 - CI/CD use in production increases
36:52 - OpenInfra joins the Linux Foundation
40:16 - Cloud native local communities, DEI and the BIPOC initiative
51:11 - Observability query standardization SIG updates
59:36 - outro
Resources:
CNCF 2024 Annual Survey: https://www.cncf.io/reports/cncf-annual-survey-2024/
NeoNephos initiative for sovereign EU cloud: https://www.linkedin.com/feed/update/urn:li:share:7313115943075766273/
OpenInfra Foundation and OpenStack join The Linux Foundation: https://www.linkedin.com/feed/update/urn:li:share:7307839934072066048/
Backstage turns 5: https://www.linkedin.com/feed/update/urn:li:activity:7318163557206966272/
Kubernetes 1.33 release: https://www.linkedin.com/feed/update/urn:li:activity:7321054742174924800/
Socials:
Twitter: https://twitter.com/OpenObserv
YouTube: https://www.youtube.com/@openobservabilitytalks
Dotan Horovits
============
Twitter: @horovits
LinkedIn: www.linkedin.com/in/horovits
Mastodon: @horovits@fosstodon
BlueSky: @horovits.bsky.social
Kasper Borg Nissen
===============
Twitter: https://www.twitter.com/phennex
LinkedIn: https://www.linkedin.com/in/kaspernissen/
BlueSky: https://bsky.app/profile/kaspernissen.xyz
William Rizzo
===========
Twitter: https://twitter.com/WilliamRizzo19
LinkedIn: https://www.linkedin.com/in/william-rizzo/
BlueSky: https://bsky.app/profile/williamrizzo.bsky.social

TechCrunch Startups – Spoken Edition
Datadog acquires AI-powered observability startup Metaplane

TechCrunch Startups – Spoken Edition

Play Episode Listen Later Apr 28, 2025 4:05


Cloud monitoring and security platform Datadog on Wednesday announced that it has acquired Metaplane, an AI-powered data observability startup, for an undisclosed amount. In a press release, Datadog said that the deal “accelerates” its expansion into data observability, building on the launch of related products.

Packet Pushers - Full Podcast Feed
Tech Bytes: Network Observability AIOps Tips For Success (Sponsored)

Packet Pushers - Full Podcast Feed

Play Episode Listen Later Apr 21, 2025 23:39


Today on the Tech Bytes podcast we're talking AI readiness with sponsor Broadcom. More specifically, getting your network observability ready to support AI operations. This isn't just a hardware or software issue. It's also a data issue. We'll get some tips with our guest Jeremy Rossbach. Jeremy is Chief Technical Evangelist and Lead Product Marketing... Read more »

Content Strategy Insights
Jeff Eaton: Content Observability in Complex Systems

Content Strategy Insights

Play Episode Listen Later Apr 21, 2025 31:37


Modern content systems are complex and abstract, presenting problems for managers who want to understand how their content is performing. At Autogram, Jeff Eaton and Karen McGrane have developed a content observability framework to address this complexity.  Their framework evaluates the composition, quality, health, and effectiveness of content programs to help enterprises measure the return on their content investment. https://ellessmedia.com/csi/jeff-eaton-2/

Packet Pushers - Briefings In Brief
Tech Bytes: Network Observability AIOps Tips For Success (Sponsored)

Packet Pushers - Briefings In Brief

Play Episode Listen Later Apr 21, 2025 23:39


Today on the Tech Bytes podcast we're talking AI readiness with sponsor Broadcom. More specifically, getting your network observability ready to support AI operations. This isn't just a hardware or software issue. It's also a data issue. We'll get some tips with our guest Jeremy Rossbach. Jeremy is Chief Technical Evangelist and Lead Product Marketing... Read more »

Software Engineering Daily
Prometheus and Open-Source Observability with Eric Schabell

Software Engineering Daily

Play Episode Listen Later Apr 15, 2025 46:06


Modern cloud-native systems are highly dynamic and distributed, which makes it difficult to monitor cloud infrastructure using traditional tools designed for static environments. This has motivated the development and widespread adoption of dedicated observability platforms. Prometheus is an open-source observability tool designed for cloud-native environments. Its strong integration with Kubernetes and pull-based data collection model...

AWS for Software Companies Podcast
Ep092: The Evolution of Monitoring: How New Relic is Transforming Cloud Operations

AWS for Software Companies Podcast

Play Episode Listen Later Apr 9, 2025 16:02


New Relic's Chief Customer Officer Arnaldo (Arnie) Lopez details how their observability platform helps 70,000+ customers monitor cloud performance through AWS infrastructure while introducing AI capabilities that simplify operations.
Topics Include:
Arnie Lopez is SVP, Chief Customer Officer at New Relic.
Oversees pre-sales, post-sales, technical support, and enablement teams.
New Relic University offers customer certifications.
Founded in 2008, pioneered application performance monitoring (APM).
Now offers "Observability 3.0" for full-stack visibility.
Prevents interruptions during cloud migration and operations.
Serves 70,000+ customers across various industries.
16,000 enterprise-level paying customers.
Platform consolidates multiple monitoring tools into one solution.
Helps detect issues before customers experience performance problems.
Market challenge: customers using disparate observability solutions.
Reduces TCO by eliminating multiple monitoring tools.
Targets VPs, CTOs, CIOs, and sometimes CEOs.
Decade-long partnership with AWS.
Platform built on largest unified telemetry data cloud.
Uses AWS Graviton instances and Amazon EKS.
AWS partnership enables innovation and customer trust.
Three AI approaches: user assistance, LLM monitoring, faster insights.
New Relic AI helps write query language (NRQL).
Monitors LLMs in customer environments.
Uses AI to accelerate incident resolution.
Lesson learned: should have started AI implementation sooner.
Many customers still cautiously adopting AI technologies.
Goal: continue growth with AWS partnership.
Offers compute-based pricing model.
Customers only pay for what they use.
Announced one-step AWS monitoring for enterprise scale.
Amazon Q Business and New Relic AI integration.
Agent-to-agent AI eliminates data silos.
Embeds performance insights into business application workflows.
Participants:
Arnie Lopez – SVP Chief Customer Officer, New Relic
See how Amazon Web Services gives you the freedom to migrate, innovate, and scale your software company at https://aws.amazon/isv/

Software Engineering Radio - The Podcast for Professional Software Developers
SE Radio 663: Tyler Flint on Managing External APIs

Software Engineering Radio - The Podcast for Professional Software Developers

Play Episode Listen Later Apr 8, 2025 52:27


Tyler Flint, CEO of qpoint.io, joins host Robert Blumen for a conversation about managing external vendor dependencies, including several best practices for adoption. They start with a look at internal versus external services, including details such as the footprint of external services within a micro-services application, and difficulties organizations have tracking their service consumption, quantifying service consumption, and auditing external services. Tyler also discusses the security implications of external services, including authentication and authorization. They examine metrics and monitoring, with recommendations on the key metrics to collect, as well as acceptable error rates for external services. From there they consider what can go wrong, how to respond to external service outages, and challenges related to testing external services. The episode wraps up with a discussion of qPoint's migration from a proxy-based solution to one based on eBPF kernel probes. Brought to you by IEEE Computer Society and IEEE Software magazine.

SolarWinds TechPod
Monitoring, Observability, and Operational Resilience

SolarWinds TechPod

Play Episode Listen Later Apr 8, 2025 41:22


In this episode of SolarWinds TechPod, hosts Chrystal Taylor and Sean Sebring explore the key differences between monitoring and observability with guest Jeff Stewart, GVP of Product Management at SolarWinds. Observability goes beyond traditional monitoring, offering AI-driven insights and a holistic view of system health. Like understanding the anatomy of the body, observability reveals how IT systems are interconnected—where one issue can ripple across the entire environment. They discuss how businesses can leverage observability to reduce downtime, improve efficiency, and stay ahead in a rapidly evolving tech landscape. © 2025 SolarWinds Worldwide, LLC. All rights reserved

Software Defined Talk
Episode 513: Put On A Musical

Software Defined Talk

Play Episode Listen Later Apr 4, 2025 47:50


This week, we discuss the shifting world of observability, the nightmare of “Configuration Hell,” and OpenAI's latest valuation. Plus, a surprise Broadway musical review!
Runner-up Titles
We say we're friends, but I don't really know them
Observability 2025
I don't have any sympathy for anyone
If you want to win observability, put on a musical
Just is THE trigger word
It's a well-known Hell
The blog posts are making me angry
Rundown
CISO MUSICAL | Official Broadway Trailer (https://www.youtube.com/watch?v=4W17F9Ho_38)
Monitoring is back
Observability 3.0 - bitdrift Blog (https://blog.bitdrift.io/post/observability-3-0)
Another observability 3.0 appears on the horizon (https://charity.wtf/2025/03/24/another-observability-3-0-appears-on-the-horizon/)
ControlTheory Secures $5M Seed Funding to Bring Controllability to Observability (https://www.controltheory.com/blog/controltheory-secures-5m-seed-funding-to-bring-controllability-to-observability/)
What is (https://www.controltheory.com/blog/what-is-controllability/)
Cloud veterans launch ConfigHub to fix 'configuration hell' (https://techcrunch.com/2025/03/26/cloud-veterans-launch-confighub-to-fix-configuration-hell/)
DOGE Plans to Rebuild SSA Codebase In Months, Risking Benefits and System Collapse (https://www.wired.com/story/doge-rebuild-social-security-administration-cobol-benefits/)
OpenAI Exclusive | The Secrets and Misdirection Behind Sam Altman's Firing From OpenAI (https://www.wsj.com/tech/ai/the-real-story-behind-sam-altman-firing-from-openai-efd51a5d?st=GmdXEX&reflink=desktopwebshare_permalink)
OpenAI closes $40 billion funding round, largest private tech deal on record (https://www.cnbc.com/2025/03/31/openai-closes-40-billion-in-funding-the-largest-private-fundraise-in-history-softbank-chatgpt.html)
Relevant to your Interests
How vibe coding will affect Engineering Managers (https://newsletter.manager.dev/p/effect-of-ai-on-engineering-managers)
Mastering GitHub Copilot: When to use AI agent mode (https://github.blog/ai-and-ml/github-copilot/mastering-github-copilot-when-to-use-ai-agent-mode/)
Using Spring AI 1.0.0-SNAPSHOT: Important Changes and Updates (https://spring.io/blog/2025/03/27/spring-ai-update-to-snapshots)
Former Intel CEO Pat Gelsinger Makes a Few More Long-Shot Bets (https://www.wsj.com/articles/former-intel-ceo-pat-gelsinger-makes-a-few-more-long-shot-bets-01e7337f)
Pat Gelsinger has joined VC firm Playground Global (https://www.axios.com/newsletters/axios-pro-rata-ad45da7c-2daa-4290-b379-bba556718155.html?chunk=2&utm_term=emshare#story2)
Amazon Is Canceling a Major Alexa Privacy Feature on March 28: Should You Worry? (https://www.cnet.com/home/security/amazon-is-canceling-this-alexa-privacy-feature-on-march-28-should-you-worry/)
oneAPI: A New Era of Heterogeneous Computing (https://www.intel.com/content/www/us/en/developer/tools/oneapi/overview.html#gs.kqodnv)
Amazon unveils Nova Act, an AI agent that can control a web browser (https://techcrunch.com/2025/03/31/amazon-unveils-nova-act-an-ai-agent-that-uses-a-web-browser/)
Ransomware Found in VSCode Extensions Raises Concerns Over Microsoft's Security Review (https://www.cysecurity.news/2025/03/ransomware-found-in-vscode-extensions.html?m=1)
Lip-Bu Tan says Intel will spin off non-core units (https://techcrunch.com/2025/04/01/lip-bu-tan-says-intel-will-spin-off-non-core-units/)
Announcing Chainguard VMs: Minimal, Zero-CVE Container Host Images (https://www.chainguard.dev/unchained/announcing-chainguard-vms-minimal-zero-cve-container-host-images)
Andreessen Horowitz in talks to help buy out TikTok's Chinese owners (https://on.ft.com/4iXhAkG)
Nonsense
This couple is obsessed with Costco. Why do they love it so much? (https://www.deseret.com/2024/1/10/24031947/joy-of-costco-susan-and-david-schwartz-king-husein-utah/)
CISO MUSICAL | Official Broadway Trailer (https://www.youtube.com/watch?v=4W17F9Ho_38)
Conferences
DevOps Days Atlanta (https://devopsdays.org/events/2025-atlanta/welcome/), April 29-30
Cloud Foundry Day US (https://events.linuxfoundation.org/cloud-foundry-day-north-america/), May 14th, Palo Alto, CA
NDC Oslo (https://ndcoslo.com/), May 21-23, Coté speaking.
SDT News & Community
Join our Slack community (https://softwaredefinedtalk.slack.com/join/shared_invite/zt-1hn55iv5d-UTfN7mVX1D9D5ExRt3ZJYQ#/shared-invite/email)
Email the show: questions@softwaredefinedtalk.com (mailto:questions@softwaredefinedtalk.com)
Free stickers: Email your address to stickers@softwaredefinedtalk.com (mailto:stickers@softwaredefinedtalk.com)
Follow us on social media: Twitter (https://twitter.com/softwaredeftalk), Threads (https://www.threads.net/@softwaredefinedtalk), Mastodon (https://hachyderm.io/@softwaredefinedtalk), LinkedIn (https://www.linkedin.com/company/software-defined-talk/), BlueSky (https://bsky.app/profile/softwaredefinedtalk.com)
Watch us on: Twitch (https://www.twitch.tv/sdtpodcast), YouTube (https://www.youtube.com/channel/UCi3OJPV6h9tp-hbsGBLGsDQ/featured), Instagram (https://www.instagram.com/softwaredefinedtalk/), TikTok (https://www.tiktok.com/@softwaredefinedtalk)
Book offer: Use code SDT for $20 off "Digital WTF" by Coté (https://leanpub.com/digitalwtf/c/sdt)
Sponsor the show (https://www.softwaredefinedtalk.com/ads): ads@softwaredefinedtalk.com (mailto:ads@softwaredefinedtalk.com)
Recommendations
Brandon: OrbStack · Fast, light, simple Docker & Linux (https://orbstack.dev/)
Photo Credits
Header (https://unsplash.com/photos/red-theater-curtain-WW1jsInXgwM)