PurePerformance


The brutal truth about digital performance engineering and operations. Andreas (aka Andi) Grabner and Brian Wilson are veterans of the digital performance world. Combined they have seen too many applications not scaling and performing up to expectations. With more rapid deployment models made possi…


Latest episode: Jul 20, 2026
New episodes: every other week
Average duration: 42m
Episodes: 535


Latest episodes from PurePerformance

Blueprints for OTel Success: Standardizing Observability at Scale with Dan Gomez Blanco

Jul 20, 2026 · 52:17

"There is no single way to deploy OpenTelemetry at scale, and that's exactly the challenge." As organizations adopt OTel across teams and environments, they face tough questions around standardization, configuration, and operating resilient observability pipelines. To address these challenges, the OpenTelemetry community has introduced Blueprints and Reference Implementations: practical guidance on topics like data standards, consistent agent and collector configuration, pipeline resilience, and intelligent sampling. In this episode, we're joined by Dan Gomez Blanco, maintainer of the OpenTelemetry End-User SIG, to explore real-world reference architectures from organizations like Skyscanner, Adobe, and Mastodon. Tune in to learn how the community is turning OTel complexity into shared best practices, and how you can contribute your own blueprint.

success scale blueprint adobe gomez blanco mastodon blueprints observability standardizing skyscanner cncf dynatrace otel

OpenTelemetry and the Reality of Vendor Choice with Adriana Villela and Josh Lee

Jul 6, 2026 · 51:56

I still hear people say, “OpenTelemetry is vendor-neutral, so you can switch any time!” In this episode, Adriana Villela and Josh Lee (both active OpenTelemetry contributors) help bust that myth. While OTel standardizes instrumentation and signal transport, and unlocks a rich ecosystem of tools, switching vendors isn't as simple as it sounds. There's real cost in retraining engineers, migrating dashboards, SLOs, and alerts, and reworking deep integrations across your delivery pipeline. We also dive into a key challenge the community is tackling: helping engineers instrument by value, not by default, making it easier to capture the right signals with high quality instead of just collecting everything.

Here are the links we discussed:
Adriana's LinkedIn: https://www.linkedin.com/in/adrianavillela/
Josh's LinkedIn: https://www.linkedin.com/in/joshuamlee/
The blog article: https://thenewstack.io/opentelemetry-vendor-neutrality-guide/
CND Austria Talk: https://www.youtube.com/watch?v=1gxLseuaTdM
KCD Prague Talk: https://www.youtube.com/watch?v=pPXG20CXKxQ
OpenTelemetry Project Website: https://opentelemetry.io/

reality vendor observability cncf dynatrace villela slos otel josh lee

AI Is a Gift: Rethinking Software Engineering Education and Hiring

Jun 22, 2026 · 56:44

In this episode, we explore how AI is transforming education, from classrooms to corporate training. What changes are needed in schools and universities? How does AI affect both students and educators? And how should companies rethink internal training and hiring to stay competitive? To answer these questions, we're joined by Rainer Stropek, CEO of Software Architects and Chairman of Coding Club Linz. With decades of experience teaching at high schools and universities, and helping organizations upskill their engineers, Rainer brings a unique perspective on how software engineering education is evolving. While many view AI as a threat, Rainer sees it as a “Christmas gift” that opens up endless opportunities to learn, adapt, and innovate. Tune in to hear why curiosity is more important than ever, how educational institutions can prepare future engineers, and why organizations must step up to ensure everyone has a fair chance to succeed in the age of AI.

Links we discussed:
Rainer's LinkedIn Profile: https://www.linkedin.com/in/rainerstropek/
Rainer's Website: https://rainerstropek.me/
CodeClub: https://codeclub.org/en/
Coder DoJo Linz: https://linz.coderdojo.net/

christmas ceo ai education software hiring engineering rethinking rainer software engineering engineering education

Beyond the Hype: Open Source, Observability, and Finding Your AI Breakthrough

Jun 8, 2026 · 34:18

It's rare, but it happens: a guest-free episode of PurePerformance, allowing Andi Grabner and Brian Wilson to reconnect and share real-world insights from recent months in the cloud-native and observability space. From KubeCon Amsterdam experiences and the strength of open-source collaboration to emerging challenges like AI-generated contributions, they explore how the industry is evolving beyond the hype. Your co-hosts of PurePerformance discuss the changing role of observability in the AI-native era, both as a foundation for understanding complex systems and as a tool to monitor AI itself. Brian shares his personal shift from AI skepticism to practical adoption, highlighting how AI can significantly improve productivity when used thoughtfully. Hope you all enjoy this episode!

ai work performance cloud breakthrough open source brian wilson cloud native observability dynatrace kubecon beyond the hype

8 Factor Producers to Scale Platform Engineering in an AI-First world with Abby Bangser

May 25, 2026 · 34:35

In 2011 Heroku defined the 12 factor app to remove emerging bottlenecks as developers tried to scale their output when they moved from building monoliths to microservices. In Platform Engineering we see a repeating pattern called the "8 Factor Platform Producers". AI allows engineering teams to speed up, but they face bottlenecks because platform capabilities are not scaling with that demand: those capabilities often depend on a central platform engineering team to build and maintain them. To learn more about 8 Factor Platform Producers we invited Abby Bangser, Founding Principal Engineer at Syntasso and CNCF Ambassador. She gave an amazing talk at KubeCon in Amsterdam and today walks us through the need to define both consumers and producers for platforms, in order to eliminate emerging bottlenecks in Platform Engineering and allow an organization to reap the benefit of speeding up with AI.

Links we discussed:
Abby's LinkedIn: https://www.linkedin.com/in/abbybangser/
Abby's Kubecon Keynote: https://www.youtube.com/watch?v=8t0-5cvvMGM&list=PLj6h78yzYM2MXCOWSN9CqqID6OOvF7wxL&index=30
12 Factor Apps: https://12factor.net/
CNCF Whitepaper: https://cloudnativeplatforms.com/whitepapers/platforms/

ai scale amsterdam platform engineers factor producers monolith first world microservices heroku cncf platform engineering dynatrace kubecon

Observability in the AI‑Native Era with Hilliary Lipsig and Rob Rati

May 11, 2026 · 51:30

As the software world is transforming from cloud native to AI-native, observability must transform with it. But how exactly? How do we apply this in an existing enterprise with established processes and practices? In this PurePerformance episode, Andi Grabner hosts Hilliary Lipsig and Rob Rati to discuss their new book, Observability in the AI-Native Era. The conversation explores how AIOps, automation, and modern observability must evolve as systems become cloud-native, data-heavy, and AI-driven. We talk about why old alerting and SLO models no longer scale, how to balance AI with automation and human judgment, and why trust, security, and compliance matter more than ever when machines start making operational decisions. A must-listen for SREs, platform engineers, and engineering leaders navigating the AI-native future.

Links we discussed:
Book on Amazon: https://www.amazon.com/Observability-AI-Native-Era-Artificial-Intelligence-ebook/dp/B0GHZH1YFL
Hilliary LinkedIn: https://www.linkedin.com/in/hilliary-lipsig-a5935245/
Rob LinkedIn: https://www.linkedin.com/in/roberthrati/
Andi LinkedIn: https://www.linkedin.com/in/grabnerandi/

amazon ai cloud automation native slo observability aiops dynatrace sres rati

Don't babysit your AI Agents to keep them on track with Lukas Holzer

Apr 27, 2026 · 38:33

AI coding agents are fast, but speed alone doesn't guarantee quality. In this episode, Andi Grabner talks with Lukas Holzer (Straion) about why large context files and “almost right” AI code create new risks for engineering teams. You will learn about the "Lost in the Middle Syndrome" and why many organizations are not getting the promised 10x engineering boost right now! Andi and Lukas also explore rule adherence, dynamic context generation, enterprise readiness for AI-first development, and how software engineering roles are evolving in the age of AI. Tune in to learn more.

Links we discussed:
LinkedIn Profile: https://www.linkedin.com/in/lukas-holzer/
Straion Website: https://straion.com/
90 Percent Rule Blog: https://straion.com/blog/90-percent-rule-adherence-straion-coding-agents/
1 million tokens Blog: https://straion.com/blog/1m-tokens-wont-save-your-engineering-standards/

ai lost blog track code engineering engineers coding holzer babysit dynatrace

From Bowling Lanes to AI Lanes: Chris LaBrado on MDCD and the AI Interface Era

Apr 13, 2026 · 54:02

In this episode of the PurePerformance Podcast, Andi and Brian sit down with Chris LaBrado, Solutions Architect for AI Enablement, FSO, SRE, and ITSM at HSN/QVC, where he has spent an incredible 27 years shaping operational excellence. Their conversation dives deep into how AI is transforming software creation, enterprise workflows, and even the very role of developers. Chris shares how the barrier to entry for building tools and automation has dropped overnight thanks to natural-language-based development: “Everyone can now create automation or tools without having to worry about the syntax.” He explains why AI is rapidly becoming the primary interface into the enterprise, capable of navigating presentations, emails, and complex back-office systems, and why the future of engineering may shift from human-oriented coding to AI-driven development models such as MDCD (MarkDown Continuous Development). The discussion also takes unexpected but fascinating detours into Chris's background as a former bowling-industry podcaster, his recent work with generative agents like DynaClaude, his Vibe Coded Root Cause Agent, and a philosophical exploration of AI, creativity, and the concept of singularity. Amidst all the change, Chris remains optimistic: “AI opens up a lot of new opportunity for everyone willing to adapt. It will result in us creating more things that ultimately help us as humans.” This episode is a thoughtful, energizing look at where software engineering is headed, and why the future might be brighter than we think.

Links we discussed:
Chris LaBrado on LinkedIn: https://www.linkedin.com/in/chrislabrado/
Mo Gawdat, former Google Executive on the Singularity "moment of truth": https://x.com/vitrupo/status/2008824930646057380?s=20
CEO of NVIDIA had an interesting excerpt from interview: https://x.com/MinusWells/status/2031974516155695414?s=20
Elon Musk on speed of AI: https://x.com/r0ck3t23/status/2031639621465931903?s=20
AI brain emulation of a fly (e.g. "a sign of the times"): https://x.com/alexwg/status/2030217301929132323?s=20
Elon on fiat currency transforming based on AI manufacturing loop: https://x.com/elonmusk/status/2020202496547844312?s=20
Fiat currency moves to model based on thermodynamics: https://x.com/r0ck3t23/status/2033371028202602547?s=20

ai vibe nvidia bowling singularity interface lanes sre mo gawdat dynatrace itsm fso

AI-Ready Codebases: Engineering Discipline for Agentic AI with Adam Tornhill

Mar 30, 2026 · 52:30

In this episode, Andi and Brian welcome back Adam Tornhill, founder of CodeScene and author of Your Code as a Crime Scene, to explore how agentic AI is reshaping software engineering. Adam shares his personal journey from 40 years of hands-on coding to orchestrating AI-generated code, and what this shift really means for development teams. Together, they dive into new research on the hidden risks of AI-assisted coding, why low-quality or legacy code slows AI down, and how to measure the “AI-readiness” of a codebase. Adam breaks down practical strategies from his latest work on Agentic AI Coding, including guardrails, refactoring patterns, enforced processes, and why test coverage has become a surprising cornerstone for safe, fast AI iteration. Whether you're experimenting with AI coding tools or planning enterprise-scale adoption, this episode delivers actionable guidance rooted in data, engineering discipline, and real-world experience.

Links:
https://codescene.com/blog/agentic-ai-coding-best-practice-patterns-for-speed-with-quality
https://codescene.com/blog/strengthening-the-inner-developer-loop-turn-ai-into-a-reliable-engineering-partner

ai discipline code engineering coding agentic crime scene dynatrace your code

AI‑Native: Building Faster Than We Can Spec with Wolfgang Heider & Benedict Evert

Mar 16, 2026 · 43:47

AI is transforming software engineering faster than many teams can adapt. In this episode, Andi talks with Wolfgang Heider and Benedict Evert about what it really means to build “AI-native” software, where prototypes turn into production apps in minutes. We explore why good engineering fundamentals still matter, how multi-agent workflows mirror traditional roles, and why testing, governance, and clarity of intent become more important, not less. We also discuss the future of junior engineers, the risk of everyone reinventing the same solution, and why value, not code generation, is becoming the real differentiator.

Links we discussed:
https://www.linkedin.com/posts/wolfgangheider_productmanagement-softwareengineering-ai-activity-7425746505883607042-D1OZ
https://www.linkedin.com/pulse/machines-making-wolfgang-heider-5mvsf
https://www.linkedin.com/pulse/i-built-app-between-final-stranger-things-episodes-wolfgang-heider-5penf/
https://futurelab.studio/ora/
https://futurelab.studio/htmlctl/

ai software engineering agent native vibe coding wolfgang spec evert heider

Resilience in the Age of AI and Why we Still Suck at it with Adrian

Mar 2, 2026 · 51:44

Why do we still struggle with resilience in 2026? Is it the growing complexity of systems, the pressure to ship fast, or a lack of education around resilient design? In this episode we welcome Adrian Hornsby from Resilium Labs to explore these questions and learn about chaos, complexity, and the importance of continuous learning! Adrian learned his chaos engineering skills while working at AWS for many years. He shares insights from his upcoming book and his experience helping organizations embrace resilience as a continuous learning practice.

We discuss:
Why traditional chaos engineering assumptions break down when AI starts writing your code.
The rise of AI-powered SRE agents: are they a blessing or a missed learning opportunity?
Organizational challenges and the importance of tracking near misses.

Links we discussed:
Adrian's LinkedIn: https://www.linkedin.com/in/adhorn/
Resilium Labs: https://www.resiliumlabs.com/
Upcoming Book: https://leanpub.com/whywestillsuckatresilience

ai resilience engineering suck resiliency aws organizational sre upcoming book dynatrace

From Zero to Open Source Contributor with Diana Todea

Feb 16, 2026 · 47:17

Contributing to Open Source is easier than ever, especially because contributions are needed for documentation, demos, tutorials and code. But how to get started? Where to look for "good first issues"? Is everyone welcome? What are the prerequisites? Tune in and hear from Diana Todea, Developer Experience Engineer at Victoria Metrics, on how, within a year, she went from Zero to Developer and received the Contributor Award for OpenTelemetry 2025 at KubeCon Atlanta. Diana shares her journey, how she started, how she found the right topic and how she keeps herself motivated. Diana is also the Co-lead of the Neurodiversity CNCF Working Group and gives us insights into the Merge Forward community. And don't forget: the Calls for Papers for Cloud Native Days Romania and Austria are open, and both Diana and Andi would be glad to see your proposals! So, what are you waiting for?

Links we discussed:
Diana's LinkedIn: https://www.linkedin.com/in/diana-todea-b2a79968/
From Zero to Developer Talk: https://www.youtube.com/watch?v=nPrxpEE5GpY
Contributor Award: https://siliconangle.com/2025/11/13/accessibility-meets-open-source-collaboration-kubeconna/
Her latest CNCF Blog Post: https://www.cncf.io/blog/2025/12/04/my-first-kubecon-cloudnativecon-a-journey-through-community-inclusivity-and-neurodiversity/
Start contributing to Open Source: https://contribute.cncf.io/contributors/getting-started/
Diana's Conference Talks: https://github.com/didiViking/Conferences_Talks
Diana on Medium: https://medium.com/@dianatodea/
Articles on OpenTelemetry for beginners: https://medium.com/@dianatodea/the-unofficial-guide-to-contributing-to-opentelemetry-where-to-look-and-who-to-talk-to-9de04ae75fe0
CNCF Merge-Forward: https://community.cncf.io/merge-forward
CNCF Neurodiversity initiative: https://community.cncf.io/neurodiversity
Cloud Native Days Romania: https://cloudnativedays.ro/
Cloud Native Days Austria: https://cloudnativedays.at/

open medium austria developers open source papers contributors neurodiversity contributing from zero cncf

The many facets of an SRE with Alexandra Franz

Feb 2, 2026 · 48:45

From Systems Engineer in Aeronautics via many clouds to becoming an SRE in Observability! That's the path of our guest, Alexandra Franz, who is a Lead Product Engineer in SRE at Dynatrace. Tune in and learn how their team plans ahead for expected high traffic around Black Friday, Cyber Monday or the Super Bowl. We discuss how regional traffic patterns and differences in available hardware get factored in for capacity management and cost control. We also learn why global cloud outages are stressful, but also how those incidents can be the reward for a good SRE.

Make sure to connect with Alexandra on LinkedIn: https://www.linkedin.com/in/alexandrafranz/

super bowl black friday franz cyber monday facets sre aeronautics observability dynatrace

10 Fundamentals to get Vibe Coding right with Jeff Blankenburg

Jan 19, 2026 · 54:32

If you are still treating your AI Coding Agent like a chat bot and not like a development team, then this is one more reason to tune into this episode. In his blog post series 31 Days of Vibe Coding, Jeff Blankenburg walks us through all the lessons learned when bringing an idea to life just with vibe coding. His idea was building a website for collectors of baseball cards. With now more than 950k cards from almost 10k players, he has proven that vibe coding, when done right, can truly boost the output of software engineers. Tune in and learn how to effectively use Git Issues as the backlog for your AI, the importance of going through different phases in your conversation with the AI, and why it is important to ask the AI the question: "Do you have any questions for me?"

Links we discussed:
LinkedIn Profile: https://www.linkedin.com/in/jeffblankenburg/
31 Days of Vibe Coding: https://31daysofvibecoding.com/
Collect Your Cards: https://collectyourcards.com/
Claude: https://claude.ai/

ai vibe developers fundamentals chatbots coding dynatrace

Semiotics - A Future of Observability we are yet to see with William Louth

Jan 5, 2026 · 53:13

How many people have you met that implemented distributed tracing in the early 2000s? Make it one more after you have tuned into our latest podcast with William Louth. William, who can't seem to escape the observability space even though he keeps trying, has a track record in the space. He is an innovator and tool builder and is currently reimagining intelligent systems by shifting the focus from data collection to meaning-making. In our conversation we learn about situational awareness and how systems should use symbols to show their current state, taking into account everything they are aware of happening in their ecosystem. This podcast episode has been long overdue and opens a fascinating new world beyond metrics, logs and traces!

Links discussed:
William's LinkedIn: https://www.linkedin.com/in/william-david-louth/
Humainary Research: https://humainary.io/research/
Humainary GitHub: https://github.com/humainary-io
Serventis Signs: https://raw.githubusercontent.com/humainary-io/substrates-api-java/refs/heads/main/ext/serventis/SIGNS.md

signs tracing observability louth semiotics dynatrace

From Vibe Coding to Vibe Architecting with Abhimanyu Selvan

Dec 22, 2025 · 42:38

It started with the prompt: "Create an Uber Clone"! Several iterations and some months later, Abhi presents his lessons learned when vibing a Ride Share Platform for RoboTaxis at Cloud Native Days Austria! "Commit to one tool and go deep. Don't get distracted by all the options you have. Treat your agent like a human! Get better in expressing what you really want!" Those are some of the many lessons learned in Abhi's journey applying the potential of the latest AI agents that are available for software engineers. Tune into our latest episode and understand what Abhi means when he says: Context is important! Give it Macro Context and do Micro Incremental Improvements!

Links we discussed:
Abhi's LinkedIn: https://www.linkedin.com/in/abhimanyuselvan/
Cloud Native Austria Talk: https://www.youtube.com/watch?v=VjMPHWjawxM&list=PLtLBTEzR4SqU9GwgWiaDt10-yOVIN0nzM&index=9
Cursor AI: https://cursor.com/
OpenSpec: https://openspec.dev/

ai treat cloud context commit native vibe builder coding robotaxis abhi architecting dynatrace abhimanyu

AI-Augmented Chaos Engineering in Practice with Bartek Pisulak

Dec 8, 2025 · 54:40

Chaos Engineering is the practice of introducing controlled failures into a system with the goal of improving its overall resiliency! What started with "let's see what happens when we unplug that server" and moved on to "let's simulate network latency issues" or "let's kill critical pods and see if the system recovers gracefully" is now seeing new experiments that are identified by a new companion: AI. In this episode we have invited Bartek Pisulak, Director of Cloud Quality Engineering at Pegasystems, who has been educating quality engineers on AI-Augmented Chaos Testing in Practice. Tune in and learn how AI can improve efficiency in the 5 critical phases of a chaos experiment: Steady State, Hypothesis, Run Experiment, Verify, Improve! To learn more about the foundational principles, make sure to watch some of Bartek's conference talks listed below.

Links discussed:
Bartek's LinkedIn: https://www.linkedin.com/in/bart%C5%82omiej-pisulak-82b94036/
Talk at Cloud Native Days Austria: https://www.youtube.com/watch?v=xUVCKNpMEz8&list=PLtLBTEzR4SqU9GwgWiaDt10-yOVIN0nzM&index=10
Talk at Porto Tech Hub: https://www.youtube.com/watch?v=-ZuEaA2PoTo
Kraken: https://github.com/krkn-chaos/krkn
ChaosEater: https://github.com/ntt-dkiku/chaos-eater

ai talk chaos practice testing engineering augmented hypothesis verify c5 bartek steady state dynatrace chaos engineering pegasystems

The Pragmatic Approach to Becoming AI-Native with Pini Reznik

Nov 24, 2025 · 58:17

There is only one successful way to adopt new technology, and that is transformational! Sounds like a high-level consulting pitch, but our industry has a track record to validate this statement. Just look at the recent web or cloud-native transformations! Pini Reznik has been helping organizations along the current AI-Native transformational journey. And what timing: he just published his book, From Cloud Native to AI-Native, where he provides a pragmatic approach to leveraging AI from Pioneering to Gradually Scaling! Tune in and hear from Pini why he thinks that AI projects are not failing because of bad AI, but because they are approaching the problem the old, wrong way! And stay until the end to hear what it was like to write a book about AI using AI!

Links we discussed:
Pini's LinkedIn: https://www.linkedin.com/in/pinireznik/
Link to Book: https://re-cinq.com/book
Our previous episode: https://www.spreaker.com/episode/ai-native-the-next-revolution-after-cloud-native-with-pini-reznik--67692567
Prompt Engineering Conference Talk: https://www.youtube.com/watch?v=W7z5XMnvYt8

ai technology cloud transform native pioneering pragmatic pini dynatrace reznik

Back to Basics: Increase DevEx in the Age of AI with Laura Tacho

Nov 10, 2025 · 48:07

Don't get stuck using AI to build faster horses. Instead, find the opportunities and rethink your software delivery processes! That, and only that, will help you increase Developer Experience and Efficiency! This episode is all about how to measure and improve DevEx in the age of Artificial Intelligence. And with Laura Tacho, CTO at DX, we think we found a perfect guest! Laura has been working in the dev tooling space for the past 15 years. In her current role at DX she is working on the evolution of DORA and SPACE into DX Core 4 and the DXI Measurement Framework. In our episode we learn about those frameworks but also how tech leaders need to rethink where and how to apply AI to improve overall efficiency, quality and effectiveness!

The key takeaways from this conversation are:
DevEx is all about identifying and reducing friction in the end-2-end development process
Tech Leaders need to get better at articulating technical change requirements to the business
As of today only 22% of code in git is really AI generated. Don't get fooled into believing AI is already better
Back to Basics makes companies successful with AI. That is: proper CI/CD, testing, documentation, observability!

Here are the links we discussed:
Laura's LinkedIn: https://www.linkedin.com/in/lauratacho/
DX: https://getdx.com/
Cloud Native Days Austria Talk: https://www.youtube.com/watch?v=kZ1F0-XS1l4
Engineering Leadership Community: https://www.engineeringleaders.io/

ai space artificial intelligence basics developers cto efficiency back to basics dx ci cd developer experience tacho devex

What's Hot in Cloud and AI-Native and what we learned from the AWS Outage

Oct 27, 2025 · 46:11

The AWS US-East problems on Oct 27th were a good reminder of how dependent we are on globally shared services. Built-in resiliency is not guaranteed if systems have a hard dependency on a single region of a single vendor. Many of us have seen systems we use on a daily basis being impacted, some critical, some not so critical, as Andi will tell you when he found out that his beloved Leberkas Pepi App didn't work! Besides this outage we discuss lessons learned from Cloud Native Days Austria and the Observability and Platform Engineering Meetups in Gdansk and Tallinn, and we give an outline of the upcoming Cloud and AI-Native US Tour from Henrik Rexed and Andi Grabner.

All the links we discussed are here:
Leberkas Pepi: https://www.leberkaspepi.at/
Cloud Native Austria: https://www.linkedin.com/company/cndaustria/
Observability Meetup: https://www.meetup.com/observability-tech-community-meetup-group/
US Tour from Henrik and Andi: https://events.dynatrace.com/noram-all-de-engineering-efficiency-tour-2025-28225/

ai built engineering cloud platform native resiliency aws outage tallinn observability us tour gdansk what's hot henrik rexed

How to test, optimize, and reduce hallucinations of AIs with Thomas Natschlaeger

Oct 13, 2025 · 49:08

While Artificial Intelligence seems to have just popped up when OpenAI brought ChatGPT to the consumer market, it has its roots in the mid-20th century. But what is it that all of a sudden made it into every conversation we seem to have? Thomas Natschlaeger, Principal Data Scientist at Dynatrace, who has been working in the AI and Machine Learning space for the past 30 years, gives us a brief historical overview and describes the critical evolutionary steps and compelling events that made the technology what it is today. Tune in and hear how AIs are trained, how they are optimized and, most importantly, how their outputs can be tested and validated! In our conversation we discuss current trends towards small language models that will help model digital twins of our existing roles, and how AIs are used to validate other AIs, much as a senior engineer pair programming with a junior provides essential feedback on current accuracy and input to improve the outcome of future tasks.

Links we discussed:
LinkedIn Profile from Thomas: https://www.linkedin.com/in/thomas-natschlaeger/
Ask Me Anything Session on Davis CoPilot: https://www.linkedin.com/posts/grabnerandi_llm-copilot-activity-7373837743971393536-QgxV?utm_source=share&utm_medium=member_desktop&rcm=ACoAAABLhVQBbh8Jkn_K8din5tsQlMCpXRNzlKU
Voxxed Conference Talk: https://amsterdam.voxxeddays.com/talk/?id=39801
Attention is all you need paper: https://en.wikipedia.org/wiki/Attention_Is_All_You_Need

learning ai network chatgpt intelligence reduce artificial openai machine learning optimize llm validate hallucinations neural dynatrace principal data scientist

Hello BOB - Cloud Native Cybersecurity with Bill of Behaviors with Constanze Roedig

Sep 29, 2025 · 27:05

On September 8 the world saw the npm supply chain attack. Fortunately the community reacted in record time to avert a disaster. In today's episode we have Constanze Roedig, Key Researcher at SBA Research, who introduces us to the new buddy of SBoM (Software Bill of Materials): SBoB (Software Bill of Behaviors), and shares her thoughts on how that new approach to fingerprinting software can help cyber security teams. What's a BoB? It's a detailed runtime behavior profile of software. It expands on the static validation that SBoMs offer, as it allows security teams to validate the correct execution behavior of deployed software at deploy time or continuously in production. Thanks to eBPF, malicious behavior such as opening unexpected ports or accessing unexpected files can therefore be detected. Listen to Constanze, who shares the work she and Vadim Bauer, Owner of 8gear, have done on this topic. You will learn how software vendors can create their own SBoBs and ship them with their container images, and how security teams can get alerted to, or enforce policy against, any detected malicious behavior. Make sure to check out their GitHub repo, star it if you like it and try their hands-on tutorial!

Links:
Constanze LinkedIn: https://www.linkedin.com/in/croedig/
Vadim LinkedIn: https://www.linkedin.com/in/vadim-bauer/
OBobCtl GitHub Repo: https://github.com/k8sstormcenter/bobctl
Cloud Native Summit Munich Talk: https://www.youtube.com/watch?v=XETuwndd_mw&index=11&pp=iAQB
npm supply chain attack: https://www.infosecurity-magazine.com/news/npm-supply-chain-attack-averted/

owner security behavior cybersecurity github sba cloud native npm constanze dynatrace sbom ebpf sboms

AI-Native: The Next Revolution after Cloud Native with Pini Reznik

Sep 15, 2025 · 51:55

Defining AI-Native in 2025 is like trying to define Cloud Native back in 2014! We are in the early stages of understanding what AI really means to us. The ecosystem is just evolving, and many organizations are still struggling with re-architecting their digital systems to cloud native patterns! To learn more about the current transformational wave, the AI-Native Wave, we have invited Pini Reznik, CEO and Co-Founder of re:cinq. We will discuss what we can learn from previous "waves of innovation," why the business must care, and why the primary AI use case should not be just cost-cutting! Make sure to get a copy of his book or catch his talk from Cloud Native Munich.

All links we discussed here:
Pini's LinkedIn: https://www.linkedin.com/in/pinireznik/
The Next Transformation Mini Book: https://re-cinq.com/mini-book
Cloud Native Munich Talk: https://www.youtube.com/watch?v=CHb3TLEV8ZU

ceo ai co founders cloud intelligence artificial cloud native pini dynatrace next revolution reznik

State of AI Observability with OpenLLMetry: The Best is Yet to Come with Nir Gazit

Play Episode Listen Later Sep 1, 2025 50:35

Most AI projects still fail, are too costly, or don't provide the value they hoped to gain. The root cause is nothing new: it's non-optimized models or code that runs the logic behind your AI apps. The solution is also not new: tuning the system based on insights from observability!
To learn more about the state of AI Observability, we invited back Nir Gazit, CEO and Co-Founder of Traceloop, the company behind OpenLLMetry, the open source observability standard that is seeing exponential adoption growth!
Tune in and learn how OpenLLMetry became such a successful open source project, which problems it solves, and what we can learn from other AI project implementations that successfully launched their AI apps and agents.
Links we discussed:
Nir's LinkedIn: https://www.linkedin.com/in/nirga/
OpenLLMetry: https://github.com/traceloop/openllmetry
Traceloop Hub LLM Gateway: https://www.traceloop.com/docs/hub

ceo ai co founders llm yet to come best is yet observability dynatrace

Platform Engineering is not just a trend and why Terraform is not dead with Artem Lajko

Play Episode Listen Later Aug 11, 2025 47:46

Did you know that the average salary for a Platform Engineer is 42.5% higher than that of a DevOps engineer? But why is that?
We sat down with Artem Lajko, CNCF Kubestronaut and Ambassador as well as author of the book Implementing GitOps with Kubernetes. We dive into the role of a platform engineer, the common pitfalls in implementing IDPs and why Backstage and AI won't solve all your problems. And we touch upon a topic hot off the press around Terraform: It's not dead!
Links we discussed:
Artem's LinkedIn: https://www.linkedin.com/in/lajko/
Talk slides from Cloud Land: https://lajko10-my.sharepoint.com/personal/artem_lajko_dev/_layouts/15/onedrive.aspx?id=%2Fpersonal%2Fartem%5Flajko%5Fdev%2FDocuments%2FAttachments%2Fcloud%20land%2D2025%5F%2Epdf&parent=%2Fpersonal%2Fartem%5Flajko%5Fdev%2FDocuments%2FAttachments&ga=1
State of Platform Engineering Report: https://platformengineering.org/reports/state-of-platform-engineering-vol-3
Upjet GitHub Project: https://github.com/crossplane/upjet

ai talk engineering ambassadors platform trend backstage devops kubernetes artem terraform idps cncf platform engineering dynatrace k8s gitops perfmatters

What is Privacy Engineering and Why Its not as complicated as it sounds with Cat Easdon

Play Episode Listen Later Jul 28, 2025 53:22

"Privacy engineering is the art of translating privacy laws and policies into code, figuring out how to make legal requirements such as ‘an individual must be able to request deletion of all their personal data' a technical reality.", was the elegant explanation from Cat Easdon when asked about what she is doing in her day job.If you want to learn more then tune in to this episode. Cat, Privacy Engineer at Dynatrace, shares her learnings about things such as: When the right time is to form your own privacy engineering team, why privacy means different things for different people and regulators and what privacy considerations we specifically have in the observability industry so that our users trust our services!Links:Cat's LinkedIn Profile: https://www.linkedin.com/in/easdon/Publications from Cat: https://www.dynatrace.com/engineering/persons/catherine-easdon/Blog on Managing Sensitive Data at Scale: https://www.dynatrace.com/news/blog/manage-sensitive-data-and-privacy-requirements-at-scale/Semgrep for lightweight code scanning: https://github.com/semgrep/semgrepThe IAPP: https://iapp.org/'Meeting your users' expectations' is formally described by the theory of contextual integrity: https://www.open.edu/openlearncreate/mod/page/view.php?id=214540Facebook's $5 billion fine from the FTC: http://ftc.gov/news-events/news/press-releases/2019/07/ftc-imposes-5-billion-penalty-sweeping-new-privacy-restrictions-facebookFact-check: "The $5 billion penalty against Facebook is the largest ever imposed on any company for violating consumers' privacy and almost 20 times greater than the largest privacy or data security penalty ever imposed worldwide. It is one of the largest penalties ever assessed by the U.S. government for any violation." I think that's still true; the largest fine under the GDPR was €1.2 billion (again for Facebook/Meta)

personal data blog scale engineering cat privacy regulation complicated sensitive gdpr ftc regulators publications observability dynatrace semgrep

Platform Democracy NOW! How to keep your Platform Promise with Daniel Bryant

Play Episode Listen Later Jul 14, 2025 47:27

More than 50% of platform engineering leads don't know how to measure the impact of their platform! Many platform projects fall into common anti-pattern traps that make the platform look great on Day 1 but fail to scale and excite on Day 2!
Daniel Bryant - whose profile tagline is "Helping you build better platforms" - shares his thoughts on how to measure the value of your platform, how to avoid common anti-patterns and why he believes that the future of platform engineering is in Platform Democracy!
And of course, we wrap everything up with a discussion around the impact of Agentic AI on platform engineering. So - tune in!
Here are the links we discussed:
Daniel's LinkedIn Profile: https://www.linkedin.com/in/danielbryantuk/
Platform Engineering Book for Technical Product Leaders: https://www.amazon.de/Platform-Engineering-Technical-Product-Leaders/dp/1098153642/ref=asc_df_1098153642
Platform Engineering Day Talk: https://www.syntasso.io/post/syntasso-at-platengday-london-presentation-recap
Kratix Website: https://www.kratix.io/
AI-Driven Platform Engineering Blog: https://www.syntasso.io/post/what-we-learned-building-a-prototype-ai-driven-dev-interface-for-kratix
Platform Democracy: https://www.syntasso.io/post/platform-democracy-rethinking-who-builds-and-consumes-your-internal-platform
Platform Anti Patterns: https://www.syntasso.io/post/platform-building-antipatterns-slow-low-and-just-for-show
Slide Deck on Platform Engineering for Devs and Architects: https://speakerdeck.com/danielbryantuk/platform-engineering-for-software-developers-and-architects-redux

ai performance engineering platform democracy architects devs agentic platform engineering dynatrace antipatterns daniel bryant perfmatters

DX Core 4 Applied - Measuring Developer Productivity with Dušan Katona

Play Episode Listen Later Jun 23, 2025 49:48

"How do you measure the impact you have with your platform engineering initiative?" is a question you should be able to answer. To show improvement you must first need to know what the status quo is. And this is where frameworks such as DX Core 4 come in. Never heard about it? Then tune into this episode where we have Dušan Katona, Sr Director of Platform Engineering at Ataccama, who is a big fan of the DX Core Four Metrics and who has just applied it in his current role to optimize developer experience.Dušan explains the details behind those 4 Core metrics: Speed, Effectiveness, Quality and Impact. He also shares how improving those metrics by a single point results in the equivalent of 10 hours saved per developer per year.And here the relevant links we discussed todayDusan's LinkedIn Profile: https://www.linkedin.com/in/dusankatona/DX Core 4 Blog: https://getdx.com/research/measuring-developer-productivity-with-the-dx-core-4/Marian's JIRA Analytics Open Source Project: https://github.com/marian-kamenistak/jira-lead-cycle-time-duration-extractor

impact blog speed engineering platform developers measuring effectiveness applied sr director core4 platform engineering katona dynatrace developer productivity

In the AI Age being Smart is not enough for Tech Leadership with Marian Kamenistak

Play Episode Listen Later Jun 9, 2025 39:52

"15 years ago it was enough to be smart - going forward its not a differentiator - being smart will just make you average!". But what is it? What makes great leaders worth following and how do they achieve tripling their value while others keep waiting for their 5% raise?4 years ago Marian Kamenistak launched the Engineering Leadership Community out of Prague, Czech Republic. Feeding from his experience in the Silicon Valley this community has grown to 1500 members with the mission to create "Leaders worth following". Tune in and hear from Marian on how to think and talk about value impact vs being held up with trying to achieve technical perfection. Why its important to build a network around you, the difference between mentorship and management as well as how to proof the value to your leadership that you bring to the organization!Links we discussed todayMarian's LinkedIn: https://www.linkedin.com/in/mariankamenistak/Engineering Leadership Conference: https://www.elc-conference.io/Engineering Leadership Community: https://www.engineeringleaders.io/The Leadership Pipeline Book: https://www.amazon.com/Leadership-Pipeline-Build-Powered-Company/dp/0470894563

leadership tech management leaders smart silicon valley engineering feeding prague czech republic ai age dynatrace

The Research Behind the AI and Observability Innovation with Otmar Ertl and Martin Flechl

Play Episode Listen Later May 26, 2025 50:59

Scientific research is the foundation of many innovative solutions in any field. Did you know that Dynatrace runs its own Research Lab on the campus of the Johannes Kepler University (JKU) in Linz, Austria - just 2 kilometers away from our global engineering headquarters? What started in 2020 has grown to 20 full-time researchers and many more students who do research on topics such as GenAI, Agentic AI, Log Analytics, Processing of Large Data Sets, Sampling Strategies, Cloud Native Security, or Memory and Storage Optimizations.
Tune in and hear from Otmar and Martin how they are researching the N+2 generation of Observability and AI, how they are contributing to open source projects such as OpenTelemetry, and what their predictions are for when AI finally takes control of us humans!
To learn more about their work check out these links:
Martin's LinkedIn: https://www.linkedin.com/in/mflechl/
Otmar's LinkedIn: https://www.linkedin.com/in/otmar-ertl/
Dynatrace Research Lab: https://careers.dynatrace.com/locations/linz/#__researchLab

ai research innovation memory austria campus scientific lab optimization genai logs linz cloud native observability research lab dynatrace cloud native security

Organizational Sustainability through Platform Engineering with Lesley Cordero

Play Episode Listen Later May 12, 2025 42:00

As a leader who wants to optimize an organization, you are bound to fail if you isolate social (culture and people) and technical (tools and process) changes. When we ask Lesley Cordero, Staff Engineer at The New York Times, how to solve this dilemma, she answers: "Platform Engineering, it can drive organizational sustainability by practicing sociotechnical principles that provide a community driven support system for application developers using our standardized shared platform architecture."
Tune in to our latest episode and learn more about the importance of leadership in continuously balancing the tension between "Developers" and "Operations", between "End User Experience" and "Developer Experience", and ultimately between "Culture and People" and "Tools and Processes".
Links we discussed:
Lesley's LinkedIn: https://www.linkedin.com/in/lesleycordero/
GOTO Conference Talk: https://www.youtube.com/watch?v=Jx-XrUONJ-o
QCon 2025 Talk Details: https://qconlondon.com/presentation/apr2025/platform-engineering-practice-sociotechnical-excellence
DevOpsCon 2024 Talk Details: https://devopscon.io/business-company-culture/platform-engineering-devops/

culture new york times tools sustainability operations engineering platform developers technical nyt organizational socio cordero developer experience platform engineering dynatrace staff engineer qcon perfmatters

Run Towards the Fire: Why we should love incidents with Lisa Karlin Curtis

Play Episode Listen Later Apr 28, 2025 46:47

Do you plan for incidents? Do you have a time / cost budget for them in your sprint or quarterly planning? Do you have engineers that are "interruptible"?
We discussed those questions and more with Lisa Karlin Curtis, Founding Engineer at incident.io, who teaches us why we need to think differently about dealing with incidents!
In our discussion we learn why modern incident management embraces more incidents that are publicly shared within an organization to foster learning. We learn how to train more people to become incident responders, how to triage and categorize incidents, how to better plan for them and how to best report on them.
We also touch on AI, and how AI-generated code will eventually result in more incidents, which we should use as an opportunity to learn and improve our engineering process.
P.S.: This was our 10th-anniversary podcast episode!!
Here are the links we discussed in the podcast:
Lisa's LinkedIn: https://www.linkedin.com/in/lisa-karlin-curtis-a4563920/
Her talk at ELC Prague: https://docs.google.com/presentation/d/18536WBHBcPEppEeXXP7o5UQOX2XfWoGmfds2CHegHq4/edit?slide=id.g3434e0cba65_0_0#slide=id.g3434e0cba65_0_0
Incident Playbook: https://incident.io/guide

ai performance code incident incidents karlin dynatrace

MCPs (Model Context Protocol) are not that magic, but they enable magic things with Dana Harrison

Play Episode Listen Later Apr 14, 2025 46:43

MCP (Model Context Protocol) is an open source standard for connecting AI assistants to the systems where data lives. But you probably already knew that if you have followed the recent hype around this topic after Anthropic made their announcement at the end of 2024.
To learn why MCPs are not that magic, but enable "magic" new use cases that boost the efficiency of engineers, we have invited Dana Harrison, Staff Site Reliability Engineer at Telus. Dana goes into the use cases he and his team have been testing out over the past months to increase developer efficiency.
In our conversation we also talk about the difference between local and remote MCPs, the importance of keeping resilience in mind as MCPs connect to many different API backends, and how we can and should observe the interactions with MCPs.
Links we discussed:
Anthropic Blog: https://www.anthropic.com/news/model-context-protocol
Dana's LinkedIn: https://www.linkedin.com/in/danaharrisonsre/overlay/about-this-profile/

ai magic model context protocol api llm anthropic enable mcp sre telus dynatrace mcps dana harrison

The History & Power of Distributed Tracing with Christoph Neumueller & Thomas Rothschaedl

Play Episode Listen Later Mar 31, 2025 56:19

So you think Distributed Tracing is the new thing? Well - it's not! But it's never been as exciting as today!
In this episode we combine 50 years of Distributed Tracing experience across our guests and hosts. We invited Christoph Neumueller and Thomas Rothschaedl, who have seen the early days of agent-based instrumentation, how global standards like the W3C Trace Context allowed tracing to connect large enterprise systems, and how OpenTelemetry is commoditizing data collection across all tech stacks.
Tune in and learn about the difference between spans and traces, why collecting the data is only part of the story, how to tackle the challenge of dealing with too much data, and how traces relate and connect to logs, metrics and events.
Links we discussed:
YouTube with Christoph: LINK WILL FOLLOW ONCE VIDEO IS POSTED
Christoph's LinkedIn: https://www.linkedin.com/in/christophneumueller/
Thomas's LinkedIn: https://www.linkedin.com/in/rothschaedl/

history open christoph tracing distributed traces observability spans telemetry w3c dynatrace otel perfmatters

An Inside Look into Platform Engineering for Architects with the authors Max, Hilliary & Andi

Play Episode Listen Later Mar 17, 2025 59:14

In the ever-changing IT world, it's hard to create content that stays relevant for long. One of the objectives of "Platform Engineering for Architects: Crafting Modern Platforms as a Product" was to stay timeless by providing practical examples of use cases that are not necessarily tied to current technology trends.
The book focuses on the importance of building a platform with a purpose, making the impact measurable and making sure the platform continuously evolves by including the end users (the engineering teams) in its evolution.
Tune in to this episode and hear from Max Körbächer (Founder of Liquid Reply), Hilliary Lipsig (Senior Principal SRE at Red Hat) and Andi Grabner (Co-Host of PurePerformance) on what made them write a book on Platform Engineering, and get some personal insights into what gets the authors excited about their respective topics.
If you have a chance, meet Max, Hilliary and Andi at KubeCon in London. They will present at Platform Engineering Day and will also do a book signing at KubeCrawl!
Links we discussed:
Book on Amazon: https://www.amazon.com/Platform-Engineering-Architects-Crafting-platforms-ebook/dp/B0DH5DJFTH
Platform Engineering Day Session: https://colocatedeventseu2025.sched.com/event/1u5mX/platform-engineering-for-architects-crafting-platforms-as-a-product-max-korbacher-liquid-reply-hilliary-lipsig-red-hat
Hilliary Lipsig: https://www.linkedin.com/in/hilliary-lipsig-a5935245/
Max Körbächer: https://www.linkedin.com/in/maxkoerbaecher/
Andi Grabner: https://www.linkedin.com/in/grabnerandi/

amazon founders product engineering platform architecture architects liquid inside look red hat sre platform engineering dynatrace kubecon max k perfmatters

How CERN analyzed 1 PetaByte per second using K8s with Ricardo Rocha

Play Episode Listen Later Mar 3, 2025 38:31

One PetaByte is the equivalent of 11000 4k movies. And CERN's Large Hadron Collider (LHC) generates this every single second. Only a fraction of this data (~1 GB/s) is stored and analyzed using a multicluster batch job dispatcher with Kueue running on Kubernetes. In this episode we have Ricardo Rocha, Platform Engineering Lead at CERN and CNCF Advocate, explaining why after 20 years at CERN he is still excited about the work he and his colleagues are doing. To kick things off we learn about the impact that the CNCF has on the scientific community and how to best balance an implementation of that scale between "ease of use" and "optimized for throughput". Tune in and learn about custom hardware being built 20 years ago and how the advent of the latest chip generation has impacted the evolution of data scientists around the globe.
Links we discussed:
Ricardo's LinkedIn: https://www.linkedin.com/in/ricardo-rocha-739aa718/
KubeCon SLC Keynote: https://www.youtube.com/watch?v=xMmskWIlktA&list=PLj6h78yzYM2Pw4mRw4S-1p_xLARMqPkA7&index=5
Kueue CNCF Project: https://kubernetes.io/blog/2022/10/04/introducing-kueue/

engineering platform gb cern kubernetes collider analyzed lhc cncf dynatrace k8s hadron ricardo rocha large hadron collider lhc petabyte

Why Compliance is Important and not Boring with Michiel de Lepper

Play Episode Listen Later Feb 17, 2025 50:40

The word "Compliance" reminds many of mandatory training or audits. Two things not everyone gets excited about!
Tune in and meet Michiel de Lepper, who has spent most of his career in Security and Compliance. He gives us a different perspective on the importance of compliance, why it exists, how it intertwines with security and threat detection, what it has to do with security posture management and why he thinks it's one of the most exciting things in IT!
Links we discussed:
Michiel's LinkedIn: https://www.linkedin.com/in/madelepper/
Blog posts on security and compliance:
https://www.dynatrace.com/news/blog/dynatrace-for-executives-security-compliance/
https://www.dynatrace.com/news/blog/manage-compliance-and-resilience-at-scale-with-dynatrace/
https://www.dynatrace.com/news/blog/dynatrace-kspm-transforming-kubernetes-security-and-compliance/

performance management blog security threats boring compliance posture detection michiel dynatrace lepper

What's next for Feature Flagging and OpenFeature with Ben Rometsch

Play Episode Listen Later Feb 3, 2025 50:36

Feature Flagging - some may call feature flags "glorified if-statements" - has been a development practice for decades. But have we reached a stage where organizations are doing "Feature Flag-Driven Development"? After all, it took years to establish a test-driven development culture despite having great tools and frameworks available!
To learn more we invited Ben Rometsch, Co-Founder of Flagsmith, to chat about the history, state and future of Feature Flagging. He gives us an update on where the market is heading, how the CNCF project OpenFeature and its community are driving best practices, what the role of AI might be and what he thinks might be next!
A couple of links we discussed during the episode:
Ben on LinkedIn: https://www.linkedin.com/in/benrometsch/
YouTube Video on Observability & Feature Flagging: https://www.youtube.com/watch?v=VZakh1_oEL8
OpenFeature: https://openfeature.dev/

ai co founders development couple feature flag observability flagging cncf

Observability Predictions 2025 Under the Covers with Bernd Greifeneder

Play Episode Listen Later Jan 20, 2025 47:31

To predict the future, it's important to know the past. And that is true for Bernd Greifeneder, Founder and CTO of Dynatrace, who has been driving innovation in observability and security since he founded Dynatrace 20 years ago!
Bernd agreed to sit down, look behind the covers and answer the open questions that people posted on his LinkedIn in response to his recent observability prediction blog. Tune in and learn about Bernd's thoughts on the evolution from reactive to preventive operations, who is behind the convergence of observability & security, why observability can help those that have serious intentions for sustainability and how observability becomes mandatory and indispensable for AI-driven services.
We mentioned a lot of links in today's session. Here they are:
Our podcast from 9 years ago: https://www.spreaker.com/episode/015-leading-the-apm-market-from-enterprise-into-cloud-native--9607734
Bernd's LinkedIn Post: https://www.linkedin.com/feed/update/urn:li:activity:7275101213237354497/
Predictions Blog: https://www.dynatrace.com/news/blog/observability-predictions-for-2025/
K8s Predictive Scaling Lab: https://github.com/Dynatrace/obslab-predictive-kubernetes-scaling
Security Video: https://www.youtube.com/watch?v=ICUwRy4JFTk
Carbon Impact App: https://www.youtube.com/watch?v=8Px0BB1U1yk
AI & LLM Observability Video: https://www.youtube.com/watch?v=eW2KuWFeZyY

ceo founders ai performance predictions security cto covers bernd logs apm observability dynatrace

From Infra to Services to Happy End Users: The role of SLOs at Uber with Vishnu Acharya

Play Episode Listen Later Jan 6, 2025 50:59

eBay, Yahoo, Netflix and then 10+ years at Uber. In this episode we sit down with Vishnu Acharya, Head of Network Infrastructure EMEA and Platform Engineering at Uber. Vishnu shares how Uber has scaled over the years to about 4000 engineers and how his team makes sure that infrastructure and platform engineering scale with the growing company and the growing demand on their digital services.
Tune in and learn how Vishnu thinks about SLOs across all layers of the stack, how they manage to get better insights with their cloud providers and why it's important to have an end-to-end understanding of the most critical end user journeys.
Links we discussed:
Conference talk at Observability & SRE Summit: https://www.iqpc.com/events-observability-sre-summit/speakers/vishnu-acharya
Vishnu's LinkedIn Page: https://www.linkedin.com/in/vishnuacharya/
Uber Engineering Blog: https://www.uber.com/blog/engineering/

The Road to OpenTelemetry Adoption at Booking with Anton Timofieiev

Play Episode Listen Later Dec 23, 2024 53:45

For the past 10 years Anton has been working at Booking.com - one of the leading digital travel companies, based out of Amsterdam. The journey that started as System Administrator has led Anton to become an Engineering Manager for Site Reliability, where over the past 3 years he led the rollout and adoption of OpenTelemetry as the standard for getting observability into new cloud native deployments.
Tune in and learn how Anton saw R&D grow from 300 to 2000, why they replaced their home-grown Perl-based Observability Framework with OpenTelemetry, how they tackle adoption challenges and how they extend and contribute back to the open source community.
Links we discussed:
Anton's LinkedIn Profile: https://www.linkedin.com/in/antontimofieiev/
Observability & SRE Summit: https://www.iqpc.com/events-observability-sre-summit/speakers/anton-timofieiev
OpenTelemetry: https://opentelemetry.io/

technology open amsterdam adoption booking anton sre engineering manager observability telemetry dynatrace otel system administrator

Why Security and Compliance must not be a showstopper for SaaS with Milan Steskal

Play Episode Listen Later Dec 9, 2024 52:00

Most services are moving to SaaS - whether it's email, collaboration, customer relations, or finance. But not everyone can go to SaaS - or at least that's the initial reaction when navigating certain industries' rules and regulations.
Milan Steskal - who worked in healthcare for many years - is now helping organizations ask the right questions and find the best solutions as they evaluate their options to move their observability data to SaaS. Tune in and learn about the questions to ask vendors and your internal security, privacy, and compliance teams. Milan also walks us through the capabilities SaaS vendors such as Dynatrace have put in place to protect data sent to the cloud so that it stays safe and only accessible to those needing access.
Links discussed today:
Milan's LinkedIn Page: https://www.linkedin.com/in/milansteskal/
Dynatrace Trust Center: https://www.dynatrace.com/company/trust-center/
Blogs on Trust: https://www.dynatrace.com/news/tag/trust-center/

trust performance security saas compliance observability showstoppers dynatrace

Every Byte Counts: Web Performance Flashback with Andreas Taranetz

Play Episode Listen Later Nov 25, 2024 51:28

Andreas Taranetz is a software engineer and lecturer at the University of Vienna. He creates a lot of educational content around Web Performance Optimization. For the past seven years, he has also operated Wahlkabine, Austria's top website for matching one's political views with the parties that are up for election.
This episode was an amazing flashback - reminding us about the time when Steve Souders - the "godfather" of Web Performance Optimization - educated web developers about optimizing CSS, JavaScript, and server-side roundtrips.
Tune in and learn why Web Performance is still such an important topic, how it relates to sustainability, why you should cache on every layer, and what the Static Site Paradox really is!
Links we discussed in the episode:
Andreas on LinkedIn: https://www.linkedin.com/in/andreas-taranetz/
Personal Website: https://andreas.taranetz.com/
We Are Developers Talk: https://www.youtube.com/live/KRemC82gsBk
Wahlkabine: https://wahlkabine.at/
Steve Souders: https://stevesouders.com/

university performance web austria andreas flashback counts javascript bytes css personal website dynatrace web performance wpo steve souders

The Security and Resiliency Challenges of Cloud Native Authorization with Alex Olivier

Play Episode Listen Later Nov 11, 2024 52:35

Authentication (validating who you claim to be) and Authorization (enforcing what you are allowed to do) are critical in modern software development. While authentication seems to be a solved problem, modern software development faces many challenges with secure, fast, and resilient authorization mechanisms. To learn more about those challenges, we invited Alex Olivier, Co-Founder and CPO at Cerbos, an Open Source Scalable Authorization Solution. Alex shared insights on attribute-based vs. role-based access control, the difference between stateful and stateless authorization implementations, why Broken Access Control is in the OWASP Top 10 Security Vulnerabilities, and how to observe the authorization solution for performance, security, and auditing purposes.
Links we discussed during the episode:
Alex's LinkedIn: https://www.linkedin.com/in/alexolivier/
Cerbos on GitHub: https://github.com/cerbos/cerbos
OWASP Broken Access Control: https://owasp.org/www-community/Broken_Access_Control

challenges co founders security cloud native olivier resiliency github cpo authentication authorization cloud native owasp dynatrace owasp top security vulnerabilities

Open Source: Why its the Best Thing that happened to IT with Marcio Lena

Play Episode Listen Later Oct 28, 2024 64:40

"Open Source is the Best Thing that happened to IT"! Powerful words from Marcio Lena, who has been using and contributing back to open source for the past 20+ years. Besides being an avid advocate for open source, Marcio also knows the concerns of large enterprises when picking open source projects.
Tune in and follow our discussion about how to identify a healthy open source project, how to balance between vendor and community lock-in, the power of open standards such as OpenTelemetry, open source business models, and why contributing to open source is not limited to code but includes documentation, education and advocacy as well!
Links we discussed:
Marcio's LinkedIn Page: https://www.linkedin.com/in/marcio-lena/
CNCF DevStats: https://devstats.cncf.io/
Linux Foundation Events: https://events.linuxfoundation.org/
CNCF Ambassadors: https://www.cncf.io/people/ambassadors/

powerful open source linux best things marcio cncf dynatrace

Understanding DORA - Europe's Digital Operational Resiliency Act with Kay Young

Play Episode Listen Later Oct 14, 2024 30:11

DORA - the EU's Digital Operational Resiliency Act - will take effect in January of 2025 and is currently top of mind for IT Leaders across all financial service institutions that operate in the European Union. But what is DORA really? Why is this important? How can institutions meet the DORA requirements? What is the role of observability, automation and AI in all of this?
To answer those questions and more, we invited Kay Young, Sr Principal Product Manager at Dynatrace, who has been working with organizations around the globe that have been tasked to implement regulations such as DORA, GDPR, FedRAMP or others.
In our conversation we also touch on third-party risk management as well as resiliency testing and incident reporting.
Resources we discussed:
Kay's LinkedIn Profile: https://www.linkedin.com/in/karlien-young-4a156730/
What is DORA blog: https://www.dynatrace.com/news/blog/what-is-dora/
Taming DORA compliance: https://www.dynatrace.com/news/blog/taming-dora-compliance-with-ai-observability-and-security/
Blog on Dynatrace's DORA compliance journey: https://www.dynatrace.com/news/blog/the-dynatrace-journey-toward-dora-compliance/
Beyond DORA compliance: https://www.dynatrace.com/news/blog/dora-how-dynatrace-helps-the-financial-sector-stay-resilient/

ai europe young digital european union blog act automation resiliency gdpr operational observability dynatrace fedramp it leaders

Lessons learned when building the NAIS Platform with Hans Kristian Flaatten

Play Episode Listen Later Sep 30, 2024 49:19

NAIS (pronounced like NICE) is a team central application platform that provides DevOps teams with the tools they need to build, test, deploy, run and observe applications.
In this episode Hans Kristian Flaatten, Platform Engineer at NAV, walks us through the WHYs, HOWs and challenges of building modern platforms on Kubernetes. Tune in and hear WHY they defined their own abstraction layer for applications, HOW developers benefit from that platform and WHY they developed their own developer portal instead of going with other popular available choices.
Links we discussed:
Hans Kristian's LinkedIn: https://www.linkedin.com/in/hansflaatten/
NAIS Documentation: https://docs.nais.io/

performance platform lessons learned devops nav whys kubernetes hows observability nais dynatrace hans kristian

Why Developer Observability is not a tooling problem with Viktor Farcic

Play Episode Listen Later Sep 16, 2024 58:29

"We will overwhelm developers if we give them the same specialized observability, security or deployment tools that are used by their platform engineering, operations, SREs or security teams!" - says Viktor Farcic, Developer Advocate at UpBound and host of The DevOps Toolkit YouTube channel. Tune in and hear us discuss making observability more accessible for developers, what Viktor doesn't like about Kubernetes and how Crossplane - the cloud native control plane framework - can be the gateway to real product-oriented platform engineering!
Here are the links we discussed during this episode:
Viktor on LinkedIn: https://www.linkedin.com/in/viktorfarcic/
DevOps Toolkit: https://www.youtube.com/@DevOpsToolkit
Crossplane: https://www.crossplane.io/

performance cloud native developers viktor devops kubernetes tooling observability developer advocate dynatrace sres crossplane

Pitfalls to avoid when going all-in on OpenTelemetry with Hans Kristian Flaatten

Play Episode Listen Later Sep 2, 2024 55:07

Hans Kristian is a Platform Engineer for NAV's Kubernetes platform Nais, which hosts Norway's welfare services. With 10 years on Kubernetes, 2000 apps and 1000 developers across more than 100 teams, there was a need to make OpenTelemetry adoption as easy as possible. Tune in as we hear from Hans Kristian, who is also a CNCF Ambassador and hosts Cloud Native Day Bergen, why OpenTelemetry is chosen by the public sector, why it took much longer to adopt, which challenges they had scaling the observability backend and how they are tackling the "noisy data problem".
Links we discussed in the episode:
Follow Hans Kristian on LinkedIn: https://www.linkedin.com/in/hansflaatten/
From 0 to 100 OTel Blog: https://nais.io/blog/posts/otel-from-0-to-100/?foo=bar
Cloud Native Day Bergen: https://2024.cloudnativebergen.dev/
Public Money, Public Code. How we open source everything we do! (https://m.youtube.com/watch?v=4v05Huy2mlw&pp=ygUkT3BlbiBzb3VyY2Ugb3BlbiBnb3Zlcm5tZW50IGZsYWF0dGVu)
State of Platform Engineering in Norway (https://m.youtube.com/watch?v=3WFZhETlS9s&pp=ygUYc3RhdGUgb2YgcGxhdGZvcm0gbm9yd2F5)

performance cloud agent norway native pitfalls nav kubernetes observability spans cncf platform engineering dynatrace public money k8s otel hans kristian

So you think you should Serverless? Things to know before you do with Sebastian Vietz!

Play Episode Listen Later Aug 26, 2024 61:30

Has one of the decision makers in your organization decided that you have to go "all in on technology X" because they saw a great presentation at a conference or got a great sales pitch from a vendor? If that is the case, then this episode is for you and you should forward it to those decision makers. Sebastian Vietz, Director of Reliability Engineering and Host of the Reliability Enablers Podcast, shares his thoughts on what to consider when picking a technology like Serverless. We discuss the importance of knowing limits, best-fit architectural patterns and things that should influence your technology decisions! Being aware of cold starts, a 20000 concurrent request limit or 512 MB being an ideal size for Lambda are just some of the things we can all learn from Sebastian.
Additional links we discussed:
Sebastian's LinkedIn: https://www.linkedin.com/in/sebastianvietz/
Reliability Podcast: https://podnews.net/podcast/ibe8k
More things on serverless: https://serverlessland.com/

director performance engineering reliability functions lambda serverless dynatrace reliability engineering

Observability that is Battle tested by Millions with Marco Sussitz and Wolfgang Ziegler

Play Episode Listen Later Aug 12, 2024 52:38

When your code runs on more than 6 million systems - many of them business critical - that is really exciting news for Marco and Wolfgang, members of the Dynatrace OneAgent Java team. Their code powers auto-instrumentation and collection of all observability signals of Java-based applications running on every possible stack: containers in k8s, serverless, VM, on your workstation or even the mainframe. Tune in as we sit down with Marco and Wolfgang to learn what it means to continuously innovate on agent-based instrumentation with 160+ other engineers across the globe who also focus on OneAgent. They share insights on how they develop their observability code, how they continuously test across all supported environments, what the processes at Dynatrace look like to avoid situations like the recent CrowdStrike outage and how they integrate and collaborate with other communities and tools such as OpenTelemetry!
Things we discussed during the episode:
Dynatrace OneAgent: https://www.dynatrace.com/platform/oneagent/
Dynatrace for Java: https://www.dynatrace.com/technologies/java-monitoring/
OpenTelemetry and Dynatrace: https://docs.dynatrace.com/docs/extend-dynatrace/opentelemetry
Jobs at Dynatrace: https://careers.dynatrace.com/

development millions java wolfgang vm crowdstrike ziegler observability battle tested dynatrace


