Page It to the Limit focuses on what it means to operate software in production. Hosted by the PagerDuty Community Team, we cover the leading practices used in the software industry to improve both system reliability and the lives of the people responsible for supporting it.
Right before hoping on stage at PagerDuty on Tour London, Andy Slater joins us to chat about automation and how at Specsavers they scaled from 0 to over 1000 automated runbooks
Right before hoping on stage at PagerDuty on Tour London, Andrew White joins us to chat about their journey at Checkout.com to migrate from legacy operations, how AI helped this and how to balance cultural changes.
Right before their session at PagerDuty on Tour London, Laura Morgan joins us to chat about first incident stories, the future of AI in Spotify and more.
Dotan Horovits joins us this week to chat about what's been going on in the world of Open Source, license changes, and community forks.
It's the end of a very busy year, with a team that is half brand-new! The gang's all here for our 2024 wrap episode.
This week, Mandi talks to our newest Dev Advocate, Daniel Afonso. Daniel joins our team in Portugal.
This week, Groundcover CEO and Co-Founder Shahar Azulay joins us to talk eBPF and what's next in observability.
This week we chat with our newest Developer Advocate, Sid Verma.
Incidents are stressful, but we can use them to learn more about our systems in order to improve. What can we do about incidents that happen with the vendors we rely on? Jeff Martens returns to the show to talk about managing vendor incidents.
It's time to get back to learning! This month's book club pick is *Confident DevOps* by Mark Peters, and Mark joins Mandi to chat about the book.
This week Brandy Smith joins Mandi to talk all things Grafana, and some cool Raspberry Pi projects!
Sometimes an operations problem feels like waking up with amnesia on a spaceship orbiting another star. This month Joe Block joins Mandi to discuss Project Hail Mary by Andy Weir.
Time for some summer reading! Elora and Mandi talk about crypto, Walmart parking lots, and money laundering.
Refactoring code can be a big job and take weeks or months. But what about just doing a bit of 'tidying'? This month, we talk about Kent Beck's new book, *Tidy First?*.
This week we change gears a little bit to chat with Michelle Glauser, Founder and CEO of Techtonica, about starting and running an organization that helps underrepresented folks train for and obtain jobs in technology.
This month our book club features Flow Engineering: From Value Stream Mapping to Effective Action, out May 14, 2024 from IT Revolution Press. The authors, Steve Pereira and Andrew Davis, join Mandi to talk about the book, the practice of Value Stream Mapping, and achieving Flow in an organization.
Cloud development can be tricky. It needs many different skills related to both infrastructure and software. Here's where Wing comes in. It combines infrastructure and runtime code in one language. This helps developers stay focused and creative. The result? Faster, safer, and better software. Join us to see what makes Wing's way of doing things special.
Spotting a security issue, or even thinking you may have caused one can be nerve-wracking, and the last thing anyone wants is to accidentally create noise for another team. Getting to know your security team can help make it all a little less scary. Breanne Boland, Product Security Engineer at Gusto, joins us to talk all things security alerting and the steps to create a culture where others feel at ease reporting security concerns.
Dealing with legacy systems is challenging for both technical and organizational reasons. This book explores various aspects of dealing with older systems, organizing teams and projects to modernize them, and cope with the process. Hannele and Mandi cover some of the highlights of this excellent book.
Successful Site Reliability Engineering (SRE) teams are skilled in both software and systems engineering, allowing them to manage reliable, scalable systems. They proactively identify and address potential issues, use failures as learning opportunities, and automate processes to reduce toil. They also prioritize communication and collaboration with other teams to ensure service reliability and performance. Join us, as we discuss the journey of SRE teams at Adidas.
In this episode, we welcome Nora Jones, Founder and CEO of Jeli, which PagerDuty acquired in 2023. We talk with Nora about expanding incident response into incident management and learning from incidents to improve reliability.
What is a software *engineer*? Software drives so much of our everyday lives, yet software development as a field has not adopted the kind of licensure other engineering disciplines have long been subject to. Hannele and Mandi discuss this classic set of essays by Steve McConnell, covering many of the same questions we still have today.
Internal Developer Portals have become crucial for organizations seeking to enhance developer experience, reduce cognitive load, and adhere to company standards. In this episode, we welcome Dave Bresci, who explains why and how PagerDuty uses Backstage internally. Additionally, our Developer Advocate, Tiago Barbosa, will provide some relevant points on PagerDuty's plugin for Backstage.
In this episode, Kat chats with PagerDuty's own EMEA support team lead, John O'Donnell, about the challenges of scaling a CS Team across global offices, mentoring others, and building your own career, all while remembering to stop and take a breath once in a while
If you've ever been in a conference talk or been chatting with somebody and they say, 'You really should read this book. Here's this thing that I learned from this book that really made a difference for me', we've been there too, and we're going to read some of these books for you. Our first book club book is Close to the Machine: Technophilia and its Discontents by Ellen Ullman. Mandi is joined by Tara King to chat about the book and their thoughts.
On this episode, Heitor Lessa, Chief Architect at AWS, shares some insights on the journey of Powertools for AWS Lambda and the practices involved in growing and maintaining an open source community.
Platform Engineering. You've probably heard it mentioned, maybe discussed if your team would benefit from it. What is platform engineering, and why are organizations looking for solutions? This week we talk with Abby Bangser, Principal Engineer at Syntasso and lead of the Platforms Working Group for CNCF to learn more.
Our team has had a busy year. We've welcomed new members and gotten to chat with lots of great folks all over. This week we chatted about some of the things we enjoyed about 2023 and what we're looking forward to in 2024.
Use cases for the Python programming language are everywhere. This week we talk to Mike Fiedler, Security and Safety Engineer for PyPI about keeping those use cases secure, working in Open Source, and sending Python to space.
This week we talk to Matt Davis about how leaders can build a culture of learning in modern organizations. Leaders foster continuous learning opportunities for teams and help employees cope with environments where change is constant.
For our 100th episode, we reached out to the folks who have been at PagerDuty the longest, and asked them to share some of their stories with us and with you! These folks are from all over PagerDuty and had some amazing one-of-a-kind experiences.
Working with older monolithic applications can be cumbersome, and many organizations have taken the journey to devolve their monolith into smaller individual services. That path isn't for everyone, and JJ Asghar joins us to talk about when to just let your monolith keep on doing its thing.
J. Paul Reed returns to the show to talk more about a series of posts he published on Medium discussing how likely post-incident action items are to be completed.
This week we welcome to the show Manuel Pais who is the co-author of "Team Topologies: Organizing business and technology teams for fast flow". Manuel will walk us through some of the concepts that enable companies to deliver value more frequently and effectively to their customers by organizing their teams in an optimized way.
Threat modeling is one of those things that teams say they should be doing, but many never quite do it. Putting together a formal threat model with input from the whole organization is daunting. Where do you start? Where do you draw the line? In this episode we talk to Gene Gotimer about how to approach threat modeling without losing focus and getting too off track in what-ifs.
This week we check in with PagerDuty's CTO, Tim Armandpour, to learn more about his perspectives on the state of incident response, where reliability work is headed, and the challenges organizations are still facing.
This week we're joined by Mark Hatch, Senior Director of Infrastructure at Recharge, for a bit of a trip down memory lane. How has infrastructure, monitoring, and response changed for Operations folks in the last 20 years? Mark shares his experiences with us.
This is an exciting episode: we're welcoming a new team member to PagerDuty's Developer Advocacy team, as well as an additional host for this podcast! Listen in to meet Tiago Barbosa and learn more about what he's excited to bring to the PagerDuty community going forward.
In this episode, we dive into the world of artificial intelligence (AI) and its impact on developer experience. James Governor of Redmonk joined us to explore the varying opinions on AI, from the fear of a robot takeover to the integration of AI in our daily lives. Tune in for a discussion on how AI is shaping the future of developers and how we can seek balance amidst rapid innovation. _To add emphasis to a point made in this episode: host Kat Gaines asked ChatGPT to write the first draft of our episode description. Three versions + her human edits later, she felt we had landed on something publishable._
How and when you communicate about an incident is important for keeping stakeholders, users, and customers informed. It also has ramifications for your Support teams and how disruptive an incident can be for them. In this episode, PagerDuty DevOps Advocates Kat Gaines and Mandi Walls discuss the intricacies of communicating during an incident.
PagerDuty alum and former Page It host, Scott McAllister, returns to the pod in this episode to talk about what he's doing now and his experience jumping back into the startup world.
Managing work processes across an organization can be a challenge. Shared responsibilities and bottlenecks can cause confusion, stress, and delays. In this episode, we talk with Helen Beal about Value Stream Management, a practice for gathering insights on workstreams from ideas to value realization.
Many work processes require sophisticated orchestration of data among any number of different software applications. Low-Code and No-Code solutions can help folks create powerful automation to assist in this process. Sarah Ryan joins us to talk about how teams can benefit from these solutions.
In this episode, PagerDuty talks to PaigerDuty! Paige Cruz from Chronosphere joins us to discuss what a sustainable on-call culture really means, maintaining healthy on-call hygiene to avoid burning out the humans who have to respond to alerts, and more.
Startups have unique needs and challenges. Cloud computing platforms have made getting technical projects up and running far easier than they used to be, but there's still a place for help and advice. This week we talk with Chris Munns about AWS's startup program.
Are your customer support and engineering teams working in silos, unaware of what is happening on the other end? Kat Gaines and Rachel Stephens discuss some of the pitfalls that can lead to this situation, and how to avoid them for smoother customer operations and happier teams.
Modern application architectures are reliant on third party, cloud-based dependencies. Services might have dozens of external services they rely on for everything from search to shopping carts to authentication. How do you know your dependencies are meeting your needs? We talk to Jeff Martens, CEO & Co-Founder at Metrist, about this key component of reliability.
In this episode, another PagerDuty employee joins us to tell us aboout an unusual approach to problem solving! Listen in to hear Dylan Lingelbach all about applying DNA sequencing algorithms in unexpected places.
Educating developers is more than just on-boarding. It's continually preparing them to perform well at their job as demands and technology change. Join us as we talk with Eric Potter, Director of Developer Education at Sweetwater, about keeping developers educated.
The PagerDuty Community Team joined the Ops.IO Community earlier this year. We wanted to learn more about Ops.IO and the folks who started it.
The communities that grow up around software products can have many different characteristics. This week Mary and Julie chat with Mandi about developer communities and the people parts of developer 'marketing'.