This is the Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. Catch our Outage Deep Dive series for special coverage of Internet outages. We go under the hood to determine what happened, covering key lessons and ways IT teams can minimize downtime in similar situations. Also tune in every other week for the Pulse Update podcast series to hear from the Internet experts at ThousandEyes as they share the latest data on ISP outages, public cloud provider network outages, collaboration app network outages, and more. Then, the hosts dig into some of the most interesting outage events from the last few weeks.
Dive into recent service disruptions at Zoom, Spotify, SAP Concur, and Vanguard UK, and explore what they reveal about troubleshooting best practices for ITOps teams.Tune in now for insights from The Internet Report team or use the chapters below to jump to the sections that most interest you.CHAPTERS:00:00 Intro00:52 Zoom Outage04:40 SAP Concur Disruption 07:28 Spotify Outage10:58 Vanguard Outage13:59 By the Numbers16:01 Get in Touch———For additional insights, check out the links below:- The Internet Report's latest blog: https://www.thousandeyes.com/blog/internet-report-troubleshooting-tips-zoom-spotify-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep6_podcast- The Five Phases of Internet Outage Recovery: https://www.thousandeyes.com/resources/five-phases-internet-outage-recovery-infographic?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep6_podcast- The Guide to Next-generation Assurance: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep6_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X: @thousandeyes
Packet loss can be bad news for network flows and customer experience. However, in our experience, NetOps teams tend to focus on major spikes in packet loss, while overlooking smaller amounts like 1 or 2%.This might be a mistake. Tune in for a deep dive into research findings suggesting that even 1% packet loss can significantly impact user experience—and recommendations for steps NetOps teams should take as a result.———CHAPTERS00:00 Intro01:07 The Surprising Impact of 1% Packet Loss02:50 Research Methodology08:17 Key Findings13:55 Recommendations for NetOps Teams16:48 Additional Research22:30 Get in Touch———For additional insights on our packet loss research, explore all three parts of our Path Quality blog series:- Path Quality Part 1: The Surprising Impact of 1% Packet Loss: https://www.thousandeyes.com/blog/path-quality-surprising-impact-one-percent-packet-loss?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep5_podcast- Path Quality Part 2: Understanding the Impact of Packet Loss on Applications: https://www.thousandeyes.com/blog/path-quality-understanding-impact-packet-loss-applications?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep5_podcast- Path Quality Part 3: Is BBR the Future of Congestion Avoidance?: https://www.thousandeyes.com/blog/path-quality-brr-future-congestion-avoidance?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep5_podcastAnd to learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep5_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes———ABOUT THE INTERNET REPORTThis is The Internet Report, a podcast uncovering what's working and what's breaking on the Internet—and why. Tune in to hear ThousandEyes' Internet experts dig into some of the most interesting outage events from the past couple weeks, discussing what went awry—was it the Internet, or an application issue?Plus, learn about the latest trends in ISP outages, cloud network outages, collaboration network outages, and more.Catch all the episodes on YouTube or your favorite podcast platform:- Apple Podcasts: https://podcasts.apple.com/us/podcast/the-internet-report/id1506984526- Spotify: https://open.spotify.com/show/5ADFvqAtgsbYwk4JiZFqHQ?si=00e9c4b53aff4d08&nd=1&dlsi=eab65c9ea39d4773- SoundCloud: https://soundcloud.com/ciscopodcastnetwork/sets/the-internet-report
Go under the hood of recent service disruptions at X, Workday, and Mastercard—and explore why it's so important to quickly (and accurately) identify the root cause of an outage.———CHAPTERS00:00 Intro00:59 X Outage07:08 Workday Outage11:00 Mastercard Service Disruption14:48 By the Numbers16:05 Get in Touch———For additional insights, check out The Internet Report's latest blog: https://www.thousandeyes.com/blog/internet-report-service-disruptions-x-workday-mastercard?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep4_podcastAnd to learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep4_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes
Dive into the recent Slack outage and disruptions at Microsoft 365, Grafana Cloud, and Otter.ai—plus, explore key takeaways for ITOps teams.———CHAPTERS:00:00 Intro00:48 Slack Outage06:55 Microsoft 365 Outage11:44 A Pair of Otter.ai Outages14:21 Grafana Cloud Disruption15:55 By the Numbers17:58 Get in Touch———To learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep3_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes.
Outages connected to configuration mishaps were a common theme last year, and we've continued to see incidents like these in 2025. Configuration changes triggered two consecutive Asana outages in early February, and configuration or update-related issues may also have contributed to recent disruptions at Barclays, ChatGPT, Jira, and Discord.Tune in to hear The Internet Report's Mike Hicks unpack these incidents and discuss ways ITOps teams can guard against similar issues.———CHAPTERS:00:00 Intro01:06 Asana Outages11:40 ChatGPT Disruption19:34 Barclays Outage21:57 Jira Outage22:59 Discord Outage24:31 By the Numbers30:15 Get in Touch———For additional insights, check out The Internet Report's latest blog: https://www.thousandeyes.com/blog/internet-report-configuration-mishaps-asana-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep2_podcastAnd to learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep2_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes.
What does it take to deliver successful digital experiences at major events like concerts and conferences? With special guest Dominic Hampton—Managing Director at attend2IT—we'll explore the dynamic world of event IT and key takeaways ITOps teams at enterprise companies can apply to their own events as well as in their day-to-day operations.We'll also discuss insights from recent incidents that impacted Azure, Microsoft 365, and more.CHAPTERS00:00 Intro01:34 Behind the Scenes of Event IT: Lessons for Enterprise ITOps22:42 Microsoft Azure Incident24:15 Microsoft 365 Disruption25:31 Atlassian Bitbucket Cloud Outage27:22 TikTok's Shutdown30:41 Get in Touch———ABOUT DOMINIC HAMPTONDominic Hampton is the Managing Director of attend2IT, a UK-based company that provides comprehensive IT services for events of all kinds, from music festivals to major corporate conferences. An IT industry veteran, Dom has more than two decades of experience in the space and has worked on events for many leading companies and organizations. Learn more and connect with Dom on LinkedIn: https://www.linkedin.com/in/attend2/———For additional insights, check out the links below:The Internet Report blog: https://www.thousandeyes.com/blog/internet-report-event-it-best-practices?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep6_podcastWebinar: Top Outages of 2024, Explained: Lessons in Digital Resilience: https://www.thousandeyes.com/resources/na-top-outages-2024-lessons-in-digital-resilience-webinar?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep6_podcast*NOTE: The discussed Atlassian Bitbucket Cloud outage occurred on January 21, starting at 3:30 PM (UTC), not January 22.———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes
Configuration changes played an outsized role 2024 outages. Tune in to hear more about this and other outage trends—and learn how ITOps teams should plan accordingly in the year ahead.We'll also share insights from recent incidents at OpenAI and Google Cloud's Pub/Sub, and dive deeper into a degradation incident that Netflix experienced at the end of 2024.Read on to learn more, or use the chapters below to jump to the sections that most interest you.CHAPTERS00:00 Intro00:58 Cloud Service Provider (CSP) Outages Continue To Rise 01:52 Accidental Misconfigurations Trending for Clouds and Apps07:10 OpenAI Outage09:55 Google Cloud's Pub/Sub Disruption14:47 Lessons From a Netflix Incident18:57 Recent Outage Trends: By the Numbers21:01 Get in Touch———For additional insights, check out the links below:- The Internet Report blog: https://www.thousandeyes.com/blog/internet-report-configuration-change-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep5_podcast- 2024 Outage Trends Solidify; Plus OpenAI & Meta Outages: https://www.thousandeyes.com/blog/internet-report-2024-outage-trends-openai-meta-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep5_podcast- Netflix Broadcast Disruption: Lessons for Major Live Events: https://www.thousandeyes.com/blog/netflix-disruption-analysis-november-15-2024?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep5_podcast- And join our upcoming webinar, “Top Outages of 2024, Explained: Lessons in Digital Resilience.” We'll unpack notable outages and performance degradations of 2024 and share lessons IT Operations teams can take away from these incidents to strengthen their digital resilience: https://www.thousandeyes.com/webinars/na-top-outages-2024-lessons-in-digital-resilience?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep5_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes
With nearly a year of data available, the topline outage trends for 2024 are coming into focus. Tune in to see what the numbers are showing.The Internet Report team will discuss how Internet service provider (ISP) outage numbers are continuing to increase, while cloud service provider (CSP) outages are also becoming more frequent, indicating a changing landscape in service reliability. They'll also unpack the recent OpenAI and Meta outages.———CHAPTERS:00:00 Intro00:49 Outage Trends Across 202407:37 OpenAI Outage13:10 Meta Outage18:48 Get in Touch ———For additional insights, check out The Internet Report's latest blog: https://www.thousandeyes.com/blog/internet-report-2024-outage-trends-openai-meta-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep4_podcastAnd join our upcoming webinar, “Top Outages of 2024, Explained: Lessons in Digital Resilience.” We'll unpack notable outages and performance degradations of 2024 and share lessons IT Operations teams can take away from these incidents to strengthen their digital resilience: https://www.thousandeyes.com/webinars/na-top-outages-2024-lessons-in-digital-resilience?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep4_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn and X at @thousandeyes
The past few weeks are somewhat of a representative sample of 2024 from an outage perspective, with connectivity issues and updates at the root of the four recent incidents.Both DigitalOcean and real-time payments provider Worldline experienced connectivity issues to data centers that made services unreachable. Meanwhile, Microsoft and Reddit encountered problems following changes to their systems that appeared to have unexpected user impacts and had to be rolled back. Tune in to hear The Internet Report team unpack these incidents and discuss the latest outage trends.———CHAPTERS:00:00 Intro00:50 Reddit Server Errors04:33 Issues for Microsoft 36509:42 DigitalOcean's Network Issues10:52 Worldline's Payment “Perturbations”12:40 Outage Trends: By the Numbers15:06 Get in Touch ———For additional insights, check out this week's The Internet Report blog: https://www.thousandeyes.com/blog/internet-report-digitalocean-reddit-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep3_podcastAnd to learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep3_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes
Powerful things happen when ITOps teams move beyond a break-fix approach and lean into proactive optimization. Instead of just responding to issues as they occur, when teams have independent visibility into their end-to-end service delivery chain, they can proactively identify possible areas for optimization and improvement. For example, streamlining one small part of a complex process could shave seconds off the total transaction time; do this for every part of the process, and the efficiency savings can quickly add up.In recent weeks, it appeared OpenAI's ChatGPT may have been undergoing this type of optimization, as we observed material improvements in page load times and some evidence suggesting possible configuration changes and re-architecture in the pursuit of performance improvements.Read on to learn more about what we saw at ChatGPT, as well as insights from incidents at companies including Grammarly, Bluesky, and Netflix, or use the chapters below to jump to the sections that most interest you.CHAPTERS00:00 Intro00:48 ChatGPT Disruptions08:17 Grammarly Issue10:59 Netflix Issues12:09 Bluesky Disruptions13:26 Salesforce Outage14:50 Verizon Fios Disruption16:14 Outage Trends: By the Numbers18:36 Get in Touch———For additional insights on understanding and proactively optimizing digital experiences, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep2_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
The Digital Operational Resilience Act (DORA) goes into effect on January 17, 2025, and financial institutions serving the EU will need to meet an enhanced set of requirements related to risk management, network resilience, and incident reporting.While DORA is directly applicable to EU financial institutions, it prompts important discussions about resilience and ensuring digital experiences that are relevant for all IT operations teams, regardless of industry or region.Tune in to the podcast to hear The Internet Report team and special guest Bernie Clairmont, Product Solution Architect at ThousandEyes, dive deeper into DORA.As usual, we'll also unpack recent outages from the last few weeks, examining the DDoS attacks against the Internet Archive as well as some power and cooling problems at Google Cloud. We'll also revisit a previous Azure outage and discuss the latest global outage numbers.——— CHAPTERS:00:00 Intro01:20 DORA: What ITOps Teams Need to Know18:38 BMO Disruption21:16 Incident Retrospective: Azure Virtual Desktop Outage25:00 Google Cloud Outage27:47 Internet Archive Outage29:35 Get in Touch ———For more insights, check out the link below:- Blog: Introducing Adaptive API Monitoring: https://www.thousandeyes.com/blog/introducing-adaptive-api-monitoring?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy25q2_internetreport_q2fy25ep1_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
A recent Salesforce outage highlighted the limitations of status pages and the importance of considering a variety of data points when identifying the source of an outage.Tune in to hear The Internet Report team discuss what happened and why. They'll also share insights from a recent Microsoft Outlook outage and cover the latest Internet outage trends.Listen now or use the chapters below to jump to the sections that most interest you.CHAPTERS00:00 Intro00:48 Salesforce Outage10:00 Microsoft Outlook Outage14:43 Outage Trends: By the Numbers17:22 Get in Touch ———For additional insights, check out the latest Internet Report blog: https://www.thousandeyes.com/blog/internet-report-salesforce-microsoft-outage?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy25q1_internetreport_q1fy25ep6_podcastAnd for more on troubleshooting digital experiences across owned and unowned networks, watch this webinar featuring The Internet Report's Mike Hicks: https://www.thousandeyes.com/resources/apjc-troubleshooting-digital-experiences-across-owned-unowned-networks-webinar?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy25q1_internetreport_q1fy25ep6_podcast ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
A recent certificate problem impacted ServiceNow, and other issues prevented users from accessing key cloud services including Microsoft 365, Azure Virtual Desktop, and Workday.Tune in to hear what happened during these incidents and a separate data center fire that caused a Reliance Jio outage for customers across multiple areas of India.Listen now or use the chapters below to jump to the sections that most interest you.CHAPTERS00:00 Intro00:59 ServiceNow Outage03:20 Microsoft 365 Outage04:35 Azure Virtual Desktop Outage05:50 Workday Outage09:39 Reliance Jio Outage13:06 Outage Trends: By the Numbers15:16 Get in Touch ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
During high-traffic seasons like Black Friday or a much-anticipated product launch, maintaining good digital experiences for customers is vital. We've all heard tales of floods of eager shoppers crashing a website during a major sale—leaving them unable to make their coveted purchases. To guard against a breakdown like this during high-traffic periods, companies sometimes use various traffic management strategies such as digital waiting rooms.In this episode, The Internet Report team discusses the pros and cons of traffic management and looks at the different techniques used by ticketing platforms for the upcoming Oasis reunion tour concerts. They also cover an AT&T issue that impacted Microsoft, as well as disruptions at Akamai, Alibaba Cloud, and Cloudflare.Listen to the full episode now or use the chapters below to skip to the sections that most interest you.CHAPTERS00:00 Intro00:54 Oasis Reunion Ticket Issues10:16 Microsoft Outage13:00 Akamai Outage14:40 Alibaba Cloud Outage16:28 Cloudflare Incident17:40 Outage Trends: By the Numbers18:40 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Let's dive into the fascinating world of subsea cables. With special guest Murray Burling—Executive Director of Oceans and Environment at RPS—we'll explore the current subsea cable ecosystem and chat about what the future might hold.Tune in for insights on how important subsea cables are for today's digital experiences, how decisions are made on where to place them, the consequences of cable cuts, and route diversity and Internet resilience.CHAPTERS00:00 Intro02:29 Current Subsea Cable Ecosystem07:16 Subsea Cable Cuts15:15 Route Diversity & Internet Resilience18:51 What's Next22:05 Get in TouchABOUT MURRAY BURLINGMurray Burling is the Executive Director of Oceans and Environment at RPS. Murray has extensive experience as a consulting coastal engineer, oceanographer, and marine modeler, undertaking multifaceted studies in Australia, New Zealand, South East Asia, and the Middle East. He has managed the implementation of large web-based data and model applications and his team supports data acquisition and analysis for a wide range of industries developing and operating in the world's oceans. These sectors include conventional and renewable energies, shipping, ports, and communications. He is also an expert in complex dataset analysis and visualization, and has authored and co-authored many technical reports and publications.Connect with Murrary on LinkedIn: https://www.linkedin.com/in/murray-burling-a527839a/———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Explore the recent Google Cloud and GitHub outages, plus get insights from a network perspective into the August 12 X livestream event featuring Elon Musk and Donald Trump.In the case of Google Cloud, a power issue in one of its European regions impacted connectivity and affected several services and networking equipment. The problems disrupted connectivity into the region as well as some Partner Interconnect connections and associated routes between other Google regions.Traffic to and from GitHub.com encountered an issue when a database configuration change resulted in critical services unexpectedly losing connectivity.And when access issues during the August 12 X “Spaces” livestream were attributed to a network load-related issue, traffic behavior observed during the event suggested a different technical explanation.Tune in to learn more about these three events or use the chapters below to jump to the sections that most interest you.CHAPTERS:00:00 Intro00:47 Google Cloud Outage05:03 GitHub Outage11:26 X Spaces Insights14:16 Outage Trends: By the Numbers15:57 Get in Touch ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
This week, The Internet Report team and special guest Dave Anderson—a tech industry veteran and co-host of "A Very Melbourne Podcast," which covers the Australian Football League and more—are chatting about how to assure great digital experiences at major sporting events.Large sporting events are always logistically complex, and today that's even more the case with digital technology permeating every part of operational and experience delivery. And due to the real-time nature of live sports, any glitch can have a big impact on fan experiences—whether they're at the stadium or joining in from their living rooms.For the teams managing this web of digital services that brings sports content to global audiences, it's vital to guard against potential issues, as well as quickly detect and remediate problems when they do occur. Tune in to learn more or use the chapters below to skip to the sections that are most interesting to you.CHAPTERS00:00 Intro01:50 Digital Experiences in the Sports World08:57 The In-Stadium Experience13:18 Hybrid Sports: Catching the Game From Anywhere18:38 Sports Betting & Digital Experiences23:06 The Athlete Experience29:00 Key Takeaways31:48 Get in Touch———ABOUT DAVE ANDERSONA tech industry veteran, Dave Anderson currently serves as the Head of Product Marketing and Strategy at Contentsquare. He also co-hosts "A Very Melbourne Podcast," which covers the Australian Football League and more.———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes———ABOUT THE INTERNET REPORTThis is The Internet Report, a podcast uncovering what's working and what's breaking on the Internet—and why. Tune in to hear ThousandEyes' Internet experts dig into some of the most interesting outage events from the past couple weeks, discussing what went awry—was it the Internet, or an application issue?Plus, learn about the latest trends in ISP outages, cloud network outages, collaboration network outages, and more.Catch all the episodes on YouTube or your favorite podcast platform:- Apple Podcasts: https://podcasts.apple.com/us/podcast/the-internet-report/id1506984526- Spotify: https://open.spotify.com/show/5ADFvqAtgsbYwk4JiZFqHQ?si=00e9c4b53aff4d08&nd=1&dlsi=eab65c9ea39d4773- SoundCloud: https://soundcloud.com/ciscopodcastnetwork/sets/the-internet-report
On July 19, many organizations around the globe—including airlines, banks, and hospitals—experienced outages as Windows machines reportedly got stuck in a boot loop that ultimately resulted in the Blue Screen of Death (BSOD). These disruptions had a common source: an update from CrowdStrike, a managed detection and response (MDR) service used to protect Windows endpoints from attack. Tune in to hear The Internet Report team's insights on this CrowdStrike update and the ensuing IT outages. We'll also dive into the separate Azure outage that occurred just hours before, as well as some other recent disruptions.———CHAPTERS:00:00 Intro00:48 CrowdStrike Sensor Update Incident05:40 Azure Outage10:47 Workday Outage12:11 Grammarly Incidents14:36 Outage Trends: By the Numbers16:57 Get in Touch ———For more insights, check out The Internet Report blog: https://www.thousandeyes.com/blog/internet-report-crowdstrike-update-azure-outage?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q4_internetreport_q4fy24ep6_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
On May 17, X reached a major milestone when the social media platform completed its full migration from twitter.com to x.com. While the number and frequency of outages did increase after the company's acquisition by Elon Musk, following the domain migration, there don't appear to have been any significant disruptions to the X.com platform. In this week's podcast, The Internet Report team discusses what they observed during (and after) the domain migration, and analyzes X's performance pre- and post-acquisition. ——— CHAPTERS:00:00 Intro00:55 Performance in the Twitter Era05:13 The Sale and Post-sale Periods12:25 Domain Migration15:47 Outage Trends: By the Numbers17:34 Get in Touch ———For more insights, explore the domain migration in the ThousandEyes platform (no login required): https://avlwlcpshpfzyqsusguxxurkhgexgpxx.share.thousandeyes.com———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Three recent outages at Starlink, Charles Schwab, and the Internet Archive highlight key reminders for NetOps teams around backup options, the role of intelligence, and understanding your end-to-end service delivery chain.A subset of Starlink users were unable to establish a connection; some users of Schwab.com and its apps may have found themselves unable to transact or trade due to an authentication issue; and the Internet Archive and the Wayback Machine were intermittently overwhelmed by unexpected traffic floods.Tune in to learn more about what happened and why, or use the chapters below to jump to the sections that most interest you: CHAPTERS:00:00 Intro00:54 Starlink Outage07:35 Charles Schwab Outage10:20 DDoS Attack Impacts Internet Archive13:33 Outage Trends: By the Numbers16:12 Get in Touch ———For more insights, check out The Internet Report blog: https://www.thousandeyes.com/blog/internet-report-schwab-starlink-outages?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q4_internetreport_q4fy24ep4_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Believe it or not, we're already about halfway through 2024. Looking at the outage data from this year so far, we see continued evolution, following patterns observed over the past few years. Notably, the percentage of cloud service provider (CSP) outages is still increasing—though at a more accelerated rate than seen in recent years.Tune on to learn more about this trend and other themes we're noticing in the Internet ecosystem, as well as tips for how IT teams can respond to these evolving challenges.———CHAPTERS:00:00 Intro01:14 Cloud Incident Trends Accelerate13:56 Blending Real Usage Data With Synthetics16:35 AI & NetOps20:52 Get in Touch ———For more insights, check out The Internet Report blog: https://www.thousandeyes.com/blog/internet-report-cloud-outages-rise-2024-trends?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q4_internetreport_q4fy24ep3_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
When it comes to assuring great digital experiences for your users, intermittent issues can be incredibly difficult to discover and diagnose because the service is both working and not working simultaneously—or, it may simply be running slow. Some users may experience issues, while for others, everything will work just fine.In this week's episode, The Internet Report team will explore the complexities that intermittent issues can bring by examining two recent incidents at Meta and Salesforce. They'll also cover an automation bug at Google Cloud that caused problems for a range of customers, as well as recent Internet outage trends.Listen now or use the timestamps below to jump to the sections that most interest you:CHAPTERS00:00 Intro01:00 Meta Disruption05:07 Salesforce DNS Disruption10:08 Google Cloud Issues Impact Spotify, GitLab, and More15:09 Outage Trends: By the Numbers17:14 Get in Touch ———For more insights, check out the related Internet Report blog: https://www.thousandeyes.com/blog/internet-report-meta-salesforce-disruptions?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q4_internetreport_q4fy24ep2_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Explore what happened during recent outages at google.com, X (formerly Twitter), and CDN service jsDelivr. The Internet Report team will also discuss why a detailed understanding of every component in your service delivery chain is vital to maintain the availability and resiliency of your service. If even one component encounters challenges, the entire service can be impacted.In jsDelivr's case, for example, the detail at issue was an expired cert, which created problems serving content and impacted many websites that rely on the CDN service.Listen now to learn more or use the timestamps below to jump to the sections that most interest you:CHAPTERS:00:00 Intro00:50 google.com Outage05:23 X Outage10:33 jsDelivr Outage15:01 Outage Trends: By the Numbers17:27 Get in Touch ———For more insights, check out the links below:- The Internet Report: Pulse Update blog: https://www.thousandeyes.com/blog/internet-report-x-jsdelivr-google-outages?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q4_internetreport_q4fy24ep1_podcast- Explore the X and google.com outages in the ThousandEyes platform (no login required): X outage: https://aitcvdgvbgkzdxlzjvazipwezvmjeljr.share.thousandeyes.comgoogle.com outage: https://azraqdwpahxlrnviylmbvpumoonabsnh.share.thousandeyes.com———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Go under the hood of a ChatGPT outage, H&R Block's Tax Day disruption, and more incidents from the past few weeks. The Internet Report team will also discuss Microsoft's update on recent subsea cable cuts and the latest global outage trends.———CHAPTERS:00:00 Intro00:57 ChatGPT Outage03:35 Revisiting West Coast of Africa Cable Cuts09:07 H&R Block Outage11:32 Sky Mobile Outage12:25 Outage on unpkg CDN14:06 PlayHQ Outage16:40 Outage Trends: By the Numbers19:33 Get in Touch ———For more insights, check out the links below:- The Internet Report: Pulse Update blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-chatgpt-outage-more-news?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse34_podcast- Explore the ChatGPT outage in the ThousandEyes platform (no login required): https://akhdjionjdupmjvkqfyigkaorlxgzgir.share.thousandeyes.com- Review Microsoft's post-incident report on the west coast of Africa cable cuts (see the entry labeled March 14, tracking ID: VT60-RPZ): https://azure.status.microsoft/en-gb/status/history/- Also watch Microsoft's related Azure Incident Retrospective video: https://www.youtube.com/watch?v=ypU_UuwW_w8&list=PLmsFUfdnGr3xomlYbZPAYTtFdkcvbv2ye———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
With tax season coming to a close in the United States, IT teams at tax preparation companies and other organizations in the industry will be taking extra care to make sure that their systems can handle a spike in traffic due to a potential last-minute rush of filings. Tune in to hear The Internet Report hosts discuss how IT teams can navigate major spikes in demand and give customers the best possible digital experience, whether it's Tax Day, Black Friday, or another high-traffic period.They'll also unpack recent outages at companies including Apple and WhatsApp, and explore why a sudden reduction in page load times may indicate functional issues rather than true improvements to the customer's digital experience.———CHAPTERS:00:00 Intro01:11 Navigating U.S. Tax Season and Spikes in Demand06:37 Pulse Update: Outage Analysis06:40 Apple Outage: App Store, TV+, and Music13:35 WhatsApp, Other Meta Services Disrupted15:04 Square Outage17:19 Performance as a Cyber Indicator: XZ Utils20:48 Panera Bread Outage22:33 Ray AI Disruption24:13 Outage Trends: By the Numbers27:05 Get in Touch ———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
The end-to-end delivery of modern digital services can introduce a complex web of dependencies and failure points, which can stem from direct relationships as well as third-party providers, introducing layers of abstraction for operations teams to keep track of. Managing this complex ecosystem can be challenging. Unexpected issues may arise from seemingly insignificant components, surprising even the largest, most technologically sophisticated organizations.For example, in recent weeks, problems at third-party providers led to outages at McDonald's and the Department of Motor Vehicles. Tune in to the episode to hear what happened and explore other incidents from the last couple of weeks.——— CHAPTERS00:00 Intro01:09 Payment and Ordering Systems Cut Out at McDonald's04:36 Disruptions at the DMV08:48 Cable Cut in West Africa10:16 Another Fiber Cut, but With Layered Redundancy14:12 PlayStation Network, GeForce Now Experience Issues16:21 Get in Touch ———For more insights, check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-mcdonalds-dmv-outages?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse32_podcast- Explore the Africa undersea cable cut in the ThousandEyes platform (no login required): https://alxucgibmtsxarllvvybwfagzkvvlokf.share.thousandeyes.com/———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Over a two-day period this past week, major social media platforms—Meta's Facebook and Instagram, LinkedIn, and Discord—all experienced disruptions. In the same timeframe, Comcast was also impacted by an outage that affected access to specific services and applications.Meta experienced issues with its log-in process, Discord navigated unexpectedly high load volumes, Comcast dealt with 100% packet loss in part of its backbone, and—the following day—LinkedIn worked its way through a backend issue.These incidents each leave valuable reminders for NetOps teams as they seek to minimize downtime and assure exceptional digital experiences.Tune in to the episode to learn more and also gain insight into the latest outage numbers and trends.——— CHAPTERS:00:00 Intro00:58 Comcast Outage05:09 Meta Outage09:38 Discord Disruption10:25 LinkedIn Outage13:22 DIRECTV Service Disruption15:25 Outage Trends: By the Numbers17:03 Get in Touch ———For more insights, check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-meta-linkedin-comcast-outages?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse31_podcast- Explore the outages in the ThousandEyes platform (no login required):Comcast Outage: https://afplzaqpehynhosldcqefycfhqqcrbzh.share.thousandeyes.com LinkedIn Outage: https://anxjzezcpptrnpodjucgzzixdkldguzm.share.thousandeyes.com - Internet Outages Timeline: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse31_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Load is a fundamental but, at times, challenging variable for networks and operations teams to handle. In the past few weeks, ThousandEyes saw various load-related problems impact organizations including Google Cloud, Front, several Australian banks, and Minnesota State University Moorhead.Tune in to learn more about what happened during these incidents, as well as hear our commentary on the recent outage impacting AT&T. Use the timestamps below to jump to the sections that most interest you: CHAPTERS:00:00 Intro00:59 AT&T outage impacts cellular services nationwide04:40 Australian banks appear to lose online and app-based services for 24 hours07:46 Google Cloud metadata store faces sudden demand spike09:19 Front's “large unexpected increase in web traffic”10:23 Box experiences outage as third-party network component fails11:23 Minnesota State University Moorhead's case study on good visibility12:44 Outage trends: By the numbers15:35 Get in touch ———For more insights, check out these links:- Explore the Front disruption in the ThousandEyes platform (no login required): https://aczocbxpamiipkdqnaqdedkytwhlfjtw.share.thousandeyes.com?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse30_podcast- Internet Outages Timeline: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse30_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
When outages happen, it's what you do next that matters. It's important to have a backup plan in place that you can quickly activate to minimize the impact of an incident.Over the past two weeks, companies initiated a range of resiliency actions, including asking customers to use alternate authentication methods (or to avoid logging out of a service), setting up a new contact center to re-establish lines of communication, and reverting to manual processes.Tune in to learn more about what happened during these and other recent incidents.CHAPTERS:00:00 Intro01:00 Square Outage Impacts “Multiple” Services04:18 Applied Digital's Multi-week Data Center Issue07:14 UC Berkeley Data Center Outage08:44 Russia .ru Domain Outage10:05 Cyber Attack at Lurie Children's Hospital11:36 Outage Trends: By the Numbers15:55 Get in Touch———For more insights on outage trends and analysis of some of the most notable outages of 2023, check out our on-demand webinar: https://www.thousandeyes.com/resources/amer-top-outages-2023-analyses-takeaways-webinar?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse29_podcastIn EMEA and want to tune in live? We're hosting one more live webinar session on Feb. 22 at 10 AM (GMT). Register now: https://www.thousandeyes.com/webinars/emea-top-outages-2023-analyses-takeaways?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse29_podcastAnd also check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-square-outage-and-more-news?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse29_podcast- Explore the .ru domain outage in the ThousandEyes platform (no login required): https://awuyqonlmzxmvizsdcgevvdhcbzervks.share.thousandeyes.com?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse29_podcast- Internet Outages Timeline: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse29_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
The ThousandEyes Internet Intelligence team joins us from Cisco Live in Amsterdam, talking about a major theme from the event—security.Tune in to hear their thoughts on how visibility can help companies in their security efforts, the sovereignty of data in flight, and why you don't have to choose between security and performance.———CHAPTERS00:00 Intro01:09 Evolving Security Landscape04:53 Security Excellence & Optimal Digital Experience10:13 Sovereignty of Data in Flight14:57 Key Takeaways15:55 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. And follow us on X: @thousandeyes
What happened during the recent Microsoft Teams and Azure disruptions? Go under the hood of these incidents and also explore other recent disruptions in this week's Pulse Update.CHAPTERS- 01:03 Network issue leads to Microsoft Teams service disruption- 04:09 Azure Resource Manager exhausts capacity, causing service issues- 06:20 Oracle Cloud experiences network outage- 09:56 Jira users encounter 503s and other errors- 10:30 Sage outage impacts South Africa- 11:08 Red Hat experiences four search-related incidents- 11:45 Recent outage trends and numbersFor more insights on outage trends and analysis of some of the most notable outages of 2023, check out our on-demand webinar: https://www.thousandeyes.com/resources/amer-top-outages-2023-analyses-takeaways-webinar?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse28_podcastIn EMEA and want to tune in live? We're hosting one more live webinar session on Feb. 22 at 10 AM (GMT). Register now: https://www.thousandeyes.com/webinars/emea-top-outages-2023-analyses-takeaways?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse28_podcastAnd also check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-microsoft-teams-azure-outage?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse28_podcast- Internet Outages Timeline: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q3_internetreportpulse28_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
What caused recent dips in performance for OpenAI's ChatGPT? Tune in to hear The Internet Report team unpack this and other recent disruptions, including a hack that led to an outage at the Spanish branch of the Orange mobile network, and a blip for customers of the cloud services provider DigitalOcean.They'll also cover the outage trends they're seeing in 2024 so far and how extreme cold weather can cause problems for data centers.For more insights on outage trends and analysis of some of the most notable outages of 2023, register for the upcoming Top Outages of 2023 webinar: https://www.thousandeyes.com/webinars/amer-top-outages-2023-analyses-takeaways?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q2_internetreportpulse27_podcastAlso check out these links:- Blog: 2023 Internet Outage Trends & the New Outage Landscape: https://www.thousandeyes.com/blog/internet-report-pulse-update-2023-internet-outage-trends?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q2_internetreportpulse27_podcast- Internet Outages Timeline: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q2_internetreportpulse27_podcast———CHAPTERS00:00 Intro01:12 Two Consecutive Service Degradations at ChatGPT04:43 Hack Leads to Orange Spain Outage13:05 DigitalOcean Disruption15:55 By the Numbers23:26 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
As they launch into 2024, organizations are facing a different outage landscape than they had at the start of 2023. The past year saw increases in cloud service provider (CSP) outages, application outages, and the percentage of U.S.-centric outages—all of which point to an evolution in the way outages happen and the need for different strategies to minimize the impact of disruptions.In this episode, Mike Hicks (Principal Solutions Analyst at ThousandEyes) unpacks these trends and shares practical tips for mitigating disruptions and optimizing performance. Listen on YouTube or tune in on your favorite podcast platform.And for more insights, check out these resources:- Top Outages of 2023 Webinar: https://www.thousandeyes.com/webinars/amer-top-outages-2023-analyses-takeaways?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q2_internetreportpulse26_podcast (*In APJC? Check out the APJC session: https://cisco.webex.com/webappng/sites/cisco/meeting/register/f22aa6a322284e07abf0350b255c88c8?ticket=4832534b0000000658abcedc3030906188c1af83e52ab18645645592a5962e7bdd3a5f2afdc393ae×tamp=1705102128048&RGID=r57f263db115ebc6efa4c0a05429caa6f)- Internet Outages Timeline: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q2_internetreportpulse26_podcast———CHAPTERS00:00 Intro00:38 Cloud Service Provider Outages Trending Up02:30 Percent of U.S.-centric Outages Rises06:55 Application Outages Up in 202309:55 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
As 2023 comes to a close, in the spirit of Dickens' holiday classic “A Christmas Carol,” let's reflect on the valuable insights left by the ghosts of network operations teams past, present, and yet to come. Tune in to hear host Mike Hicks (Principal Solutions Analyst at ThousandEyes) discuss lessons from the NetOps teams of the past, the current state of NetOps, and what the future might hold—all with the goal of helping teams take steps to optimize performance and deliver delightful digital experiences in 2024.And also check out Mike's related article in TechRadar: https://www.techradar.com/pro/the-ghosts-of-network-operations-past-present-and-yet-to-come———CHAPTERS00:00 Intro01:06 NetOps Past06:24 NetOps Present12:14 NetOps Yet to Come20:35 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. And follow us on X: @thousandeyes
Recent changes appeared to trigger a series of events for two peering points internationally—with very different impacts. Tune in to learn more about these incidents, why they differed, and the lessons they leave.Mike Hicks, Principal Solutions Analyst at ThousandEyes, will also cover the latest outage numbers and explore other recent incidents, including an Oracle Cloud outage and a duo of disruptions at Alibaba Cloud.Interested in more outage analysis? Check out our Internet Outages Timeline, which covers several notable Internet outages and application issues from the past year, along with the lessons they leave: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q2_internetreportpulse24_podcast———CHAPTERS00:00 Intro00:45 Optus Outage02:07 AMS-IX Outage06:50 Oracle Cloud Outage08:39 Duo of Alibaba Cloud Incidents 09:44 By the Numbers13:13 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
As companies gear up for Black Friday, The Internet Report team shares some best practices for delivering great customer experiences and minimizing downtime during one of the retail industry's biggest days of the year. Mike Hicks, Principal Solutions Analyst at ThousandEyes, will cover some helpful case studies of Black Fridays that experienced some hiccups and what you can do to guard against similar disruptions.To learn more, check out the link below: - https://www.thousandeyes.com/blog/internet-report-episode-54-black-friday-2023———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. And follow us on Twitter: @thousandeyes
Backend-related incidents have been a recurring theme in outages across 2023, caused by everything from data center issues and hardware mishaps to failures at common (shared) services.Recently, we saw two examples of these backend issues when data center power problems led to outages at both Cloudflare and Workday.Tune in to hear more about what happened at Cloudflare and Workday, as well as our analysis of disruptions at OneLogin and GitLab.———CHAPTERS00:00 Intro01:00 OneLogin Disruption05:22 GitLab.com Availability Issues09:14 Workday and Cloudflare Outages31:16 Get in Touch———For more insights, check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-workday-cloudflare-outages?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q2_internetreportpulse23_podcast- Interested in more outage analysis? Check out our Internet Outages Timeline, which covers several notable Internet outages and application issues from the past year, along with the lessons they leave: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q2_internetreportpulse23_podcast———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
This Halloween, The Internet Report team is sharing some of their most thrilling (and chilling) networking tales.Pull up a chair (and a big bowl of your favorite Halloween candy) to hear what happened—and important lessons learned.———CHAPTERS00:00 Intro01:40 Haunting obstacles with a dynamic routing protocol that thwarted crew changes on an oil platform10:00 A spooky code base rollout that unleashed memory leak mischief18:58 A chilling application rollout that failed to deliver on user expectations around the globe29:45 Mysterious application issues that sent shivers down spines, before they were discovered to be caused by a wicked broadcast storm42:43 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. And follow us on X: @thousandeyes
In recent weeks, back-end infrastructure work and other backend-related issues impacted various online and consumer banking services, including DBS and Citibank in Singapore.Simple front-facing customer experiences that we've become accustomed to today can often mask considerable complexity on the backend. The service delivery chain of technologies powering the front end often comprises a mix of on-premises assets, cloud services, containers, and APIs.A degradation or outage to just one of those components can have massive impact. Depending on the architecture of the app and resilience of the backend, an incident in one part can be routed around in the best case scenario, or take down critical systems for hours in the worst case.Tune in to this episode to learn more about how backend changes led to outages at DBS, Citibank, and a number of Japanese banks—and how other backend issues appeared to contribute to a Google Cloud VMware Engine disruption and potentially also a Microsoft Exchange incident.For more insights, check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-dbs-citibank-outages?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q1_internetreportpulse22_podcast- Explore the Equinix issues that impacted DBS and Citibank: https://ajhrlohbopohbnmekzbcvrbeslqaijfr.share.thousandeyes.com/- Interested in more outage analysis? Check out our Internet Outages Timeline, which covers several notable Internet outages and application issues from the past year, along with the lessons they leave: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q1_internetreportpulse22_podcast———CHAPTERS00:00 Intro00:47 The Download04:10 By the Numbers06:40 Equinix Chiller Upgrade Leads to DBS, Citibank Outages in Singapore23:19 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
Outages and degradations can happen when underlying data isn't fresh enough. In recent weeks, stale data may have contributed to incidents at both Slack and Cloudflare. Slack began experiencing issues when, by our best guess, its app stopped trusting the freshness of the data in the cache; and, separately, Cloudflare's 1.1.1.1 DNS resolver ran into some issues related to stale root zone data.Watch this Pulse Update episode to hear more about the Cloudflare and Slack outages, and also explore recent disruptions at Google.For more insights, check out these links:- Explore the Slack outage in the ThousandEyes platform (NO LOGIN REQUIRED): https://apiyhhcphzaowmpqpyxrtdgggadiiujg.share.thousandeyes.com- Interested in more outage analysis? Check out our Internet Outages Timeline, which covers several notable Internet outages and application issues from the past year, along with the lessons they leave: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=na_fy24q1_internetreportpulse21_podcast———CHAPTERS00:00 Intro01:11 The Download04:46 By the Numbers09:30 Slack Outage: Cached Data Freshness Issues23:08 Cloudflare Outage: Resolvers Use Stale Root Zone29:55 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X (formerly Twitter): @thousandeyes
Providing great digital experiences relies on a complex service delivery chain. The past few weeks brought multiple reminders that the root cause of cloud and app disruptions often comes down to one single link in this chain. While the component at issue may appear small, if it's not functioning normally, the consequences can be significant. Additionally, the impact of a malfunctioning “link” is often intensified by a lack of understanding or visibility into the entire end-to-end service delivery chain, especially in situations where a change is made outside standard operating procedures or pipelines.In this episode, explore how this phenomenon appeared to play out recently when .au domains failed to resolve, as well as during disruptions at Salesforce and Microsoft Azure.For more insights, check out these links:- Explore the .au incident in the ThousandEyes platform (NO LOGIN REQUIRED): https://ajdaombojgmvnbvsclhtvgmrskaicjvo.share.thousandeyes.com/- Internet Outages Timeline: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=internetreportpulseep20———CHAPTERS00:00 Intro00:54 The Download06:01 By the Numbers08:23 .au Domains Fail to Resolve15:42 PlayStation Network Disruption21:10 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X (formerly Twitter): @thousandeyes
In a world that operates at “hyperscale,” the potential for hyperscale-sized problems is also very real. The measure of a good provider—and a well-engineered system—is how well they handle these anomalous conditions and minimize disruption.During recent weeks, some of these hyperscale-sized outages hit, including data center-focused disruptions that impacted companies like Square, Oracle OCI, NetSuite, and Microsoft Azure. Tune into this Pulse Update episode to go under the hood of these outages and discover how the companies responded—and important lessons learned.For more insights, check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-square-down?utm_source=transistor&utm_medium=referral&utm_campaign=internetreportpulseep19- Explore the Square outage in the ThousandEyes platform (NO LOGIN REQUIRED): https://akinmwcoyjwhlwmhnykikzqkxcltwasv.share.thousandeyes.com———CHAPTERS00:00 Intro00:59 The Download04:33 By the Numbers09:02 Square Outage23:08 Oracle OCI, NetSuite, and Microsoft Azure Outages32:19 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X (formerly Twitter): @thousandeyes
An outage occurs, a change is rolled back, and everything stabilizes. But what happens when the change is attempted a second time?These second tries often go much more smoothly. While another outage might still occur during this “take two,” the impact is usually far less severe. The engineering team has learned from what went wrong the first time and is ready to stop at the first hint of trouble. Slack recently experienced a pair of disruptions that appear to illustrate this “take two” scenario: a longer disruption resulting from a routine database cluster migration, followed by a much shorter outage a few weeks later that also involved database work, potentially indicative of related work that went more smoothly.And for more insights, check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-slack-x-outage?utm_source=transistor&utm_medium=referral&utm_campaign=internetreportpulseep18- Explore the Slack and X disruptions in the ThousandEyes platform (NO LOGIN REQUIRED): Slack: https://afkmcwbeszwdtqqpvouwgjolywiugryx.share.thousandeyes.com/X: https://adcsnhfupsardmzyocrxqdcvriengkew.share.thousandeyes.com———CHAPTERS00:00 Intro00:47 The Download04:06 By the Numbers05:41 Slack Disruptions09:25 X Disruptions20:20 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X (formerly Twitter): @thousandeyes
Context matters when working on a distributed web-based application or service where everything is linked and dependent on each part functioning correctly. It's all too easy for one team to make a change that unexpectedly affects something another team is working on. Or the combined impact of both changes may also accidentally break something.To avoid such mishaps, teams should cut back on silos as much as possible.However, it's hard to completely eliminate siloed operations or decision-making. But the potential negative effects of silos can be reduced if each team has a view of the end-to-end service that's tailored to their specific area or domain—that is, presented to them in a context that they understand.Tune in to this week's episode to learn more about mitigating silos and also explore lessons from recent disruptions at Slack, Spotify, and Wells Fargo.And for more insights, check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-slack-outage?utm_source=transistor&utm_medium=referral&utm_campaign=internetreportpulseep17- Explore the Slack disruption in the ThousandEyes platform (NO LOGIN REQUIRED): https://apiijiaoljxvlpnzjkxwwlcqmmcovppx.share.thousandeyes.com/———CHAPTERS00:00 Intro00:49 The Download04:35 By the Numbers06:55 Slack Disruption27:18 Spotify Disruption33:15 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X (formerly Twitter): @thousandeyes
In an end-to-end service delivery chain, isolated changes can have broad consequences. This played out recently when an erroneous SSL certificate change at Microsoft appeared to cause a SharePoint Online and OneDrive for Business outage.While this incident definitely underscores the importance of valid security certificates, it's also a reminder of what can happen when even one component in an end-to-end service delivery chain experiences issues. Every component needs to work in sync to maintain the service's availability. As a result, all changes, especially manual ones, should be made with care and teams should have a deep understanding of every dependency and interconnection within their service delivery chain.Watch this week's episode to learn more and explore other recent outages that impacted Slack, Starbucks, and NASA.And for more insights, check out these links:- The Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-sharepoint-outage?utm_source=transistor&utm_medium=referral&utm_campaign=internetreportpulseep16- Explore the OneDrive outage in the ThousandEyes platform (NO LOGIN REQUIRED): https://arcbfeptuostafskynuemdpgwcyerodc.share.thousandeyes.com———CHAPTERS00:00 Intro00:43 The Download04:03 By the Numbers06:09 SharePoint Outage21:47 Slack Outage25:31 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on Twitter: @thousandeyes
Let's face it. Not every contingency can be planned for. Sometimes an outlier scenario pops up and causes an unexpected outage or disruption.Over the past few weeks, multiple companies appeared to be impacted by such edge cases: Azure; GitLab; and Meta's WhatsApp, Facebook, Instagram, and Threads—its newest addition.Tune into the latest Pulse Update episode to learn more about what happened during these disruptions and why robust visibility is so important for navigating unexpected outlier scenarios.And for more insights, check out these links:- Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-azure-disruption?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp15- Explore the Azure disruption in the ThousandEyes platform (NO LOGIN REQUIRED): https://augfulkplwamllucivbbxxahisxddgay.share.thousandeyes.com- Cloud Performance Report: https://www.thousandeyes.com/resources/cloud-performance-report-2022?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp15———CHAPTERS00:00 Intro00:41 The Download03:34 By the Numbers05:26 Azure Disruption12:01 GitLab Outage18:20 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on Twitter: @thousandeyes
The application opens, but users encounter errors when they try to do anything—what gives? It's the curious case of the disappearing backend. Discover why application issues often show up like this, with the service reachable but unresponsive beyond rendering a basic landing page, and sometimes an accompanying error message.In this episode, hosts Mike Hicks and Brian Tobia discuss this common problem and explore related incidents at CBA, GitHub, and Microsoft Teams. They also unpack other recent outage trends and disruptions, including the UK emergency services outage.To learn more, check out these links:- Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-explaining-application-outages?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp14- Explore the Microsoft Teams outage in the ThousandEyes platform (NO LOGIN REQUIRED): https://asinlehhlglxpowzmpqnuiyrzmniyozf.share.thousandeyes.com———CHAPTERS00:00 Intro00:26 The Download04:55 By the Numbers07:08 Microsoft Teams Outage13:55 GitHub Outage16:42 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on Twitter: @thousandeyes
Though network outages are still far more common, application outages seem to be increasing in 2023—and having bigger impacts. Tune in to learn more about this trend and dive into incidents at Okta and Instagram. Host Mike Hicks will also explore other outage trends from the first half of the year in this special episode reflecting on the state of the Internet in 2023 thus far.To learn more, check out these links:- Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-application-outages-increasing?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp13- Explore the Instagram outage and Okta disruption in the ThousandEyes platform (NO LOGIN REQUIRED): Instagram: https://azavwfwqcgxyeqjyhwkicqxgsqwtcmzq.share.thousandeyes.com/Okta: https://awoleuudwuvnwklukifbrpghghynjjwy.share.thousandeyes.com- Internet Outages Timeline: https://www.thousandeyes.com/resources/internet-outages-timeline?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp13- Microsoft Outage Analysis: https://www.thousandeyes.com/blog/microsoft-outage-analysis-january-25-2023?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp13- AWS Outage Analysis: https://www.thousandeyes.com/blog/aws-outage-analysis-june-13-2023?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp13———CHAPTERS00:00 Intro00:40 The Download04:02 2023 Outage Trends: By the Numbers11:37 Instagram Outage17:20 Okta Disruption21:07 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on Twitter: @thousandeyes
For three consecutive years, there appears to have been a spike in outages and degradations in May. A potential “spring cleaning effect” may explain why. Tune in to learn more about this possible trend and explore what happened during recent incidents at Twitter; Microsoft 365; Slack; Instagram; Apple's iMessage; and subscription-based streaming service, Max (formerly known as HBO Max).After watching, check out these links to dive deeper: Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-spring-cleaning-outage-spike?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp12Explore the Instagram outage in the ThousandEyes platform (NO LOGIN REQUIRED): https://aljqalczdkfpdqjynukcshuifepbwasi.share.thousandeyes.com———CHAPTERS00:00 Intro01:22 The Download07:38 Outage Trends: By the Numbers11:40 Twitter Outage15:01 Microsoft 365 Outages16:23 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on Twitter: @thousandeyes
Tune in to explore ways that outages can impact distributed software development teams and what companies can learn from recent incidents at GitHub, Google Cloud, and Apple.To learn more, check out these links: Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-outages-and-distributed-dev-teams?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp11Explore the GitHub service degradation in the ThousandEyes platform (NO LOGIN REQUIRED): https://agiebiuwxkwqowctctfvdaazvvfpxzew.share.thousandeyes.com/———CHAPTERS00:00 Intro00:39 The Download04:40 Outage Trends: By the Numbers09:22 GitHub Service Degradation19:13 Update: Google Cloud Outage22:45 Apple Authentication Issues25:56 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on Twitter: @thousandeyes And if you want to connect with Mike in person, join us for the Cisco Live conference from June 4 - June 8 in Las Vegas. Register now and be sure to stop by the ThousandEyes booth: https://www.thousandeyes.com/events/2023/cisco-live?utm_source=youtube.com&utm_medium=referral&utm_campaign=InternetReportPulseEp11 And don't miss Mike's talk on optimizing IT operations with ThousandEyes and OpenTelemetry: https://www.ciscolive.com/global/learn/technical-education/session-catalog.html?search=BRKAPP-2731#/
When it comes to your technology strategy, it's a good idea to have more than one way to access every resource—just in case. As IT environments have changed, so has the thinking around the right approaches to achieve this desired redundancy.Two recent incidents at Google Cloud and Microsoft 365 reinforce the importance of redundancy—and the need for evolving strategies to meet this goal.To learn more, check out these links: Internet Report: Pulse Update Blog: https://www.thousandeyes.com/blog/internet-report-pulse-update-redundancy-in-cloud-era?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp10Cloud Performance Report: https://www.thousandeyes.com/resources/cloud-performance-report-2022?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp10———CHAPTERS00:00 Intro00:35 The Download03:13 Outage Trends: By the Numbers06:58 Google Cloud Outage18:12 Microsoft 365 Outage25:24 Get in Touch———Want to get in touch?If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on Twitter: @thousandeyes And if you want to connect with Mike and Kemal in person, join us for the Cisco Live conference from June 4 - June 8 in Las Vegas. Register now and be sure to stop by the ThousandEyes booth: https://www.thousandeyes.com/events/2023/cisco-live?utm_source=transistor&utm_medium=referral&utm_campaign=InternetReportPulseEp10And don't miss Kemal's breakout session on rethinking network monitoring: https://www.ciscolive.com/global/learn/technical-education/session-catalog.html?search=BRKAPP-2013#/