As AI transforms IT infrastructure, it’s also reshaping what it takes for IT operations teams to assure performance and maintain quality digital experiences.
In this episode, we’ll explore the new challenges facing ITOps teams as AI becomes more integrated into IT environments, covering key digital resilience strategies and important considerations.
CHAPTERS
00:00 Intro
00:54 AI & IT Infrastructure
03:15 Assuring Performance on Your AI Journey
04:55 Distributed Architecture
07:49 Digital Resilience
10:21 Catching Issues in the AI Era
11:47 Performance Problems
13:57 AI Readiness: A Journey, Not a Destination
15:07 Get in Touch
———
For additional insights, check out this Guide to Next-generation Assurance: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep5_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X @thousandeyes.
Service delivery chains are often made up of a longer string of dependencies than you might expect. When an outage happens, the root cause might not be in your systems or even with a third-party provider you depend on. It could actually trace back to yet another third-party provider they rely on.
We saw this phenomenon in action recently when some Cloudflare services were affected by an outage ultimately caused by Google Cloud issues.
Tune in to hear more about what happened at Google Cloud and Cloudflare, and also explore takeaways from a recent OpenAI outage.
———
CHAPTERS
00:00 Intro
00:50 Google Cloud Outage
04:20 Google Cloud & the Cloudflare Outage
08:26 OpenAI Outage
12:45 Outage Trends: By the Numbers
16:46 Get in Touch
———
For additional insights, check out the links below:
- Explore the Google Cloud outage further on the ThousandEyes platform (no login required): https://alksjqkoqpogyviwdqlukavercvlzlsi.share.thousandeyes.com/
- Google Cloud Outage Analysis: June 12, 2025: https://www.thousandeyes.com/blog/google-cloud-outage-analysis-june-12-2025?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep4_podcast
- The Guide to Next-generation Assurance: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep4_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X: @thousandeyes
Cloud monitoring requires holistic, end-to-end visibility across complex, interconnected environments rather than isolated metrics. Here are best practices CloudOps teams should keep in mind.
———
CHAPTERS
00:00 Intro
00:47 What CloudOps Should Focus On
13:14 Decision-making
16:25 AI
17:53 Get in Touch
———
For additional insights, check out the links below:
- The Ultimate Cloud Migration Survival Kit: https://www.thousandeyes.com/resources/the-ultimate-cloud-migration-survival-kit?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep3_podcast
- Cloud ROI: How To Measure Your Migration’s Impact: https://www.thousandeyes.com/blog/cloud-migration-roi?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep3_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X: @thousandeyes
Just because an outage is subtle, doesn’t mean it’s harmless. Learn how to catch those pesky “stealth outages” that can so easily slip under the radar, and also unpack recent service disruptions at Slack, Microsoft 365, and X.
CHAPTERS
00:00 Intro
00:56 Slack
08:16 Microsoft 365
11:22 X
13:26 Outage Trends: By the Numbers
16:26 Get in Touch
———
For additional insights, check out the links below:
- The Five Phases of Internet Outage Recovery: https://www.thousandeyes.com/resources/five-phases-internet-outage-recovery-infographic?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep2_podcast
- The Guide to Next-generation Assurance: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep2_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes.
Journey through the evolution of network architecture and explore what the future might hold in this conversation with APNIC’s Chief Scientist Geoff Huston.
Geoff and The Internet Report team will cover how the Internet has transformed significantly over the past four decades, scaling to meet rapidly growing demand. And they’ll also discuss how the challenge to “scale still more” continues today as the networking community evolves infrastructure to support emerging technologies like artificial intelligence (AI).
CHAPTERS
00:00 Intro
00:11 Meet Geoff Huston
02:56 The Shift to Asymmetry
10:58 The Challenge of Scale
22:07 Moore's Law and Networking
25:05 The Rise of CDNs
26:07 Name-driven Architecture
36:57 AI's Impact on Network Architecture
45:08 Get in Touch
ABOUT GEOFF HUSTON
Geoff Huston AM is Chief Scientist at the Asia Pacific Network Information Centre (APNIC), the Regional Internet Registry (RIR) for the Asia Pacific region. An industry veteran, Geoff researches Internet infrastructure, IP technologies, and address distribution policies, among other topics. Learn more, explore his recent articles and presentations, and connect with Geoff through his website, www.potaroo.net.
———
For additional insights, check out The Internet Report’s latest blog: https://www.thousandeyes.com/blog/internet-report-evolution-network-architecture?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep1_podcast
And to learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q4_internetreport_q4fy25ep1_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X: @thousandeyes
Dive into recent service disruptions at Zoom, Spotify, SAP Concur, and Vanguard UK, and explore what they reveal about troubleshooting best practices for ITOps teams.
Tune in now for insights from The Internet Report team or use the chapters below to jump to the sections that most interest you.
CHAPTERS:
00:00 Intro
00:52 Zoom Outage
04:40 SAP Concur Disruption
07:28 Spotify Outage
10:58 Vanguard Outage
13:59 By the Numbers
16:01 Get in Touch
———
For additional insights, check out the links below:
- The Internet Report’s latest blog: https://www.thousandeyes.com/blog/internet-report-troubleshooting-tips-zoom-spotify-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep6_podcast
- The Five Phases of Internet Outage Recovery: https://www.thousandeyes.com/resources/five-phases-internet-outage-recovery-infographic?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep6_podcast
- The Guide to Next-generation Assurance: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep6_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X: @thousandeyes
Packet loss can be bad news for network flows and customer experience. However, in our experience, NetOps teams tend to focus on major spikes in packet loss, while overlooking smaller amounts like 1 or 2%.
This might be a mistake.
Tune in for a deep dive into research findings suggesting that even 1% packet loss can significantly impact user experience—and recommendations for steps NetOps teams should take as a result.
———
CHAPTERS
00:00 Intro
01:07 The Surprising Impact of 1% Packet Loss
02:50 Research Methodology
08:17 Key Findings
13:55 Recommendations for NetOps Teams
16:48 Additional Research
22:30 Get in Touch
———
For additional insights on our packet loss research, explore all three parts of our Path Quality blog series:
- Path Quality Part 1: The Surprising Impact of 1% Packet Loss: https://www.thousandeyes.com/blog/path-quality-surprising-impact-one-percent-packet-loss?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep5_podcast
- Path Quality Part 2: Understanding the Impact of Packet Loss on Applications: https://www.thousandeyes.com/blog/path-quality-understanding-impact-packet-loss-applications?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep5_podcast
- Path Quality Part 3: Is BBR the Future of Congestion Avoidance?: https://www.thousandeyes.com/blog/path-quality-brr-future-congestion-avoidance?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep5_podcast
And to learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep5_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on X: @thousandeyes
———
ABOUT THE INTERNET REPORT
This is The Internet Report, a podcast uncovering what’s working and what’s breaking on the Internet—and why.
Tune in to hear ThousandEyes’ Internet experts dig into some of the most interesting outage events from the past couple weeks, discussing what went awry—was it the Internet, or an application issue?
Plus, learn about the latest trends in ISP outages, cloud network outages, collaboration network outages, and more.
Catch all the episodes on YouTube or your favorite podcast platform:
- Apple Podcasts: https://podcasts.apple.com/us/podcast/the-internet-report/id1506984526
- Spotify: https://open.spotify.com/show/5ADFvqAtgsbYwk4JiZFqHQ?si=00e9c4b53aff4d08&nd=1&dlsi=eab65c9ea39d4773
- SoundCloud: https://soundcloud.com/ciscopodcastnetwork/sets/the-internet-report
Go under the hood of recent service disruptions at X, Workday, and Mastercard—and explore why it’s so important to quickly (and accurately) identify the root cause of an outage.
———
CHAPTERS
00:00 Intro
00:59 X Outage
07:08 Workday Outage
11:00 Mastercard Service Disruption
14:48 By the Numbers
16:05 Get in Touch
———
For additional insights, check out The Internet Report’s latest blog: https://www.thousandeyes.com/blog/internet-report-service-disruptions-x-workday-mastercard?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep4_podcast
And to learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep4_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes
Dive into the recent Slack outage and disruptions at Microsoft 365, Grafana Cloud, and Otter.ai—plus, explore key takeaways for ITOps teams.
———
CHAPTERS:
00:00 Intro
00:48 Slack Outage
06:55 Microsoft 365 Outage
11:44 A Pair of Otter.ai Outages
14:21 Grafana Cloud Disruption
15:55 By the Numbers
17:58 Get in Touch
———
To learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep3_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes.
Outages connected to configuration mishaps were a common theme last year, and we’ve continued to see incidents like these in 2025. Configuration changes triggered two consecutive Asana outages in early February, and configuration or update-related issues may also have contributed to recent disruptions at Barclays, ChatGPT, Jira, and Discord.
Tune in to hear The Internet Report’s Mike Hicks unpack these incidents and discuss ways ITOps teams can guard against similar issues.
———
CHAPTERS:
00:00 Intro
01:06 Asana Outages
11:40 ChatGPT Disruption
19:34 Barclays Outage
21:57 Jira Outage
22:59 Discord Outage
24:31 By the Numbers
30:15 Get in Touch
———
For additional insights, check out The Internet Report’s latest blog: https://www.thousandeyes.com/blog/internet-report-configuration-mishaps-asana-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep2_podcast
And to learn more about how to deliver seamless digital experiences in a distributed IT landscape, read this eBook: https://www.thousandeyes.com/resources/guide-to-next-generation-assurance-ebook?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q3_internetreport_q3fy25ep2_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes.
What does it take to deliver successful digital experiences at major events like concerts and conferences? With special guest Dominic Hampton—Managing Director at attend2IT—we’ll explore the dynamic world of event IT and key takeaways ITOps teams at enterprise companies can apply to their own events as well as in their day-to-day operations.
We’ll also discuss insights from recent incidents that impacted Azure, Microsoft 365, and more.
CHAPTERS
00:00 Intro
01:34 Behind the Scenes of Event IT: Lessons for Enterprise ITOps
22:42 Microsoft Azure Incident
24:15 Microsoft 365 Disruption
25:31 Atlassian Bitbucket Cloud Outage
27:22 TikTok’s Shutdown
30:41 Get in Touch
———
ABOUT DOMINIC HAMPTON
Dominic Hampton is the Managing Director of attend2IT, a UK-based company that provides comprehensive IT services for events of all kinds, from music festivals to major corporate conferences. An IT industry veteran, Dom has more than two decades of experience in the space and has worked on events for many leading companies and organizations. Learn more and connect with Dom on LinkedIn: https://www.linkedin.com/in/attend2/
———
For additional insights, check out the links below:
The Internet Report blog: https://www.thousandeyes.com/blog/internet-report-event-it-best-practices?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep6_podcast
Webinar: Top Outages of 2024, Explained: Lessons in Digital Resilience: https://www.thousandeyes.com/resources/na-top-outages-2024-lessons-in-digital-resilience-webinar?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep6_podcast
*NOTE: The discussed Atlassian Bitbucket Cloud outage occurred on January 21, starting at 3:30 PM (UTC), not January 22.
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes
Configuration changes played an outsized role 2024 outages. Tune in to hear more about this and other outage trends—and learn how ITOps teams should plan accordingly in the year ahead.
We’ll also share insights from recent incidents at OpenAI and Google Cloud’s Pub/Sub, and dive deeper into a degradation incident that Netflix experienced at the end of 2024.
Read on to learn more, or use the chapters below to jump to the sections that most interest you.
CHAPTERS
00:00 Intro
00:58 Cloud Service Provider (CSP) Outages Continue To Rise
01:52 Accidental Misconfigurations Trending for Clouds and Apps
07:10 OpenAI Outage
09:55 Google Cloud’s Pub/Sub Disruption
14:47 Lessons From a Netflix Incident
18:57 Recent Outage Trends: By the Numbers
21:01 Get in Touch
———
For additional insights, check out the links below:
- The Internet Report blog: https://www.thousandeyes.com/blog/internet-report-configuration-change-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep5_podcast
- 2024 Outage Trends Solidify; Plus OpenAI & Meta Outages: https://www.thousandeyes.com/blog/internet-report-2024-outage-trends-openai-meta-outages?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep5_podcast
- Netflix Broadcast Disruption: Lessons for Major Live Events: https://www.thousandeyes.com/blog/netflix-disruption-analysis-november-15-2024?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep5_podcast
- And join our upcoming webinar, “Top Outages of 2024, Explained: Lessons in Digital Resilience.” We’ll unpack notable outages and performance degradations of 2024 and share lessons IT Operations teams can take away from these incidents to strengthen their digital resilience: https://www.thousandeyes.com/webinars/na-top-outages-2024-lessons-in-digital-resilience?utm_source=transistor&utm_medium=referral&utm_campaign=fy25q2_internetreport_q2fy25ep5_podcast
———
Want to get in touch?
If you have questions, feedback, or guests you would like to see featured on the show, send us a note at InternetReport@thousandeyes.com. Or follow us on LinkedIn or X at @thousandeyes