Podcast Index

Podcasts

Explore podcasts by category, open recent episodes, and download audio to listen offline.

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering (Fexingo)
Billianon (Billianon)
Quick Reads (Pratham Malhotra)
Marketing Talks (Catherine and Tom)
Trends-Tendances (Trends-Tendances)
Veterans Corner Radio (Joe Muhlberger)
Your Career On Target (InterCoast Media Network)
Protect & Grow (Tim Stearns)
I AM HealingStrong (HealingStrong)
Trappin Tuesday's (Wallstreet Looks Like Us Now Network)
The Accountability Minute:Business Acceleration|Productivity (Anne Bachrach)
Property Prophets (Travis Wells)
The Golfi Real Estate Show (Rob Golfi)
Global Economic Press (Global Economic Press)
5 Minute AI News - Daily (Kitchen Table Media)
Bitcoin News Digest Podcast (Mike Richardson)
SILVER / GOLD / $UFD Meme Coin Investing & Global Economics (Ronald Branstetter)
박연미의 목돈연구소 (SBS)
Journey of Hope (Heart for Lebanon)
Bourseko MOAT (Bourseko)
Happy Work - Management & bien-être au travail (Gaël Chatelain-Berry)
Legalmente Productivos (Legalmente Productivos)
The Market Huddle (Patrick Ceresna & Kevin Muir)
Reuters World News (Stephani Schupbach)
On The Chain - Blockchain and Cryptocurrency News + Opinion (On The Chain)
Your Happy Hour (The Feels)
人性的弱点-解读分享 (终身都要成长)
Franklin Matters Radio (Steve Sherlock)
Mostly All Figured Out (Emily Rose)
The Rich Somers Report (Rich Somers)
Thoughtful Money with Adam Taggart (Adam Taggart | Thoughtful Money)
真誠presents 大久保佳代子・森本晋太郎のどうぞご自由に (CBCラジオ)
The Classy Career Girl Podcast (Classy Career Girl International, LLC)
Ekantik Vartalap (Bhajan Marg)
Due For A Win: The Atlantic City Podcast (Due For A Win / Kyle Askine and Craig Stone)
Accounting Best Practices with Steve Bragg (Steve Bragg)
Meditación del Día (Radio Ebenezer RD)
The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering

Business

Fexingo

How SRE Teams Use Incident Metrics to Improve Response

July 03, 2026 5:33pm 9 min

In this episode of The Site Reliability Podcast, Lucas and Luna dive into the world of incident metrics — not just DORA or SLOs, but the specific numbers that help SRE teams get faster and better at incident response. Th...

How SRE Teams Use Cost Optimization to Reduce Cloud Waste

July 03, 2026 5:22am 8 min

Episode 88 of The Site Reliability Podcast with Fexingo dives into how SRE teams can cut cloud costs without sacrificing reliability. Lucas and Luna discuss the rise of FinOps, the hidden waste in over-provisioned resour...

How SRE Teams Use Toil Budgets to Protect Engineering Time

July 02, 2026 5:25pm 11 min

Episode 87 of The Site Reliability Podcast explores toil budgets — a practice Google SRE pioneered to cap repetitive, non-valuable operational work. Lucas and Luna break down why Google set a 50% toil limit, how to measu...

How SRE Teams Use Structured Fails to Learn Faster

July 02, 2026 5:26am 10 min

In this episode of The Site Reliability Podcast, Lucas and Luna explore how SRE teams deliberately inject small, controlled failures into production not to break things but to build collective learning. They dissect the ...

How SRE Teams Use Post-Incident Reviews for System Improvements

July 01, 2026 5:28pm 8 min

In Episode 85 of The Site Reliability Podcast, Lucas and Luna explore how SRE teams turn post-incident reviews into actionable system improvements. They focus on a real-world case: a major streaming service's 2023 outage...

How SRE Teams Use Capacity Planning to Prevent Outages

July 01, 2026 7:47am 7 min

In this episode of The Site Reliability Podcast, Lucas and Luna dive into capacity planning for SRE teams — the proactive discipline that keeps systems running when traffic spikes. Using the example of a major streaming ...

How SRE Teams Use Chaos Engineering to Build Resilient Systems

June 30, 2026 5:31pm 11 min

Lucas and Luna dive into chaos engineering, using Netflix's Chaos Monkey and the Simian Army as the prime example. Lucas explains how Netflix intentionally broke its own systems in production to uncover weaknesses before...

How SRE Teams Use Cost of Delay to Prioritize Reliability Work

June 30, 2026 5:18am 12 min

Episode 82 of The Site Reliability Podcast examines how cost of delay — a concept borrowed from product development — helps SRE teams decide which reliability projects to tackle first. Lucas and Luna walk through a real ...

How SRE Teams Use Latency Budgets to Meet Performance SLOs

June 29, 2026 5:27pm 9 min

Lucas and Luna dive into latency budgets — a less-discussed SRE tool that maps acceptable delay across each microservice in a user request chain. They use the example of a social media app's photo upload feature: if the ...

How SRE Teams Use Runbooks to Streamline Incident Response

June 29, 2026 5:28am 13 min

In episode 80 of The Site Reliability Podcast, Lucas and Luna dive into the practical world of runbooks — the step-by-step guides that SRE teams use to respond to incidents faster and more consistently. They explore how ...

How SRE Teams Use Observability to Reduce Mean Time to Detect

June 28, 2026 5:21pm 8 min

Episode 79 of The Site Reliability Podcast looks at how modern SRE teams are using observability tools to shrink mean time to detect — the gap between a system failure and the team knowing about it. Hosts Lucas and Luna ...

How SRE Teams Use Service Level Agreements to Set Expectations

June 28, 2026 4:59am 8 min

Lucas and Luna dive into the often-overlooked difference between Service Level Agreements (SLAs) and Service Level Objectives (SLOs) in site reliability engineering. They explore how SLAs are not just legal documents but...

How SRE Teams Use Canary Deployments to Reduce Risk

June 27, 2026 5:28pm 10 min

Episode 77 of The Site Reliability Podcast dives into canary deployments: rolling out code changes gradually to a small subset of users before a full release. Lucas and Luna explain how companies like Netflix and Etsy us...

How SRE Teams Use DORA Metrics to Measure DevOps Performance

June 27, 2026 5:05am 10 min

In this episode of The Site Reliability Podcast, Lucas and Luna dive into DORA metrics — the four key DevOps Research and Assessment measures that elite SRE teams use to quantify software delivery and operational perform...

How SRE Teams Use Service Level Objectives to Drive Reliability

June 26, 2026 5:14pm 10 min

In this episode of The Site Reliability Podcast, Lucas and Luna dive into the practical use of Service Level Objectives (SLOs) in site reliability engineering. They discuss how a major European bank reduced pager fatigue...

How SRE Teams Use Blameless Culture to Improve Incident Response

June 26, 2026 5:10am 8 min

In this episode of The Site Reliability Podcast, Lucas and Luna dive into how a blameless culture can actually improve incident response times and reduce recurrence. They explore a real case from a mid-size SaaS company ...

How SRE Teams Use Blameless Postmortems to Build Trust

June 25, 2026 5:23pm 8 min

In Episode 73 of The Site Reliability Podcast, Lucas and Luna explore how blameless postmortems transform incident response culture. Using examples from a major e-commerce platform's 2024 database outage, they break down...

How SRE Teams Use Fault Tree Analysis to Prevent Root Causes

June 25, 2026 4:58am 11 min

In this episode of The Site Reliability Podcast, Lucas and Luna explore how SRE teams apply fault tree analysis (FTA) from aerospace and nuclear engineering to reduce incident recurrence. Using a real 2025 outage at a ma...

How SRE Teams Use AI for Incident Triage and Root Cause Analysis

June 24, 2026 5:21pm 11 min

Episode 71 of The Site Reliability Podcast with Fexingo dives into how SRE teams are applying large language models and AI assistants to accelerate incident triage and root cause analysis. Lucas and Luna examine a real c...

How SRE Teams Use Game Days to Test Incident Response

June 24, 2026 5:11am 6 min

In this episode of The Site Reliability Podcast, Lucas and Luna dive into the practice of game days — structured simulations where SRE teams deliberately inject failures to test their incident response and on-call proces...
