57 min listen
Incident Response with Emil Stolarsky
Incident Response with Emil Stolarsky
ratings:
Length:
52 minutes
Released:
Nov 21, 2017
Format:
Podcast episode
Description
As a system becomes more complex, the chance of failure increases. At a large enough scale, failures are inevitable. Incident response is the practice of preparing for and effectively recovering from these failures. An engineering team can use checklists and runbooks to minimize failures. They can put a plan in place for responding to failures.
The post Incident Response with Emil Stolarsky appeared first on Software Engineering Daily.
The post Incident Response with Emil Stolarsky appeared first on Software Engineering Daily.
Released:
Nov 21, 2017
Format:
Podcast episode
Titles in the series (100)
Hadoop Ops: Rocana CTO Eric Sammer Interview: Rocana applies big data, advanced analytics, and visualizations to dev ops in order to guide users to the root causes of problems. Eric Sammer is the co-founder and CTO of Rocana. At Cloudera, he served as an Engineering Manager responsible for tools a... by Cloud Engineering Archives - Software Engineering Daily