2023-05-31 –, Music Hall
Incidents and outages are an inevitable reality for software engineers. While there are always many lessons to be learned from them, there are certain lessons that are often overlooked.
Video: https://youtu.be/qXabr_7wWas
Have you ever been in a tech incident? The kind that leaves your team scrambling, your boss furious, and your customers frustrated? They are inevitable. But do you ever wonder why we keep making the same mistakes?
I've seen my fair share of outages and incidents. And while we always walk away with practical takeaways (and that's great), there are certain lessons that just seem to slip through the cracks. The ones that we should have learned from before, but somehow, they keep happening. I've observed these patterns that make up the lessons (unfortunately) "not learned" from incidents and outages.
In this talk, I'm going to shine a spotlight on those lessons we just can't seem to learn. I'll delve into the psychology of incidents and explore some attitudes towards monitoring that need a serious overhaul. I'll share some practical practices that will help your team stay ahead of the game and avoid that all-too-familiar panic.
So buckle up and get ready for a self-therapy session like no other. It's time to face the truth and learn from our mistakes once and for all.