Tai Huynh
Aspiring to protect engineering organizations against the downside of unexpected failures and the lack of graceful degradation, I am leading a Chaos Engineering team to build scalable tooling to safely inject failures at scale, and facilitate gamedays to validate resilience hypothesis against reality.
Session
In this session, we’ll navigate the intricate landscape of distributed systems and discuss how Chaos Engineering offers a hands-on approach to gaining deeper insights into system behavior. We'll examine how teams leverage failure injection and error simulation to proactively identify weaknesses and strengthen resilience. From there, we'll dive into Gameday exercises, where teams deliberately push their systems to the limit to expose hidden resilience gaps. Finally, we’ll reflect on the current challenges of distributed systems and the realities teams face in maintaining resilience at scale.