DENOG13

Two tools for monitoring and debugging large L3 Clos networks
11-08, 15:05–15:15 (Europe/Berlin), Main Stage

Booking.com runs large parts of its infrastructure on on-premises L3 Clos networks of non-trivial size. This talk presents two tools: our in-house end-to-end monitoring system, which checks and reports on the health of the network and helps us verify our SLOs, and an ad-hoc tool used to debug ECMP issues in these networks.

Ralf is a meteorologist by trade (with a diploma and everything), but doesn't look good on camera, so he has been trying to get paid for playing with computers and networks for the last 20 years instead.