It is 7 AM; you awake after a night of uninterrupted slumber. Being on-call, you check for issues, was your pager out of batteries? Nope, things are quiet.
Imagine a world where outages are a myth. Where a failure occurs, but there is no customer impact and no engineer is engaged. This is the aspiration of Reliability Engineering - to operate complex distributed systems effectively, without customer facing outages or heavy operational burden.
In this 101 talk, I will share the basics every team should know to start their reliability journey off on the right foot.
VIDEOS RELATED TO MANAGING
Scott Gerlach, Co-founder - CSO at Stackhawk
Kimberly Wiefling, Cofounder at Silicon Valley Alliances
Lisa van Gelder, VP Engineering at Avvir.io
Vadim Supitskiy, CTO at Forbes
Mike Boufford, CTO at Greenhouse
Emily Nakashima, VP Engineering at Honeycomb.io
Adam Zimman, Strategic Advisor at LaunchDarkly
Jonathan Moore, Founder & CEO at RowdyOrb.it
Sam Schillace, CVP, Deputy CTO at Microsoft
Marko Gargenta, Founder at PlusPlus.co
Neha Agarwal, Director of Engineering at Thumbtack
Loick Michard, Global Head of Engineering at CoderPad
Copyright © 2024 CTO Connection, All Rights Reserved