It is 7 AM; you awake after a night of uninterrupted slumber. Being on-call, you check for issues, was your pager out of batteries? Nope, things are quiet.
Imagine a world where outages are a myth. Where a failure occurs, but there is no customer impact and no engineer is engaged. This is the aspiration of Reliability Engineering - to operate complex distributed systems effectively, without customer facing outages or heavy operational burden.
In this 101 talk, I will share the basics every team should know to start their reliability journey off on the right foot.
VIDEOS RELATED TO MANAGING
Eliot Horowitz, Founding CTO at MongoDB
Katie Womersley, VP Engineering at Buffer
Colin Bodell, VP Shopify Plus RnD at Shopify
Vivek Sagi, CTO at Eventbrite
Belle Walker, Founder and Lead Consultant (prior roles include Director of Engineering) at Belleview Consulting
Neetu Rajpal, VP, Engineering at Oscar Health
Ale Paredes, VP Engineering at Code Climate
Dalia Havens, VP Engineering at Netlify
Jack Humphrey, VP Engineering at Indeed
Glyn Roberts, CTO of Digital Solutions at iTechArt
Claudius Mbemba, CTO at neu
James Kenigsberg, CTO at 2U
Copyright © 2024 CTO Connection, All Rights Reserved