It is 7 AM; you awake after a night of uninterrupted slumber. Being on-call, you check for issues, was your pager out of batteries? Nope, things are quiet.
Imagine a world where outages are a myth. Where a failure occurs, but there is no customer impact and no engineer is engaged. This is the aspiration of Reliability Engineering - to operate complex distributed systems effectively, without customer facing outages or heavy operational burden.
In this 101 talk, I will share the basics every team should know to start their reliability journey off on the right foot.
VIDEOS RELATED TO MANAGING
Dan Langevin, CTO at Vericred, Inc.
Mark Van de wiel, VP of Technology at Fivetran
Ramana Satyavarapu, CTO at Finix
Ron Lichty, Interim VP Eng at Ron Lichty Consulting
Joe Bradley, Chief Scientist, SVP Data Science at LivePerson
Claudius Mbemba, CTO at Spritz (formerly Neu)
Kevin Goldsmith, CTO at Anaconda
Kimberly Wiefling, Cofounder at Silicon Valley Alliances
Mona Soni, CTO at Sustainable1 at S&P
Ann Lewis, Senior Advisor for Technology Delivery at US SBA
Labhesh Patel, CTO at Jumio
Jossie Haines, VP Software Engineering & Head of DE&I at Tile
Copyright © 2024 CTO Connection, All Rights Reserved