It is 7 AM; you awake after a night of uninterrupted slumber. Being on-call, you check for issues, was your pager out of batteries? Nope, things are quiet.
Imagine a world where outages are a myth. Where a failure occurs, but there is no customer impact and no engineer is engaged. This is the aspiration of Reliability Engineering - to operate complex distributed systems effectively, without customer facing outages or heavy operational burden.
In this 101 talk, I will share the basics every team should know to start their reliability journey off on the right foot.
VIDEOS RELATED TO MANAGING
Kareen Kircher, Founder at DevOps Advisors
Nick Rockwell, CTO at New York Times
Stacy Gorelik, Director Engineering at Flatiron Health
Robin Ducot, CTO at SurveyMonkey
Dalia Havens, Senior Director of Engineering at Netlify
James Spivey, Director of Engineering at Shutterstock
Claudius Mbemba, CTO at neu
Kathy Keating, CTO & Co-Founder at Apostrophe
Zach Beer, Manager of DevOps at InRule Technology
Mercedes Bernard, Engineering Manager at Tandem
Jenny Farver, CTO at LightStream
Jonathan Graham, CTO at Transaction Assurance Group
Copyright © 2024 CTO Connection, All Rights Reserved