It is 7 AM; you awake after a night of uninterrupted slumber. Being on-call, you check for issues, was your pager out of batteries? Nope, things are quiet.
Imagine a world where outages are a myth. Where a failure occurs, but there is no customer impact and no engineer is engaged. This is the aspiration of Reliability Engineering - to operate complex distributed systems effectively, without customer facing outages or heavy operational burden.
In this 101 talk, I will share the basics every team should know to start their reliability journey off on the right foot.
VIDEOS RELATED TO MANAGING
Christine Spang, Cofounder and CTO at Nylas
Michael Machado, Head of Product at DevRev
Jossie Haines, Executive Coach/Technical Advisor at Jossie Haines Consulting
Jesse White, Chief Technical Officer at The OpenNMS Group
Raffi Krikorian , CTO at Emerson Collective
Ron Lichty, Interim VP Eng at Ron Lichty Consulting
Matt Powell, CTO at FTD
Maher Saba, VP of Remote Presence / Head of Eng at Meta
Steven Gaffney, President and CEO at Steven Gaffney Company
Vidal Graupera, Engineering Manager — Productivity Tools UI/UX at LinkedIn
Mark Porter, CTO at MongoDB
Mindaugas Mozūras, VP of Engineering at Vinted
Copyright © 2023 CTO Connection, All Rights Reserved