Reliability 101
Kolton Andrus
CTO
Gremlin
It is 7 AM; you awake after a night of uninterrupted slumber. Being on call, you check for issues: was your pager out of batteries? Nope, things are quiet. Imagine a world where outages are a myth, where a failure occurs but there is no customer impact and no engineer is engaged. This is the aspiration of Reliability Engineering: to operate complex distributed systems effectively, without customer-facing outages or a heavy operational burden. In this 101 talk, I will share the basics every team should know to start their reliability journey off on the right foot.

Copyright © 2022 CTO Connection, All Rights Reserved