Why systems fail and what you can do about it
Anurag Gupta
CEO & Founder
Shoreline
While running database services for AWS, Anurag Gupta, now Founder & CEO of Shoreline.io, learned the importance of going beyond monitoring and incident response to incident automation that resolves common (and uncommon) production issues without manual intervention. In this episode, he talks about the importance of automating production ops, why cloud hosting or containers alone don’t fully solve the problem, and how to think about building out an effective SRE team.

Copyright © 2024 CTO Connection, All Rights Reserved