Incident Management
Goal
Update the core runbook technology to cater for the major incident use case.
Problems to solve
#Rigidity
Introduce a dynamic mode (allowing edits which were previously restricted to planning or paused runbook states)
#Visibility
Implement an activity feed
#Extensibility
Allow custom apps to be hosted within a runbook
Solutions
Take aways
- Context is everything, having a deep understanding of the client’s needs is essential in helping devise solutions that solve problems and make sense within the existing product.
- Everything is an event! By thinking in this way it is easy to introduce side effects that are asynchronous which results in a quick app
- Protect against race conditions. Often the sequence of events cannot be guaranteed. I learnt a lot about handling this accordingly.
- Compromise is key. There was a lot of debate around how we allowed 3rd party apps to be embedded. By developing the apps in house we can see common patterns and have a better understanding of how the apps are constructed. This allows us to create larger components that are less granular and less prone to design issues.