On call
1. Intro
With the increasing number of customers and end-users, Vital is now starting to feel the so-called growing pains. These are a mix of scalability issues, customers tapping into unforeseeable edge-cases, new features introducing bugs and 3rd Party Providers changing their APIs. Given the nature of our product as consumers of many 3rd Party, issues are bound to happen. Some of these bugs will be caught by our Observability capabilities, before they reach our customers. Others wonβt. Reliability and great customer service are two qualities that will make us stand above competition.
The goal of this document is to describe the expectations, actions and schedule of the on-call engineer. This document is related to non-working hours, rather than working hours support.
2. Expectations & Actions
The on-call engineer expectations are:
Monitor our services via:
#production-alerts
,#production-snitch
,#sandbox-alerts
,#sandbox-snitch
. The folllowing dashboards and links are useful:Monitor customers' channels and reply to their queries only if they are genuinely reporting downtime or you think it could represent an open issue that could result in a P0.
Acknowledging means making both the customer and / or the rest of Vital team aware.
Create a ticket marked as
Customer Issue
+On-Call
.We expect at most taking 60 minutes to reply to a customer.
For every ticket created by an on-call engineer, they need to set a priority. The following table should aid this endeavour:
Only P0s should be immediately actioned by the on-call engineer. In case they feel they are not able to solve it, they should contact a second engineer that might be able to help. All other issues (P1 and P2) should be solved during regular working hours.
3. Schedule
Weekends - 9am to 10PM GMT
Week days - 6:30PM to 10PM GMT
We rotate this cover between all engineers, taking on call for 1 week every 5 weeks (less frequency as we grow the team). You are paid $500/week regardless of your location.
It is expected to acknowledge issues in a 30 minutes window. Likely you need to set Slack alerts on your phone, so you can be notified if a customer raises an issue. You should also keep an eye on the previously mentioned Slack channels, for any out of ordinary pattern.
Last updated