I've had a lot of experiences lately with application downtime, from both vendors and my own applications. This has got me thinking and as best I can google there isn't really a good or standard way of managing customer communication during downtime incidents.
I've seen this handled a lot of ways from the "blame everyone but us" approach to the "we screwed up and we're sorry" approach.
So my questions are... when you screw up with an app and cause downtime:
- Do you admit fault immediately? (Should you, legally?)
- How much info do you give the customer regarding what went wrong? ("An issue" vs. "A code syntax error in one of our SQL queries")
- Do you come back with a follow-up prevention plan, or just leave it at "this has been resolved"?
- Do you provide real-time updates? How often? Via Twitter or public-facing website?
Any other best practices for this that you've found successful?
Here is what I do:
Send updates when there is good news, before office closing time ("all staff will continue through the night" - account for timezones if necessary) and again around office opening time.
When the issue is resolved (for any definition of that word), send:
Keep any calls for blame, guilt or lynching in separate mails, preferably after some cooldown time.
Do not commit to anything during the downtime unless you are really, really sure you can deliver. Somehow two separate "bad news" situations are worse than a long one.
I prefer to use a medium where a notification is pushed on every message (mail, Twitter, ..)
The most important thing I've found both as a service provider and a service user is proactive responsibility. It's not able what you say, but when (how soon) you say it.
If you are notified that a problem happened and was fixed (or is being worked on), it's a lot better than discovering the problem yourself and trying to contact the vendor to figure out what in the world is going on. It also helps with the blame game and saves a lot of troubleshooting time (is it us or is it them?).
As far as detail goes, I find that giving a simple summary of what happened is nice unless users specifically request more information. There will be some people who always want as much detail as they can get, but most people just want things to work (even if they are highly technical).
Lastly, being able to explain what steps you have taken so that it won't happen again go a long way toward future goodwill and trust.
Without knowing much more about your particular app, how it's licensed, the field your providing services for, etc; it's impossible to answer your questions without guessing.
I prefer my vendors to provide too much information about downtimes. But many businesses just can't or they're zipped up by the lawyers. Consult your lawyer/insurance if you have any doubts.