Dead Man’s Curve

In my last post, I tried really hard to focus on the message of employee reviews, and how that message shouldn’t be a surprise. I said things like: “I’m not a fan of the system…”, and “It doesn’t matter how messed up the system is…” – yet the mention of the dreaded curve of the MS review system was still the focal point of some of the comments, and of many private emails. I had always planned to follow the Three Surprises post with my view of the curve-based review system, but I’ve gone back and forth several times in the last day. If you’ve skimmed ahead, you probably figured out that I decided to go ahead and share my thoughts. There’s probably nothing new here, but I think there’s something that both MS and non-MS employees can learn from.

As I also mentioned in my last post, I’m not particularly fond of the curve, but I consider it the tax I pay in order to get paid extremely well to do a job I enjoy. The most recent version of the review curve is the third variation I’ve experienced at Microsoft and is only 18 months old. All versions have their pros and cons, and over beers, I will gladly share the full story and dozens of anecdotes. For this forum, however, I’ll keep things (relatively) short and focus on the latest revision.

The most positive aspects of the curve are its simplicity and transparency. There’s little mystery to the review system – peer groups (people in the same level band) are stack ranked from 1 to n, and then the ranking is broken into rating groups based on known percentages. The bonus percentages for each score and level band are also known, so there are few unknowns in the system (other than the surprises I mentioned in the last post). It’s easy to apply and easy to manage at all levels of the organization.
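
For the curious, here’s a minimal sketch of how this kind of curve-based bucketing works in principle. This is not Microsoft’s actual implementation – the function name, bucket percentages, and structure are all my own invention, just enough to show the mechanics of “stack rank, then cut into buckets”:

```python
def apply_curve(ranked_names, bucket_percentages):
    """Split a stack-ranked peer group (best first) into rating buckets.

    bucket_percentages maps rating -> fraction of the group, e.g.
    {1: 0.20, 2: 0.20, 3: 0.40, 4: 0.13, 5: 0.07}. These numbers are
    hypothetical placeholders, not real figures.
    """
    ratings = {}
    start = 0
    n = len(ranked_names)
    for rating, fraction in sorted(bucket_percentages.items()):
        # Cut the next slice of the ranked list into this rating bucket.
        end = min(n, start + round(n * fraction))
        for name in ranked_names[start:end]:
            ratings[name] = rating
        start = end
    # Any rounding leftovers fall into the lowest bucket.
    lowest = max(bucket_percentages)
    for name in ranked_names[start:]:
        ratings[name] = lowest
    return ratings
```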

On the negative side (from my view at least), there are just a few things worth pointing out. Microsoft values differentiation of employees, yet in practice, once the lines are drawn to determine review ratings, everyone in each rating group receives the exact same reward. In a rating system of 1-5, this means that the person who just missed getting a “1” gets the exact same rewards as the person who barely squeaked out of a “3” to get a “2”. As I mentioned above, this keeps the system simple, but it can also send some difficult messages. Still, since the goal is to differentiate employee rewards, this aspect of the curve works as promised.
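
To make that boundary effect concrete, here’s a hypothetical run of the sketch above. The bonus rates are invented for illustration; the point is only that adjacent ranks on either side of a cut line feel very different, while everyone inside a bucket is treated identically:

```python
# Hypothetical peer group of ten, already stack ranked best-to-worst.
group = [f"emp{i}" for i in range(1, 11)]
ratings = apply_curve(group, {1: 0.20, 2: 0.20, 3: 0.40, 4: 0.10, 5: 0.10})

# Invented bonus rates per rating (not real figures).
bonus = {1: 0.20, 2: 0.10, 3: 0.05, 4: 0.0, 5: 0.0}

# emp3 just missed the "1" bucket and emp4 barely escaped the "3" bucket,
# yet both are rated 2 and receive the identical 10% bonus.
assert ratings["emp3"] == ratings["emp4"] == 2
print(bonus[ratings["emp3"]], bonus[ratings["emp4"]])  # 0.1 0.1
```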

The curve is also a big challenge for a ‘superstar’ team. Because the curve applies within the team (or, at higher levels, across a division), it’s possible to get a lower review score just by being on a good team. Even worse, the opposite is true: one can nearly guarantee a good review score by finding a dysfunctional team and stepping in as the superstar.

And that leads to the part I really worry about – the side effects of the curve and competition. Behaviors visibly change as calibration season approaches. Some people begin to suck up in an attempt to play the “review game”, while others refuse to take risks they would happily take at any other time of the year. A co-worker was commenting recently on strange behavior in a meeting when I asked him, “would the conversation have been different at a different time of the year?” (the answer was a resounding yes). I am a huge believer in collaboration (I once had a manager who said collaboration was my super-power). The problem is that collaboration and competition don’t work well together. It’s in my best interest (as far as rewards go) to do everything I can to make sure all of my peers fail – and they, likewise, should feel the same about me. Given the system, that could indeed happen, but fortunately most teams are able to rise above it – despite the potential hit on our own rewards and career growth. Alfie Kohn has written some fantastic books on this subject – including two I keep on my desk this time of year: Punished by Rewards: The Trouble with Gold Stars, Incentive Plans, A’s, Praise, and Other Bribes, and No Contest: The Case Against Competition. To me, Kohn’s work says a lot about review systems in general. Combine these with Dan Pink’s work on motivation, and you can form a pretty good picture of review systems for yourself.

Psychologically, as you can imagine, it’s a mess. Once the posturing is over and the calibration happens, news of the actual review scores slowly leaks out – casting a zombie-like depression over a fair share of employees. After a few months, the fog lifts, and it’s back to business as usual – for another 8-10 months or so.

I realize that, by sharing these thoughts, I’m opening the floodgates for criticism and complaints about the system. I live with this system in order to get to do what I do. If I ever get to a point where my job isn’t the best possible job I can imagine for myself, or if a review system conflicts with what I get to do, my story will change. Until then, it’s definitely a price I can pay.

Comments

  1. I have always thought that the curve and the internal interview policy were designed to distribute talent across the company and not end up with pockets of ‘superstar’ teams and ‘dysfunctional’ teams. From the company’s perspective, it is probably better to have the superstars spread across different parts of the organization so the company ends up with multiple good teams rather than a single great team.

    Of course, we can question the wisdom of such thinking but at least I tend to rationalize the curve to myself this way 🙂

  2. In general, the review model appears to have been designed by a classical economic theorist. It assumes its participants are purely rational beings consistently acting in their own self-interest. It overvalues the role of competition and undervalues the irrational, emotional decisions people actually make.

    The problem I see with the “distribute talent around the company” theory is that there is massive friction in your way if you score in the low buckets of the curve. There is an implicit assumption in the model (which has consistently proven false in my experience) that scoring low on one team means you will score low on another.

    People seem to prefer to follow the path with least friction.

    So, if you score high, you are motivated to stay. The team loves you, right?
    If you score low, you are also motivated to stay (new teams don’t want you, so you are effectively forced to stay or leave the company).

    That leaves the middle bucket, the 3. If you view yourself as a superstar and get this message, you might choose to leave. More likely, though, you’ll stay. Most people like the folks they work with, and I think most nowadays prefer collaboration, as does Alan.

    People do move around a lot, but I find it’s not due to reviews. It’s usually the people the individual is working with who incent the move.

  3. I have to review my employees twice a year, and I have the same complaints. But I still can’t think of a better way to decide on bonuses and compensation while keeping them more-or-less uniform across a company of tens of thousands. It can work, though, if the company has a few hundred employees, so one HR team can manage everyone personally.
    The same applies to promotions, and to matching expected performance to title.

  4. Very nicely written post, Alan. I absolutely agree with your main theme (if I may paraphrase): stack ranking is bad for employee morale, discourages collaboration, and inhibits innovation, but it’s a price MS employees pay.

    One side effect of stack ranking that I haven’t seen mentioned at all is the policy’s possible effect on bringing in top new talent to MS. I heard the following story and I am confident it actually happened. At an event a few weeks ago with 28 Microsoft technical people (developers, testers, program managers, and a couple of other categories), an impromptu straw poll asked, “If the son or daughter of a close friend just graduated from college with a degree in computer science, would you recommend that they apply to Microsoft for a job?”

    The results were stunning to me — zero of the 28 MS technical employees would recommend applying to MS. Follow-up conversations revealed that in all 28 cases, the MS stack-ranking policy was the reason for the no vote.

    I’ve worked with HR people for many years. In general, most don’t understand technical talent, and don’t truly understand how any technical company’s success depends on the talent of its technical employees. I think the reason is that the difference between the best and worst HR employee is nowhere near as large as the delta between average and good-to-great technical talent (there is quite a bit of research on this).

    So, if stack-ranking at a company waters down the quality level of the pool of job applicants, then the stack-ranking policy has a very serious, very negative, long-term side effect.

    I hope that Microsoft HR has carefully weighed the benefits of using stack-ranking against all the negative side effects.
