Arrow’s Theorem
By Thoreau
Since I’ve resumed work on my paper on the mathematics of voting, I want to blog about Arrow’s Theorem, one of the most important theorems in the mathematics of voting. The first thing you should know about it is that it doesn’t quite say that there’s no “perfect” voting method. “Perfect” is a word that only makes sense if you first define what it is that you’re looking for. Rather, it proves that no voting method (in a fairly wide class of methods) can simultaneously satisfy certain criteria (more on those criteria below). Also, it certainly does not say (contrary to some popular accounts) that the only “perfect” voting method is a dictatorship. The word “dictatorship” does come up in the theory, but I’ll explain below.
Also, I’m not blogging about this to advocate for or against alternative voting systems. The desirability (or undesirability) and feasibility (both technical and political) of alternative voting systems is certainly an interesting topic, but it’s not one that I feel like arguing over right now. The science of voting is interesting even just as an intellectual topic, because it is closely related to certain issues in formal logic and computer science: Given certain information, can you construct a procedure for making a decision from that information, and ensure that the procedure will satisfy certain properties?
Let’s start by defining the types of voting methods under consideration. We’ll focus on voting methods in which every voter writes down a ranking of the candidates: First choice, second choice, third choice, etc. The election outcome is then decided solely from this information. Examples of such methods include Instant Runoff, Condorcet voting, Bucklin, Borda Count, and numerous others that you can google for. Even our current voting system is in some sense equivalent to a ranked voting system, because it would be equivalent to a system where you submit your list but the people tallying the votes only pay attention to the first name on your list.
While most alternative voting systems fit that description, not all do. The most popular example of one that doesn’t use ranked lists is Approval Voting (a method that I like, actually, but we’ll leave that aside for another time). In Approval Voting, you basically write “yes” or “no” for each candidate, and the candidate with the most “yes” votes wins. This is different from a ranked voting method because you can’t infer the yeses and nos from somebody’s list. If a person put down 5 names in order, did he intend to say “yes” to the top 4? Top 3? Top 2? Top 1? Different voters could have different cutoffs. So it uses a different type of information. The nature of the information is important here, and is part of the reason why this is academically interesting: The type of information that you start form constrains some of the properties of the decision procedure.
Anyway, there are lots of possible procedures, and Arrow decided to impose 2 requirements on them: (Some people impose additional requirements, state it differently, or include some additional assumptions very explicitly, but this isn’t a formal proof here)
1) Pareto: If every single voter prefers candidate A to candidate B, then candidate B can’t win. (A might not win either, of course, if, for instance, enough people prefer C to A.) This is an easy requirement to satisfy, and just about every voting procedure that I’m aware of satisfies it.
2) Independence of Irrelevant Alternatives (IIA): Suppose that candidate C wins the election. Suppose that we then take the set of ballots and cross off some candidate (let’s call him “Ralph”) while leaving the relative rankings of the other candidates unchanged. If we then look to see who the new winner is, the winner should be the same (unless “Ralph” was the original winner). There are other ways of stating this condition, but it’s an especially simple one.
Interestingly, Arrow did not assume that all candidates are treated equally, nor did he assume that all voters are treated equally. Leaving aside the Florida and Diebold jokes, that actually makes the theorem more applicable to real situations, because there are situations like shareholder meetings where different voters carry different weight, and situations like legislative votes where some options (equivalent to candidates) require a simple majority to pass while others will need 2/3 to pass. All he assumed was that the method uses ranked ballots, and that conditions 1 and 2 are met.
Surprisingly, there is only one procedure that satisfies these requirements: A procedure where a single voter ALWAYS makes the choice. No other procedure can always satisfy 1 and 2. Say what you will about your favorite method (Instant Runoff, for instance, has a ton of fans) but if you search hard enough you’ll find a case where it fails one of those requirements, and in every case that I’m aware of it’s the IIA requirement that’s failed.
People draw a variety of conclusions from this. Some make a big deal about the fact that the only method satisfying 1 and 2 is a method where the same voter always gets his way, and conclude that Arrow was arguing for dictatorship. No, not really. He was just observing that if you impose those requirements, that’s what you’re left with. So find some other criteria. Others say “There are no good voting methods, so just go with (insert that person’s preferred method here) because they all suck, man.” No, not quite. Different methods do have different properties, some might be better than others, depending on your criteria. You just have to pick some criteria, argue for why those criteria are desirable, and then see if you can find any methods satisfying those criteria. Finally, some make a related argument and say that all alternative voting methods suck just as much as the status quo, so stick with that. Again, no. If the status quo has properties that you like, fine, argue for it. But don’t drag Arrow into it, because his critique of the other methods is just as applicable to the status quo.
Anyway, let me give a simplified proof of a weaker version of Arrow’s Theorem. Instead of imposing Pareto, I’ll impose a more limiting requirement: If there are only 2 candidates in the race, then we have a simple majority vote to pick between them. I’ll then prove that any method which reduces to a simple majority vote in the case of 2 candidates cannot satisfy IIA.
Suppose we have a case where a majority of the electorate prefers candidate A to candidate B, a majority of the electorate prefers B to C, and a majority of the electorate prefers C to A. That may sound irrational, and it is unlikely to happen if all of the voters and candidates line up along a single left-right issue axis. However, if candidates are evaluated along multiple issue axes (e.g. social and economic) and many of them (and many voters) are not on the main left-right axis, then it’s quite likely.
Anyway, let’s say C is our winner. We then start removing candidates, and (in keeping with IIA) C remains the winner. Eventually, we get to the point where only 3 candidates remain: A, B, and C. If we remove candidate A, then the final choice is between B and C, and a majority prefer B to C. So by removing A (a losing candidate) we change the outcome. This violates IIA. Switch around candidate names, say that A is your winner when you have 3 left, and then remove B, and it’s an A-C face-off, and IIA is violated again. And switch names around yet again, so that A is your winner in the 3-way race, remove B, and it’s an A-C face-off and C wins when you remove losing candidate B. So no matter how you do it, if there’s a cyclic paradox (B beats C, C beats A, A beats B) you violate IIA. This is simpler than the case Arrow did (he didn’t assume that 2-way races reduce to majority vote) but it illustrates the point.
All this is a fancy way of saying that the spoiler effect happens. Still, Arrow’s work is significant for 2 reasons. First, he proved that it’s impossible to get rid of the spoiler effect. It happens less frequently in some methods, but you can never completely get rid of it. (No, not even in Instant Runoff, but thanks for asking.) Second, more importantly, he demonstrated that voting procedures could be analyzed by very general methods, and that you could make statements that apply to all possible procedures. That’s an important thing, just as in computer science and formal logic it’s significant that we have theorems that limit wide classes of algorithms and procedures, not just methods for analyzing some particular algorithm, and then having to redo the analysis every time the algorithm is tweaked.
Anyway, that’s what Arrow’s Theorem is about.

Comment by Tony P. —
April 5, 2008 @ 3:10 am
The Rock-Paper-Scissors paradox is presumably paradoxical because no individual voter has a “cyclic preference”, and yet the electorate as a whole can. The practical question then becomes: under which system of voting is the paradox least likely to arise?
For instance, any voter will have a non-cyclic preference order, say R>P>S. There are 6 possible preference orders. If the voters are evenly split among those preference orders, then I can imagine system X falls prey to the paradox. Okay, but an even 6-way split is not guaranteed. It is less likely to arise, in real life, than a 30-20-20-15-10-5 distribution. If system X still falls prey to the paradox in that case, but system Y does not, then maybe Y is “better”.
Tangentially, I wonder whether anybody has ever considered a system of “voting” wherein voters are allowed to sell their votes, and candidates to buy them. I mean, for real money.
For instance, I would gladly sell my vote to McCain for $25K, to Hillary for $100, and I’d be willing to pay somebody else $50 to vote for Obama. Now, my price list may be extreme, but it is honest: I would feel adequately compensated for the horrors of a McCain presidency by a cash payment of $25K. If we can’t construct a foolproof system of voting, maybe the next best thing is a system whereby the winners compensate the losers.
– TP
Comment by Michael DW —
April 5, 2008 @ 12:05 pm
I’m not sure I understand why IIA is an objective that we would value.
Perhaps I’m misunderstanding it, but it seems to me like confounding IIA is the whole point of instant runoff voting.
Comment by Greg —
April 5, 2008 @ 12:18 pm
Actually, Arrow never used the term “spoiler” or “spoiler effect”. He proved it’s impossible to satisfy IIA. Whether or not you consider each irrelevant alternative a “spoiler” depends on your definition of “spoiler”, a loosely-defined term that has no consensus meaning in electoral discourse.
According to my definition — and the one I think agrees most with colloquial usage, both IRV and Condorcet eliminate spoilers. A “spoiler effect” occurs when an irrelevant alternative with weak core support causes the Condorcet winner to lose. When an irrelevant alternative has strong core support, that’s more commonly known as the “center squeeze”. The term “weak core support” can be precisely defined as a candidate that is partitioned into the minority by a majority of voters (in the sense of the mutual majority criterion).
Comment by Thoreau —
April 5, 2008 @ 1:05 pm
Greg-
You are correct, Arrow did not (to the best of my knowledge) talk about “spoilers.” I was presenting this on an informal level, and a candidate who enters a race and changes the outcome without becoming the new winner is at least akin to the “spoiler effect.” (In the post I referred to losing candidates leaving the race and changing the outcome, but it arises from the same mechanism as candidates entering the race and changing the outcome without winning.)
I will grant that not all of these candidates who enter a race and change the outcome will be candidates with weak support. In that sense they may not be “spoilers” in the way of, say, a candidate with 3% support sapping the support of somebody who had 51%, throwing the race to a candidate with 49%.
IRV, however, does not eliminate spoilers, even if we define spoilers as only those candidates with weak support. Consider a widely-supported candidate who is the first choice of only a small number but is the second choice of most. As long as he isn’t the first candidate eliminated he’ll frequently win because in subsequent rounds of the procedure he’ll pick up more votes. However, a fringe candidate could enter the race and take enough votes for this consensus candidate to go to last place. The consensus candidate is then eliminated in the first round. This is distinct from the “center squeeze” effect because what did him in was not competition from the slightly right-of-center and slightly-left-of-center candidates, but rather competition from the fringe candidate.
Michael DW-
Whether or not we should value IIA as an objective seems unimportant to me, because Arrow proved that it can’t be achieved without either accepting dictatorship (no way) or getting rid of Pareto (and I can’t even think of a system that violates Pareto but passes IIA).
And IRV doesn’t confound IIA. You can construct examples where a candidate enters the race and changes the outcome without winning. That isn’t a criticism of IRV, because just about every other voting system is prone to that problem as well.
Comment by greenish —
April 5, 2008 @ 1:37 pm
There is one implicit requirement that I think is worth mentioning: *A voting system must determine a winner*. This requirement is, to my mind, utterly bonkers. If the purpose of a voting system is to accurately condense the opinions of a populace, then it *should* end in a tie on cycles. Any system which takes a cycle and comes up with a winner is satisfying the rather different objective of putting an ass, any ass, in the seat. We shouldn’t expect that to have nice properties.
Comment by Thoreau —
April 5, 2008 @ 1:45 pm
greenish-
Well, most voting theorists find ways to relax the “must return a winner” criterion to exclude ties. However, cycles are more common than ties. Whether or not there should be a result in the case of a cycle depends more on a value judgement than on the insights of theory.
It is worth noting that if people are allowed to rank candidates equally then there are cases where you can have both a cycle and a majority preference. Say, for instance, that A beats B 48-40, B beats C 45-43, and C beats A 55-45. (Yes, I’m aware that the percentages didn’t all add up to 100%, but we’re allowing the possibility that voters submitted ballots listing 2 candidates equal.) In that case, there is a clear majority preference for C over A. There is no majority preference on any of the other questions.
Where you consider this a valid situation in which to pick a winner is a value judgement more than a matter of theory. All that the theory can really do is inform you of the consequences of choosing different procedures.
Comment by greenish —
April 5, 2008 @ 3:27 pm
Oh, I understand – I was just rendering my value judgement that systems which pick winners out of cycles are necessarily imperfect, so Arrow’s theorem shouldn’t be taken to mean “all voting systems suck, at least a little”, because one of the premises ensured that they suck, at least a little.
Comment by Dave W. —
April 5, 2008 @ 10:03 pm
I apologize for going offtopic, but my teevee appearance is on the net:
http://today.msnbc.msn.com/id/21134540/vp/23918702#23918702
btw, T. my offer to help you with the article stands. I still think a probabilistic element would help.
Comment by Greg —
April 5, 2008 @ 11:58 pm
Thoreau –
I said weak core support, and I further defined that as falling into a minority partition by a majority of voters. According to this definition, every electoral system that satisfies the mutual majority criterion, including IRV and Condorcet, eliminates all spoilers.
Comment by Nell —
April 6, 2008 @ 9:44 am
I would feel adequately compensated for the horrors of a McCain presidency by a cash payment of $25K.
You might feel that way, but you wouldn’t be.
Comment by Timothy —
April 7, 2008 @ 12:29 am
IIRC can’t some methods of Condorcet voting be shown to satisfy a weakened version of IIA? Personally, I like Condorcet voting, but it’s complicated to implement any of the types that always give you a winner.
Comment by Thoreau —
April 7, 2008 @ 12:33 am
You can weaken it to say that adding a new candidate won’t change the outcome unless the new candidate either (1) wins or (2) can at least defeat the old winner in a one-on-one contest. In that case, methods that select the Condorcet winner (when it exists) satisfy a weakened IIA.
Comment by Randolph Fritz —
April 8, 2008 @ 3:22 pm
See also Poundstone on range voting, which seems to evade the problems. But the fact of the matter is, plurality voting is one of the worst systems there is, and we’d do well to try a number of experiments with different systems, even though there is no perfect system.