Map of Voting Systems

cfrank

@rob well, it had better be colorful, or else there would be no visual contrast to tell the categories apart... it isn’t going to look simple, it’s a ~19-dimensional space being compressed and projected into around 2.5 dimensions. The relative spatial positioning between systems V and W was computed as follows:

First compute the sum over all criteria of
{+1 if V and W either both pass or both fail the criterion;
{0 else.

Call this F(V,W)

Then record the relative frequency of all values of F(V,W) for each (unordered) pair (V,W). This gives a probability distribution for F(X,Y) for a pair (X,Y) chosen uniformly at random.

The distance between V and W is directly related to the relative frequency that F(X,Y) is at least as large as F(V,W). In other words, it measures how relatively dissimilar the two systems are when compared in terms of absolute criteria. Since the projection is in 2 dimensions, I took the square root of that frequency so that it would spread out reasonably in 2-d space. I had to “feel it out” to some degree, because the actual space doesn’t fit in 2 or even 3 dimensions, but the distances according to those calculations are fairly accurate.

The distance is supposed to be a measure of dissimilarity of the voting systems, relative to absolute criteria. This allows the grouping into categories to be visualized with relatively simple shapes. It's just a spatial compression of a subgraph of this table:

This kind of representation lets one quickly and systematically answer questions like, "What monotonic system is similar to IRV?" and "What Condorcet method is similar to STAR?" and “What non-Condorcet method is similar to a Condorcet method?” The answers according to this map are Bucklin and STAR, Ranked Pairs, and IRV, respectively. It also stands to reason that systems that are relatively similar across these diverse criteria are more likely to be relatively similar in other ways as well. For example, the spatial separation has naturally grouped systems with similar ballot types.

rob

@cfrank said in Map of Voting Systems:

"What Condorcet method is similar to STAR?"

Do I understand you to mean "similar in terms of which specific criteria they pass or fail"? If so, ok, but I don't think that actually means they are similar in particular meaningful ways.

I assume you saw that I posted that identical chart yesterday, but not as an example of what I'd consider a "good thing." Actually the exact opposite. It treats so many things as binaries when they, in order to be even remotely meaningful and useful, should be expressed as a matter of degree. It's showing them as black and white issues, rather than ones that lie on a spectrum. I can't overstate the wrongness of that approach.

For instance, Independence of Irrelevant Alternatives is all important, but -- using @robla's car analogy from that other thread -- I see it as being similar to crash-safety of a car: treating it as an all-or-nothing thing is silly. Neither of these cars can check the "perfectly safe" box, but there is a vast difference between them, which is completely ignored if you treat crash safety as a binary:

I also don't agree at all with the idea that Approval or Score pass IIA, at least not by any reasonable definition of the concept, since that conclusion relies on assumptions that might be charitably be called "unrealistic" (and a bit less charitably, "f*cking absurd" )....

Range voting, approval voting, and majority judgment satisfy the criterion if the voters grade or rate the candidates on an absolute scale that doesn't depend on who is in the running. Note that this means no voter can normalize their ballot, and so in a two-candidate election the majority can't vote strategically to make their preferred candidate win.

(from https://electowiki.org/wiki/Independence_of_irrelevant_alternatives)

So, maybe I'm misunderstanding the way you constructed your graphic, but since you posted that "compliance of selected single-winner methods" chart, and that chart really kind of represents all that I see wrong with this field of study... you got me a bit ranty.....

cfrank

@rob I’m not sure whether you misunderstand the way the graphic is constructed, but in any case I do generally agree with you that, individually, absolute voting theoretic criteria aren’t particularly meaningful. However, I think it’s probably mistaken to say that taking all of the criteria at once to compare voting systems is not meaningful in some way. It’s just a binary signature of the system, and even if each individual criterion only contributes some small amount of relevant information, all of them together can easily contribute a significant amount. If similar systems generally give similar signatures, and dissimilar systems generally give dissimilar signatures, it’s a reasonably proxy to allow that similarity of signatures is at least a heuristic indication that two systems are similar.

If we changed some of the categories to things like “elects the Condorcet winner at least 90% of the time when one exists” (based on some empirical data) then we would have another signature that one could argue might be more meaningful. The criteria chart was just easy to access, independent of empirical data and (mostly) independent of opinion.

To address your point about whether or not you agree with the chart on some small number of criteria, unless you disagree with more than a small handful of criteria for more than a small handful of systems, it’s generally unimportant and will not affect most of the graphic much at all.

rob

@cfrank said in Map of Voting Systems:

I do generally agree with you that, individually, absolute voting theoretic criteria aren’t particularly meaningful.

I'd understand your perspective better on this if you'd directly address the car crash analogy. The point is that the Wikipedia chart considers "non-zero chance of failure" to be the only metric, and that is highly misleading when it comes to real world systems where all you can do is reduce -- not eliminate -- the chance and severity of failures.

(and to take the analogy further, think of safety conscious people choosing to use a bicycle rather than a car, because someone made a list where bicycles were the only vehicle listed where you can't die in a "car accident." I think that's what people do when they arrive at Score based on it seeming to be less sensitive to irrelevant candidates than it actually is)

You said above that you wanted to "help newcomers navigate the arena and learn about alternative voting systems". If you are trying to show certain things to others that want to dive in much deeper, that's a bit different, but if you are aiming this at newcomers, I am concerned that you would be using the limited amount of time and cognitive effort they are willing (or able) to spend, only to result in sending them in the wrong direction.

cfrank

@rob Ok let’s do a car analogy. The criteria can be binary, such as “passes crash test A,” “passes crash test B,” “Can accelerate at this rate,” “Gets at least this gas mileage,” “Leather interior,” “No more than X miles,” “Selling for under $Y,” “Has seat warmers,” and eleven others. I think answering yes or no to all of those questions could give a pretty decent characterization of the car, it’s like 20 questions. 20 questions is very different from 1 question.

robla

I applaud @cfrank 's effort on the chart, even if I might disagree with the conclusions. As of right now, I don't entirely understand the chart, but that's okay. I don't entirely understand Arrow's, Gibbard's or Satterthwaite's theorem's either, and I've appreciated their effort for years.

I generally agree with @rob with respect to IIAC. I agree with him that it's absurd how cardinal-voting advocates try to declare victory and say that cardinal methods pass IIAC, and appreciate the example he provided in a different comment (@rob's comment
"For instance, if there are 3 candidates, and I approve Alice and Bob, but not Chris, for it to be independent of irrelevant candidate Chris, they have to assume that I would still approve both Alice and Bob if they were the only candidates."). In general, single-winner elections are asking voters to do one thing: pick a single winner. There is no way to avoid the comparative/competitive aspect of single-winner elections. Trying to escape the clutches of electoral impossibility theorems by saying that ratings of candidates are independent of the other candidates being considered is rank silliness (pun intended).

Generally, it seems wise not to think too hard about a specific impossibility theorem, and just assume that any good table with binary assessments of criteria are going to have "FALSE" or "NOT PASSED" or whatever for every method for at least one criterion. Election methods are graded on a curve, and each table is a different professor. Arrow's table only had four or five columns for the different criteria. Other tables have more. The tables on English Wikipedia are arbitrary and capricious (as many professors are), and seem to be rather moody with respect to pass/fail criteria. Note that when professors grade on a curve, it means that the student is in competition with other students in the class. It seems that rankings are pervasive.

Even if we agree that some tables (or some "professors") aren't very helpful, I hope we can agree that perfection is unattainable in election methods. We shouldn't spend too much time arguing about specific tables as we each try to gain personal insight on different methods. @cfrank - I suspect that your three meta-critieria ("Stable", "Simple", and "Consenual") are a bit too vague to be useful to newcomers to electoral reform. I'm not sure that there are three meta-criteria that work, but please, keep trying to find them! As we all know, it's almost impossible not to get lost in the weeds when trying to understand election methods, and a good visualization could be helpful.

cfrank

@robla thank you! I agree that the 3 meta-criteria are too vague, that’s a part of why I’m trying to map out the space of voting systems in a methodical way to see if any patterns emerge that might inform or conform to my vague intuitions. So far it seems like some of the obvious patterns have emerged, such as ballot type (with the possible exception of approval, which has mildly controversial inputs in the chart due to questionable assumptions).

rob

@cfrank said in Map of Voting Systems:

@rob Ok let’s do a car analogy. The criteria can be binary, such as “passes crash test A,” “passes crash test B,” “Can accelerate at this rate,” “Gets at least this gas mileage,” “Leather interior,” “No more than X miles,” “Selling for under $Y,” “Has seat warmers,” and eleven others. I think answering yes or no to all of those questions could give a pretty decent characterization of the car, it’s like 20 questions. 20 questions is very different from 1 question.

Well I've never seen a chart like that. NHTSA's 5-Star Safety Ratings certainly tries to distill crash safety into something more useful than a binary: https://www.nhtsa.gov/ratings
(although your mentioning multiple tests is at least better than just a single test where the only way it can pass is by a technicality such as "it isn't actually a car", which is similar to what the wikipedia chart does with giving cardinal methods a pass on IIA)

On other stuff, like "has seat warmers" or "leather interior", sure. No problem there. It would be weird, though, to try to somehow distill all those binaries down to something meaningful, rather than just putting them all out there for you to look at individually.

I would certainly not be impressed by a chart that gave binary ratings in things like "under this price point" or "gets at least this gas mileage” unless it allowed me to type in the price point or gas milage. I'd rather it just list the price and gas milage, and all the better if it allowed me to sort the list on them.

cfrank

@rob oh certainly, my list of criteria was totally haphazard. But even then I think answering many of those binary questions would generally lead you to the consideration of similar cars.

I don’t consider the criteria themselves to be necessarily meaningful other than being an organized collection of binary facts about the voting systems. If we used a different collection of facts, we would get a different map. It’s interesting to me to consider how or whether the map changes significantly when different chosen criteria are used. What I would want for a good characterization is for the map to be robust against reasonable changes in the criteria. In fact, showing that the important features of the map using these criteria are not robust to alterations in them is, to me, an argument against them being generally useful for characterization.

robla

@cfrank wrote:

some of the obvious patterns have emerged, such as ballot type

I think making the taxonomy of voting system start with "ballot type" is a mistake. Granted, a cardinal ballot can easily be converted into ranked (ordinal) ballot, but that's not the defining characteristic of an election method. It's possible to have a Condorcet-winner compliant method that uses either rated (cardinal) ballots or uses ranked ballots. Election methods should be classified by their tallying algorithm, not their user interface.

cfrank

@robla yes definitely, I agree. The separation here is not explicitly by ballot type, but the space of voting systems according to the way the graphic is constructed seems to be roughly split into two camps along an axis, where systems on one end of the axis seem to have cardinal/score-like ballots while those on the other end tend to have rank-order ballots. Somewhere in the middle of those for example is STAR, which has characteristics of both possibly due to the way the ballot is operated upon. Then there seem to be roughly two other dimensions (according to my rough embedding), and those dimensions in theory might somehow characterize the algorithms.

This specific map though is independent of how the systems are actually used, so the similarities being depicted will probably have a lot to do with the mathematical structure of the systems compared with the degree to which the similarities relate to how the systems operate in practice. As @rob is addressing, it would be more interesting to create a map using practical criteria based on empirical data rather than (and/or in addition to) absolute criteria.