I. Introduction to Frequentist Statistics
In a few earlier, backhanded comments I mentioned that much of the mathematics that undergirds modern statistics derives from Continental haute couture’s obsession with so-called “Games of Chance” - i.e. gambling on, or ruminating on, things like dice throws, or coin flips, or card games, or… drawing small, colored balls or marbles from giant urns. In what follows I will attempt to demonstrate that the current crop of inferential statistics1 that serve as both (1) necessary characteristics of what is considered #peer-reviewed-science and (2) determinant of which “studies” are used to “prove” X or Y effect - and more importantly, desired social policy position - are not just the completely wrong tools for science, but illogical for that purpose even on their own terms.
We begin first, however, with two critical questions: what is probability and what precisely does it measure? In the answers to these two questions we will see what’s been previously discussed as a philosophical schism between Laplacians and Humians manifest as a mathematical schism between the followers of Thomas Bayes (Bayesians) versus the Frequentists, which generally includes those using the stats of Ronald Fisher, or Neyman & Pearson, as well as the unholy hybrid of both, the NHST - null hypothesis significance testing.2 A person’s answer to the above two questions about probability tells us, from a mathematical perspective, whether one is a Bayesian or a Frequentist - but these two categories each carry with them a cartload of associated philosophical “baggage”.3 One’s sense of the Universe and how it is organized - or disorganized, depending upon your view - hinges upon one’s view of the mathematics that helps to validate Modern Science vs. the one that currently in use by “The Science.”4 We’re going to spend a good bit of time on this subject from a variety of perspectives because, to quote from a book simply by its name, “Ideas Have Consequences.” A culture with bad ideas about science,5 like Russia and China both had vis a vis agriculture in the 20th century, will eventually reify those scientistic ideas into law - i.e. dogma. And the consequences of ideas that conflict with Reality will come washing up on a culture’s shores, in worst cases, like a tsunami - a tidal wave of consequence from an unseen, underwater seismic event.
II. Games of Chance?
Card games seem to be a near universal human preoccupation, so we’ll skip their history. A standard deck of cards has 52 cards split between four different sets (“suits”) of cards numbered 2 through 10, and then Jack, Queen, King, and an Ace. There are seemingly endless games with equally various rules that give different values and importance to suits and cards and how they interact. Games can be played alone (solitaire, anyone?), but the most fun - and the ones involving the most money and aggravation - usually involve multiple players playing a “hand” of cards they are dealt and that are invisible - kept hidden from - the other players. In other words, one of the critical parts of card games is that there is an element of uncertainty because players are “blind” to what’s in the other players’ hands. This is true when playing card games with a teammate (partner) like bridge, or even when playing against the dealer with 5 or 6 other players, as in Blackjack (“21”), or when playing community cards with two “in the hole” like Texas Hold ‘Em. Said another way, card games depend upon knowledge asymmetry, but the underlying, and unspoken, presupposition is that the knowledge asymmetry is equal for all of the players.
Part of the draw6 of card games, however, particularly for games that use only a single deck, is that the 4 suits of 13 identically numbered cards and the relatively low number of cards - 52 - means that a person of moderate intelligence can work to overcome the knowledge asymmetry as the game goes along and more information is revealed. The “odds” of what other cards could be “out there” - either in the deck yet to be dealt, or in your opponent’s hands - changes dynamically as the game goes on. Concrete examples work best, but let’s start simply.
You are sitting across the table from the dealer and he asks you - before dealing a single card - “what are the odds that after a thorough shuffle of the cards that I deal you an Ace on the first card?” Simple, yes? There are 4 aces in a deck of 52 cards, so the “odds” (theoretically, anyway) of pulling an Ace are 4 in 52, or 1 in 13… right?
Basic (orthodox) statistics says that the Probability (Pr) of an event (r) is the ratio of favorable outcomes divided by all possibilities and, thus, it is always a number ranging from 0 and 1 - with zero representing impossibility and 1 representing certainty.
Pr = Successful Outcomes/Total Possibilities
In mathematical notation this is written Pr = f/N where f = the frequency of successful outcomes and N = total possible outcomes.
We’re looking for an Ace, there are four possible Aces, and the total of all cards we could draw is 52, thus PA = 4/52, which reduces to 1/13 = 0.07692. Converted to a percentage this means that there is ~ a 7.7% chance of pulling an Ace on the first card he deals you… right? Hold that thought for a moment while we do something a little more complicated just to introduce the product rule.
Statistics also says that the event of two events occurring is the product of their individual Probabilities. That is to say, if we’re trying to find the probability that two events, A and B, occurring, we multiply their individual Probabilities together.
P(A and B) = P(A) × P(B)
For example, the odds of rolling a 6 (or any number 1-6) on a normal7 die is 1 chance in 6, or 1/6. The chances of rolling doubles, therefore, double sixes, or double threes, etc., is P(A) = 1/6 x P(B) = 1/6 = P(A) x P(B) = 1/36. Okay, so far so good. Let’s go back to cards for a little more complexity.
Suppose you wanted to know the Probability that the dealer deals you not one, but two(!) Aces - a pair of bullets - on the first two cards. Product rule says it’s to multiply P(A) x P(B), BUT we have to remember that the Probability of getting the second Ace changes after he deals you the first one. We know that the probability of one ace - getting the first one - is P(A) = 4/52 = 1/13, but the odds of getting the second one, P(B) are now different because you’ve already got one Ace and there is one less card in the deck. So, the P(B) is actually 3/51, or 1/17, thus making the Probability of getting dealt two Aces on the first two cards P(A) x (P(B) = 1/13 x 1/17 = 1/221 = Or there is a .4525% chance of getting Aces dealt back-to-back… right? For simplicity’s sake, I’m leaving out how the other players’ cards being dealt around you, as well as the rules of the game, i.e. what cards you need to win, radically alters all of these calculations. For the moment, let’s just ignore those.
Now make sure to watch this short ~2 min video all the way to the end. Pay attention to the reveal.
Jason Ladanye is what is known in the biz as a card mechanic. (I love that term). It takes a long time to get that good, but tracking and manipulating cards is a skill that can be learned, as all sleight of hand and various prestidigitation can be learned over time. But wait a second, Dale, just what the hell do card cheats have to do with statistics?8
What I’m trying to do is slip something into your subconscious mind about the assumptions that underlie even something as (seemingly) straightforward and mundane as the mathematical probability of a particular card being drawn from a deck after a “fair” shuffle. What does it really mean for a shuffle, or a hand of cards in poker, or the deal in a game of casino blackjack to be “fair”? If you’re playing with this guy in the video above, the probability of you getting one specific card - say the Ace of diamonds - isn’t 1 in 52, nor is the probability of getting any Ace (of any suit) 4 in 52. The probability is, instead, entirely at his discretion when he’s dealing. If he doesn’t want you to have the Ace of diamonds, you’ll never get it.
Pr in reality equals 0.
III. Life as the Limit of Infinite Flips?
Let’s go simpler - let’s talk about the odds of a coin coming up heads or tails on a flip. And no two-sided, trick coins or magicians/con-men this time. Aren’t all coin tosses “50-50” propositions - “tossups?” As a matter of modern parlance, the words and phrases around a “coin toss” have (in fact) come to signify a perfectly uncertain state of affairs:
“Is it going to rain later?”
“I don’t know… probably 50-50. It’s a tossup.”
This is what a modern might even call common sense. But is it? Are all coin tosses 50-50 propositions? Is any toss of a coin equally likely - equally probable - to be heads as it is to be tails?
You don’t have to watch all of it.9 Forgetting the coin flipper for a moment, it’s a good excuse to introduce you to Prof. William Briggs, author of “Uncertainty,” who discusses some of these same subjects from a more mathematical bent at his Substack.10 As soon as you see the little machine that the good Professor has built, if you had the thought, “wait, that’s not fair…” that is a view of the world through a Frequentist lens.
Much of what falls under the heading of frequentist statistics is the result of the work of Francis Galton, Ronald Fisher, and Karl Pearson. Aubrey Clayton traces some of the mathematical discoveries and currents of thought in the mid-1800s that set the stage for the “Frequentist Jihad” that would successfully dominate in English academe.11 Fisher and Pearson (and others, like John Venn) were joined by Karl Popper in denying (and even ridiculing) the Laplacian view of probability as a quantitative measure of our state of knowledge about the world, and instead they insisted that probability inheres in objects or systems: that probability is a property of an object, like mass, or color, or of populations - creating the fiction of the “average man.” This is the view that also insists that coins have a certain 50-50ness or that dice have a one-in-six-ness. To be fair, these beliefs are not completely arbitrary and have both mathematical and a certain intuitive appeal to them.
For example, if you were to “roll” two “fair” dice - and we will allow the circular requirement that a “fair” dice is one weighted such that the weight is uniformly distributed no matter which side is facing up - over and over again, thousands upon thousands of times, where a legitimate “roll” consists of throwing the dice through the air onto a completely flat, level piece of felt, and the dice must travel and hit both the felt table and then a back felt wall at least 5 feet from you, and then come to rest on the felt table, it does seem to comport with our sense of how things work that over a sufficient number of rolls you would eventually see a distribution of die totals between 2 and 12 that comports with the distribution derived from the mathematical probabilities of each one. Which is to say, that you would find empirically that the probability of rolling a 2 (or a 12), is ~1/36, and the probability of rolling a 11 (or 3), is 1 in 18, over time and that this can be described by a limit or sum function such that it all seems to work out perfectly. The same seems to apply with even more force for the 1 in 2-ness of a coin being flipped onto a table.
But all of it is an illusion. First, there is no world in which anyone has ever flipped a coin or rolled dice an infinite number of times; and so the use of mathematical functions that rely upon such constructs should at least suggest that we’re not dealing with anything empirical.
More importantly, it only takes a moment to recognize that what is going on in “games” of “chance” - including cards and coin flips and dice - is not “random chance” at all, but very carefully controlled and intentionally enforced ignorance, and knowledge asymmetry, and that is how the public loses its fucking money in casinos to the tune of billions each year.
What do I mean? Let me change the requirements for a roll by just one - instead of me throwing the dice each time, I’m going to defer to my little mechanical dice-roller, the CasinoCrusher2000, who tosses the dice with exactly the same NewtonMeters of force, angular momentum, and loft - every. single. time. forever.
Do you now have the same confidence that over infinite rolls that the numbers will have the expected statistical spread? Gulp. Briggs already showed you above you can do the same thing for coin flipping and very quickly turn a coin flip from a 50-50 proposition to something more like 85-15 in favor of whichever you want if you use a device that flips with even a mechanically calibrated amount of force each time.
IV. Who are You Calling Random?
Coin flips and dice throws aren’t random events at all. Note to Statisticians and Jurists: Coins and dice (and playing cards) follow the same laws of physics as rocket ships, bullets, catapults, helicopters, and tennis balls! Coins and dice and cards are relatively simple systems involving small amounts of force, very little wind or other environmental concerns (like ocean tides). The (primary) thing that makes them seem “random” is us - we completely ignore our own presence - the very part that makes it random each time. If your thumb had a little gauge on it that each time gave you an exact readout of how many units of force your coinflip was generating? It wouldn’t take you more than a few weeks to be able to figure out how to flip a coin such that it landed whichever side up you wanted in your other hand. Maybe not 100%, but you would suddenly, magically have destroyed your coin’s 50-50ness and your dice’s 1-in-6ness.
Do you hear me? Table games are all set up so that the House eventually wins. It’s a con; a very sophisticated looking three card monte. It’s Jason Ledanye manipulating and tracking cards, but you can’t see it and think that it’s a fair deal every time. You have no idea why he always has a pocket pair better than yours. You can’t see yourself getting cheated; and Vegas also does it actuarially by its payouts being conservatively lower than what the odds are and cost to bet given the enforced conditions of ignorance.
Briggs has related in a private conversation (and I haven’t checked, but I’ll take his word), but it is a crime to measure or attempt to measure any of the physical characteristics of these systems, like the felt on the table, or to use a machine to throw the dice, etc. Which is to say, games of “chance” are games where you’re specifically forbidden from using any of the tools of physics or science to improve your chances of winning - i.e. you also can’t count cards.
The “randomness” of the outcomes is reinforced by your unknown amount of force, or not being allowed (or able) to count six deck blackjack, or the requirement for the dice to knock around off of a wall. That is what creates the “randomness” of these games. If instead you used a machine throwing into beach sand, you can imagine that it could get sixes every time because of the consistency of the “roll” and lack of bounce of the dice. Same for coin flips.
All of the above are measures intentionally taken by Vegas to make these games have outcomes - or probabilities or odds - that match these idealized statistically consistent numbers, so that you can bet your ass (or your house) that the dice on the tables in Vegas do all have 1-in-6ness, and every coin flipped has 50-50ness because Vegas ultimately secures its money the way insurance actuaries do: Vegas knows what “the odds” - i.e. the probabilities are - and it pays out conservatively lower than what the chances. For example, you can look up roulette on Wikipedia and find that in either the 37 slot wheel (European) or the 38 slot (American) wheel, the probability of getting the single correct slot is 1 in 37, or 1/38, but if you win, they only pay 35 to 1. The “House edge” or “expected value” is 2.7% for a Euro (respectable con-man) wheel and 5.26% for a (greedy capitalist con-man) American wheel. Over the long run, the House wins all around, though there will be occasions where statistical anomalies occur.
All of which is to say that the mathematics used by frequentists statistics is derived not from “games of chance,” but from rigged games, where artificial conditions are enforced in order to ensure that a “fair result” is the long-term outcome. That hardly seems to be a good mathematical basis for judging what is and isn’t “science.” Or viewing life through the lens of frequency distributions.
Next time we’ll look at this in a different context - medical testing.
“Cookbooks” in Jaynes’s vocabulary.
I acknowledge I am cutting out a breathtaking amount of mathematical intellectual history. For a comprehensive treatment of this subject, the best book is Aubrey Clayton’s “Bernoulli’s Fallacy: Statistical Illogic and the Crisis of Modern Science,” Columbia Univ. Press, 2021. My audience and writing is, however, lawyer-slanted, and if my own experience is indicative, besides not being able to pass O-Chem, law students and lawyers have an almost pathological fear of working in the quantitative realm… except for billing hours, of course. In that one domain they do seem to find the necessary mental focus and energy to manipulate decimal points, work in base 60, conduct long division, multiplication, and solve rudimentary linear algebra problems without pencil or paper.
“Priors” is what Bayesians would call all of that.
We will cover some small parts of that third rail of #FormerCurrentThings because the statistics used by pharma is Frequentist statistics.
Or social and political organization, or economics, or human rights, or energy policy, or liberty…
HOHO! Now that’s a good pun right there.
None of those weird, Dungeons & Dragons dice, with 4, 12, and 20 sides, okay?
I met a guy years ago at a friend’s St Patty’s Day party in South Boston - somebody’s brother-in-law who served in the navy in the late ‘60s - and he could do card tricks like you see people do on TV. He was that good. When he handled a deck of cards, it was evident from the moment he picked them up that he could manipulate cards without you ever having any idea. You couldn’t possibly play cards with him and if he won not suspect him of having cheated. I asked him about that and he laughed. He said that once he got that good, when he was still in the Navy, when he played cards for money he would always pass the deal. Always. That was my first realization that card games rely upon the fact that either the player-dealers are all equally inept at handling and shuffling cards, or that an independent, inept dealer is shuffling and dealing the cards.
…But you absolutely should watch the whole thing!
The second reason is because much of the foundational philosophical background predicate that I’m working off of comes from the first 4 chapters of “Uncertainty.” Every lawyer and scientist should read and understand it. Chapters 5 and higher require more technical work.
Clayton, Bernouilli’s Fallacy, pp. 122-126, and Ch. 4, “The Frequentist Jihad,” pp. 131-161.
Thanks, Dale! I'll show my dad.