The Etz-Files

2015-04-15T07:07:08-05:00

Dear Alex Etz,

thank you for this informative post. I think it provides a clear basic introduction to the logic of Bayes-Factors.
The likelihood function plots are particularly informative. What should a researcher conclude from the likelihood plot for 300 heads with 500 coin flips is particularly informative because it has the largest sample size. A simple visual inspections shows that there is very little empirical support for either one of the two a priori hypothesis (p = .50 & p = .75). Moreover, BF comparisons of the a priori hypothesis with the observed data favor the observed data (.60) over the a priori hypothesis .50 and .75. In short the data favor .60 over the a priori hypothesis .50, which is the null-hypothesis that the odds for heads and tail are equal (.50 – .50 = 0).

A traditional significance test would have shown that the probability of obtaining 300 out of 500 heads for a fair coin (a priori hypothesis .50) is p < .00001. It is therefore very safe to reject the null-hypothesis and to conclude that the coin is not fair.

The standard Bayesian argument against the use of p-values in this scenario is that we do not know how the 500 trials were conducted and that the researcher may have capitalized on chance by stopping whenever the result was significant. But how would be explain that the p-value is well below any significance criterion like p < .05. This is inconsistent with optional stopping. Moreover, you can run simulations to see how often optional stopping would produce a result like 300 out of 500 heads. Optional stopping simply does not have the power to produce such a strong effect.

The BF makes things murky because it also tests an alternative hypothesis, p = .75. Where does this hypothesis come from. Why not p = .55, .60, or .95 as alternative. The arbitrary choice of an alternative is the key weakness of BF. They are internally consistent, but they do not tell us what most researchers want to know. Is the coin fair? Not, is the coin more likely to be fair than to be biased by a certain amount.

Take quality control in a casino as an example. The casino makes money from a fair roulette table (equal odds for red and black, take all the money for green). One casino uses frequentists statistics to test whether a table is fair. After 500 games they discover that red occurred 300 times. They remove the table because savy gamblers started capitalizing on the good odds for red on this table. The Bayesian casino tests p = .50 against p = .75. As the actual odds are .60, the BF favors the hypothesis that the table is fair over the hypothesis that the table is biased (increased odds of 50:50 to 75:25) and they keep the table in play. They bayesian casino loses money because frequentists gamblers have figured out that the table gives them better odds of winning.

Frequentists win when Bayesians compute BF with bad priors. So, the trick is to have good priors. Welcome to the world of unknowns. Bayesian statistics is a religion that gives a false promise of certainty to believers in a world of uncertainty.

Sincerely, Dr. R

	## Plots the likelihood function for the data obtained
	## h = number of successes (heads), n = number of trials (flips),
	## p1 = prob of success (head) on H1, p2 = prob of success (head) on H2
	## Returns the likelihood ratio for p1 over p2. The default values are the ones used in the blog post
	LR <- function(h,n,p1=.5,p2=.75){
	L1 <- dbinom(h,n,p1)/dbinom(h,n,h/n) ## Likelihood for p1, standardized vs the MLE
	L2 <- dbinom(h,n,p2)/dbinom(h,n,h/n) ## Likelihood for p2, standardized vs the MLE
	Ratio <- dbinom(h,n,p1)/dbinom(h,n,p2) ## Likelihood ratio for p1 vs p2
	curve((dbinom(h,n,x)/max(dbinom(h,n,x))), xlim = c(0,1), ylab = "Likelihood",xlab = "Probability of heads",las=1,
	main = "Likelihood function for coin flips", lwd = 3)
	points(p1, L1, cex = 2, pch = 21, bg = "cyan")
	points(p2, L2, cex = 2, pch = 21, bg = "cyan")
	lines(c(p1, p2), c(L1, L1), lwd = 3, lty = 2, col = "cyan")
	lines(c(p2, p2), c(L1, L2), lwd = 3, lty = 2, col = "cyan")
	abline(v = h/n, lty = 5, lwd = 1, col = "grey73")
	return(Ratio) ## Returns the likelihood ratio for p1 vs p2
	}

What is likelihood?

The Likelihood Axiom

Likelihoods are meaningless in isolation

Looking at likelihoods

Connecting likelihood ratios to Bayes factors

R Code

References

Rate this:

Share this:

Related

25 thoughts on “Understanding Bayes: A Look at the Likelihood”

Leave a reply to If you did not already know | Data Analytics & R Cancel reply