> On 14 Dec 2014, at 13:54 , Stefan Evert <stefa...@collocations.de> wrote:
> 
<snip>
> 
>> (3) What is people's view on computing the two-tailed test like this,
>> which leads to an ns result unlike binom.test?
>> 2*sum(dbinom(51:235, 235, 1/6)) # 0.05308849
> 
> This is a popular approximation (which I also use most of the time) because 
> it's much less expensive (in computational terms) than computing an exact 
> (likelihood-based) two-tailed p-value as binom.test() does.  This is 
> particularly relevant if you want to compute confidence intervals for the 
> true probability p based on a large sample, which takes ages with 
> binom.test().

When I get drilled about this, I usually say that one really shouldn't use 
"two-tailed" and "exact" in the same sentence, because of the issue with the 
definition of tails. I don't agree that the version in binom.test is in any 
sense _the_ correct one and we probably should make alternatives optional at 
some point. 

One point that is easily overlooked (guilty!) is that defining the p-value as 
the sum over less probable outcomes is _not_ a likelihood theory technique. The 
likelihood ratio test should have a denominator equal to the maximum 
probability of the outcome when the parameter is allowed to vary from the null 
value. It is not that hard to do the actual LRT:

> LRT <-  -2*log(dbinom(0:235,235,1/6)/dbinom(0:235,235,(0:235)/235))
> dist_null <- dbinom(0:235, 235, 1/6)
> sum(dist_null[LRT >= LRT_obs])
[1] 0.05373588 

I believe there are four reasonable contenders for the two sided p-value:

1) sum of probabilities of less or equally probable outcomes
2) sum of probabilities of outcomes with more extreme LRT
3) double minimum one-tailed p
4) tail-balancing: one-sided p plus the max opposite tail probability less than 
p

-- 
Peter Dalgaard, Professor,
Center for Statistics, Copenhagen Business School
Solbjerg Plads 3, 2000 Frederiksberg, Denmark
Phone: (+45)38153501
Email: pd....@cbs.dk  Priv: pda...@gmail.com

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to