No, i didnt get that warning ("In eval(expr, envir, enclos) ... : non-integer
#successes in a binomial glm!") because i used the continuous bounded variable
weighted by the effort.
This is, my formula was something like:
gam(SER_CD ~ s(DEP)+s(SST)+s(CLA)+s(SSH)+s(WST), weights=EFFORT, data=CD01,
family=binomial(link="logit"))
Do you still think it is preferable to use a beta / gamma / quasi-poisson
distribution?
And about the negative value of the UBRE score i still can´t understand...i
dont have missing nor negative values in the variables.
Any other suggestion? Which kind of issue might be causing this problem of
getting a negative UBRE?
Thnaks once again!
-------------------------------------------------------------------------------------------------
Message: 92
Date: Fri, 22 Aug 2008 09:48:02 +0200 (CEST)
From: "Fabrizio Cipollini" <[EMAIL PROTECTED]>
Subject: Re: [R] GAM-binomial logit link
To: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain;charset=iso-8859-1
I guess safer to use the option family = quasibinomial since, with a continuous
[0,1]-response, the empirical (conditional) variance of y can significantly
differ from the corresponding theoretical binomial variance.
You can find larger references in
Papke - Wooldridge (1996), 'Journal of Applied Econometrics' (vol. 11, p.
619-632).
Hmmm... On the basis of the UBRE formula within gam{mgcv}, UBRE scores should
be nonnegative. Please inspect the values of the single elements inside the
formula for discovering possible problems.
Fabrizio Cipollini
----------------------------------------------------------------------------------------------------
Message: 54
Date: Thu, 21 Aug 2008 20:09:08 +0000
From: Monica Pisica <[EMAIL PROTECTED]>
Subject: Re: [R] GAM-binomial logit link
To: <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="iso-8859-1"
Hi,
I am not sure it is the best to use a binomial distribution for a continuous
bounded variable. A beta distribution would be more appropriate, although I
don't know how to define one for the gam() function. On the other hand beta
distribution is closely linked to the gamma distribution so maybe you can use
it to define a beta family for the gam() function.
Some info about beta distribution:
http://www.stat.purdue.edu/~jrnolan/portfolio/the_big_ten/beta.pdf
Also, I am not very sure how you did a gam using binomial family without having
your response data converted in 0 and 1. Didn't you get a warning saying that:
Warning messages: 1: In eval(expr, envir, enclos) ... : non-intege[[elided
Yahoo spam]]
Maybe you can contact the author of the mgcv package. I am curious to see his
response.
Sorry I cannot help much more,
Monica
----------------------------------------------------------------------------------------------
Dear all,
I'm using a binomial distribution with a logit link function to fit a GAM
model. I have 2 questions about it. First i am not sure if i've chosen the most
adequate distribution. I don't have presence/absence data (0/1) but I do have a
rate which values vary between 0 and 1. This means the response variable is
continuous even if within a limited interval. Should i use binomial?
Secondly, in the numerical output i get negative values of UBRE score. I would
like to know if one should consider the lowest absolute value or the lowest
real value to select the best model.
Thank you in advance for your help.
Mar
[[alternative HTML version deleted]]
______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.