On Feb 29, 2012, at 5:01 PM, Faryabi, Robert (NIH/NCI) [F] wrote:

Hi there,

Here is the scenario:

I have a measurement of some sort for two variables, I would like to figure out a rough pattern between them. Let say if the values of the first variable are low, middle, high, and extremely high, then what would be the corresponding pattern of the second variable. The idea is not to find the 2d distribution, but plot a conditional distribution of the second variable based on the binning of the the first variable and then present it in a boxplot.

I got the breakpoints for binning the first variables by a bi-modal density estimation. Now I need to bin the first variable accordingly and map them to a categorical value.

Is there an R command that does the binning?

It sounds as though you want `cut` and `table`. Whether that is the best use of the data is more questionable. Generally the categorization process removes quite a bit of the information content and may either introduce significant biases or lower power when the cuts are chosen after looking at the data or lower power when any inferential test is used. You _should_ also look at 2d density estimation as a method that is less susceptible to these distortions.

help( kde2d, package=MASS)

help( bkde2D , package=KernSmooth)

help( s.kde2d , package=ade4)

--
David Winsemius, MD
West Hartford, CT

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to