Hi Jim,

I ran the following code:

ds <- read.csv(file="D:/Shreyasee laptop data/ASC Dataset/Subset of the ASC Dataset.csv", header=TRUE)
attach(ds)
str(dos)

I am getting the following message:

 Factor w/ 12 levels "0000-00-00","6-Aug",..: 6 6 6 6 6 6 6 6 6 6 ...
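
The str() output shows that dos was read in as a factor of text labels rather than as a Date, so the format(dos, "%Y%m") grouping from the Date-based example may not carry over directly. The following is only a minimal sketch of one way to get the percentage of missing values per month for every variable. It assumes the month labels can be used as grouping values as they stand, that a "missing" entry is either NA or an empty string, and it runs on a small made-up data frame (the values are hypothetical, not taken from the ASC dataset). The helper name is_missing is also an assumption introduced for illustration.

```r
# Hypothetical stand-in for the real data; column names follow the thread,
# the values are invented for illustration only.
ds <- data.frame(
  dos = c("May-06", "May-06", "Jun-06", "Jun-06", "Jun-06", "Jul-06"),
  patientinformation1 = c("a", "", "b", NA, "", "c"),
  patientinformation2 = c("", "", "x", "y", "z", NA),
  patientinformation3 = c("p", "q", "r", "s", "t", "u"),
  stringsAsFactors = FALSE
)

# Assumption: a value counts as missing if it is NA or an empty string.
is_missing <- function(x) is.na(x) | x == ""

vars <- c("patientinformation1", "patientinformation2", "patientinformation3")

# Keep the months in order of first appearance rather than alphabetical order.
month <- factor(ds$dos, levels = unique(ds$dos))

# One column per variable, one row per month: percentage of missing values.
pct_missing <- sapply(vars, function(v) {
  tapply(is_missing(ds[[v]]), month, function(m) 100 * mean(m))
})

print(pct_missing)
```

Because every column of ds has the same number of rows, the grouping vector and the data passed to tapply always have the same length. The resulting matrix could then be passed to a plotting function such as matplot() to draw one line per variable. Labels such as "0000-00-00" in the real data would probably need to be cleaned or excluded first.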

Thanks,
Shreyasee



On Mon, Jan 26, 2009 at 12:20 PM, jim holtman <jholt...@gmail.com> wrote:

> do:
>
> str(dos)
> str(patientinformation1)
>
> They must be the same length for the command to work: there must be a
> one-to-one match between the two vectors.
>
> On Sun, Jan 25, 2009 at 10:23 PM, Shreyasee <shreyasee.prad...@gmail.com>
> wrote:
> > Hi Jim,
> >
> > I tried the code which you provided.
> > In place of "dos" in the command
> > "pat1 <- rbinom(length(dos), 1, .5)  # generate some data"
> > I used the "patientinformation1" variable and then ran the "tapply"
> > command, but it's giving me the following error:
> >
> > Error in tapply(pat1, format(dos, "%Y%m"), function(x) sum(x == 0)) :
> >   arguments must have same length
> >
> >
> > Thanks,
> > Shreyasee
> >
> >
> >
> > On Mon, Jan 26, 2009 at 10:50 AM, jim holtman <jholt...@gmail.com> wrote:
> >>
> >> You can save the output of the tapply and then repeat it for each
> >> of the variables.  The data can be used to plot the graphs.
> >>
> >> On Sun, Jan 25, 2009 at 9:38 PM, Shreyasee <shreyasee.prad...@gmail.com>
> >> wrote:
> >> > Hi Jim,
> >> >
> >> > I need to calculate the missing values in variable "patientinformation1"
> >> > for the period of May 2006 to March 2007 and then plot the graph of the
> >> > percentage of the missing values over these months.
> >> > This has to be done for each variable.
> >> > The code which you have provided, calculates the missing values for the
> >> > months variable, am I right?
> >> > I need to calculate for all the variables for each month.
> >> >
> >> > Thanks,
> >> > Shreyasee
> >> >
> >> >
> >> > On Mon, Jan 26, 2009 at 10:29 AM, jim holtman <jholt...@gmail.com>
> >> > wrote:
> >> >>
> >> >> Here is an example of how you might approach it:
> >> >>
> >> >> > dos <- seq(as.Date('2006-05-01'), as.Date('2007-03-31'), by='1 day')
> >> >> > pat1 <- rbinom(length(dos), 1, .5)  # generate some data
> >> >> > # partition by month and then list out the number of zero values (missing)
> >> >> > tapply(pat1, format(dos, "%Y%m"), function(x) sum(x==0))
> >> >> 200605 200606 200607 200608 200609 200610 200611 200612 200701 200702 200703
> >> >>     21     22     16     18     16     15     16     17     14     16     13
> >> >> >
> >> >>
> >> >>
> >> >> On Sun, Jan 25, 2009 at 8:51 PM, Shreyasee
> >> >> <shreyasee.prad...@gmail.com>
> >> >> wrote:
> >> >> > Hi Jim,
> >> >> >
> >> >> > The dataset has 4 variables (dos, patientinformation1,
> >> >> > patientinformation2, patientinformation3).
> >> >> > The dos variable holds the months (May 2006 to March 2007) when the
> >> >> > surgeries were performed.
> >> >> > I need to calculate the percentage of missing values for each variable
> >> >> > (patientinformation1, patientinformation2, patientinformation3) for
> >> >> > each month.
> >> >> > I need a common script to calculate that for each variable.
> >> >> >
> >> >> > Thanks,
> >> >> > Shreyasee
> >> >> >
> >> >> >
> >> >> > On Mon, Jan 26, 2009 at 9:46 AM, jim holtman <jholt...@gmail.com>
> >> >> > wrote:
> >> >> >>
> >> >> >> What does your data look like?  You could use 'split' and then
> >> >> >> examine the data in each range to count the number missing.  I would
> >> >> >> need some actual data to suggest a solution.
> >> >> >>
> >> >> >> On Sun, Jan 25, 2009 at 8:30 PM, Shreyasee
> >> >> >> <shreyasee.prad...@gmail.com>
> >> >> >> wrote:
> >> >> >> > Hi,
> >> >> >> >
> >> >> >> > I have imported one dataset in R.
> >> >> >> > I want to calculate the percentage of missing values for each
> >> >> >> > month (May 2006 to March 2007) for each variable.
> >> >> >> > Just to begin with I tried the following code :
> >> >> >> >
> >> >> >> > for(i in 1:length(dos))
> >> >> >> > for(j in 1:length(patientinformation1))
> >> >> >> > if(dos[i]=="May-06" && patientinformation1[j]=="")
> >> >> >> > a <- j+1
> >> >> >> > a
> >> >> >> >
> >> >> >> > The above code was written to calculate the number of missing
> >> >> >> > values for May 2006, but I am not getting the correct results.
> >> >> >> > Can anybody help me?
> >> >> >> >
> >> >> >> > Thanks,
> >> >> >> > Shreyasee
> >> >> >> >
> >> >> >> > ______________________________________________
> >> >> >> > R-help@r-project.org mailing list
> >> >> >> > https://stat.ethz.ch/mailman/listinfo/r-help
> >> >> >> > PLEASE do read the posting guide
> >> >> >> > http://www.R-project.org/posting-guide.html
> >> >> >> > and provide commented, minimal, self-contained, reproducible code.
> >> >> >> >
> >> >> >>
> >> >> >>
> >> >> >>
> >> >> >> --
> >> >> >> Jim Holtman
> >> >> >> Cincinnati, OH
> >> >> >> +1 513 646 9390
> >> >> >>
> >> >> >> What is the problem that you are trying to solve?

