I tried several ways:

1.  I used the scan() function, it can read the 6GB file into the memory
without difficulty, just took some time. But just read into the memory was
definitely not enough, when I did the next step, which was to plot() and
then tried to build the nonlinear regression model, it was stucked at the
plot() part, since it has already reached the memory limit, even though I
have 64-bit version system and huge memory size.

2. I tried the bigmemory() package. It can read the dataset into the memory
as well, but since it stores the data into a matrix format, and the normal
functions such as nls(), plot()... cannot work on matrices--that is the
problem. What should I do then?

Or do I need to change to SAS? I believe there are a lot of people who are
dealing with large datasets, what did you do in this situation?

Thanks.




2010/7/24 <babyfoxlo...@sina.com>

>
> -------------- Original Message --------------
>
> You may want to look at the biglm package as another way to regression
> models on very large data sets.
>
> --
> Gregory (Greg) L. Snow Ph.D.
> Statistical Data Center
> Intermountain Healthcare
> greg.s...@imail.org
> 801.408.8111
>
>
> > -----Original Message-----
> > From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-
> > project.org] On Behalf Of babyfoxlo...@sina.com
> > Sent: Friday, July 23, 2010 10:10 AM
> > To: r-help@r-project.org
> > Subject: [R] How to deal with more than 6GB dataset using R?
> >
> >  Hi there,
> >
> > Sorry to bother those who are not interested in this problem.
> >
> > I'm dealing with a large data set, more than 6 GB file, and doing
> > regression test with those data. I was wondering are there any
> > efficient ways to read those data? Instead of just using read.table()?
> > BTW, I'm using a 64bit version desktop and a 64bit version R, and the
> > memory for the desktop is enough for me to use.
> > Thanks.
> >
> >
> > --Gin
> >
> > [[alternative HTML version deleted]]
> >
> > ______________________________________________
> > R-help@r-project.org mailing list
> > https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide http://www.R-project.org/posting-
> > guide.html
> > and provide commented, minimal, self-contained, reproducible code.
>



-- 
Best,
Jing Li

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to