[R] Help me make faster R code for Kennard-Stone algorithm [My code is so slow from Matlab]

2013-04-24 Thread Kevin Hao
Hi all, Can you help me change my Kennard-Stone algorithm to faster one? [The original code can run fast in matlab, but when I change matlab code to R code, it is so slow.] Since my code so crude and too many loops (changed from matlab code), it is too slow. I hope that you can help to improve t

[R] Read big data (>3G ) methods ?

2013-04-26 Thread Kevin Hao
Hi all scientists, Recently, I am dealing with big data ( >3G txt or csv format ) in my desktop (windows 7 - 64 bit version), but I can not read them faster, thought I search from internet. [define colClasses for read.table, cobycol and limma packages I have use them, but it is not so fast]. Cou

Re: [R] Read big data (>3G ) methods ?

2013-04-26 Thread Kevin Hao
> And another tip here is, you can split the large file into smaller ones. > > > > On Fri, Apr 26, 2013 at 8:09 AM, Kevin Hao wrote: > >> Hi all scientists, >> >> Recently, I am dealing with big data ( >3G txt or csv format ) in my >> desktop (win

Re: [R] Read big data (>3G ) methods ?

2013-04-26 Thread Kevin Hao
0:58 AM, Ye Lin wrote: > >> Have you think of build a database then then let R read it thru that db >> instead of your desktop? >> >> >> On Fri, Apr 26, 2013 at 8:09 AM, Kevin Hao wrote: >> >>> Hi all scientists, >>> >>> Recently, I

Re: [R] Read big data (>3G ) methods ?

2013-04-26 Thread Kevin Hao
Hi Ye, Thanks. That is a good method. have any other methods instead of using database? kevin On Fri, Apr 26, 2013 at 1:58 PM, Ye Lin wrote: > Have you think of build a database then then let R read it thru that db > instead of your desktop? > > > On Fri, Apr 26, 2013 at 8:0

Re: [R] Read big data (>3G ) methods ?

2013-04-26 Thread Kevin Hao
Thank you very much. More and more methods are coming. That sounds great! Thanks, kevin On Fri, Apr 26, 2013 at 7:51 PM, Duncan Murdoch wrote: > On 13-04-26 3:00 PM, Kevin Hao wrote: > >> Hi Ye, >> >> Thanks. >> >> That is a good method. have any oth

Re: [R] Read big data (>3G ) methods ?

2013-04-27 Thread Kevin Hao
ert it to some > binary format first (RDS, ff, sqlite, bigmemory, ...). Most packages have > routines to convert CSV files to the binary format. > > With all of the above LaF helps. ffbase contains a routine laf_to_ffdf to > convert to to ff format. > > > HTH, > > Jan >

[R] need help for Imbalanced classification problems!!!

2013-05-14 Thread Kevin Hao
Hi all, I am facing the imbalanced classification problems. That means I have a dataset, in which the ratio of majority data to minority data is 100:1 (or more). In addition, the independent variables are many and this is a binary classification questions. The model I built give poor predictive

Re: [R] K Fold CrossValidation

2013-05-15 Thread Kevin Hao
Below is the some code, may be helpful for you. [maybe have the finished package which includes mlr with crossvaliation], you can check http://cran.r-project.org/web/packages/available_packages_by_date.html. you can check this "ChemometricsWithR

Re: [R] K Fold CrossValidation

2013-05-15 Thread Kevin Hao
Maybe the following code is helpful for you. At the same time, you can check the ChemometricsWithR package from http://cran.r-project.org/web/packages/available_packages_by_date.html. # product crossvalidation index crossvalind