Hi Stephen, Not responding to the R memory question, but to the racing.
I worked on this many years ago and found no way of overcoming the 19% or so paramutual take. That being said, I suggest you take class into account (based on purse, type of race (maiden claiming, claiming $, NWxx allowance, etc). Make sure that you are accounting for the size of the field. it is much easier to win a race of 6 than 12 horses. A similar bias applies to the advantage of inner post position, if you do not account for number of entries. Re validation, I would not build a mode on X years of data and then validate. Patterns change and a model needs to be adaptive. I would use a hold out day, per week (randomly chosen) and then use that. good luck in a difficult task. Gerard ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.