[R] machine learning and horse racing

Gerard Smits Tue, 18 Sep 2007 14:05:36 -0700

Hi Stephen,

Not responding to the R memory question, but to the racing.


I worked on this many years ago and found no way of overcoming the 
19% or so paramutual take.  That being said, I suggest you take class 
into account (based on purse, type of race (maiden claiming, claiming 
$, NWxx allowance, etc).    Make sure that you are accounting for the 
size of the field.  it is much easier to win a race of 6 than 12 
horses.  A similar bias applies to the advantage of inner post 
position, if you do not account for number of entries.

Re validation, I would not build a mode on X years of data and then 
validate.  Patterns change and a model needs to be adaptive. I would 
use a hold out day, per week (randomly chosen) and then use that.

good luck in a difficult task.

Gerard

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] machine learning and horse racing

Reply via email to