On Wed, 30 Jul 2014 18:57:15 -0600, Vincent Davis wrote: > On Wed, Jul 30, 2014 at 6:28 PM, Vincent Davis > <vinc...@vincentdavis.net> wrote: > >> The real slow part seems to be >> for n in drugs: >> df[n] = >> df[['MED1','MED2','MED3','MED4','MED5']].isin([drugs[n]]).any(1) >> >> > I was wrong, this is fast, it was selecting the columns that was slow. > using > keep_col = ['PATCODE', 'PATWT', 'VDAYR', 'VMONTH', 'MED1', 'MED2', > 'MED3', 'MED4', 'MED5'] > df = df[keep_col] > > took the time down from 19sec to 2 sec.
19 seconds? I thought you said it was taking multiple minutes? -- Steven -- https://mail.python.org/mailman/listinfo/python-list