I am not convinced about tab. I hate tab, it only causes problems.
On May 1, 12:29 am, Jonathan Lundell <jlund...@pobox.com> wrote: > On Apr 25, 2011, at 7:46 PM, Massimo Di Pierro wrote: > > > > > I agree. > > Perhaps tab should be retained as well. > > > > > > > > > > > On Apr 25, 8:59 pm, Jonathan Lundell <jlund...@pobox.com> wrote: > >> On Apr 25, 2011, at 6:49 PM, Massimo Di Pierro wrote: > > >>> True. It should do what the book says. > > >> Except that we should probably change the definition to exclude 127, seems > >> to me. > > >>> On Apr 25, 6:43 pm, pbreit <pbreitenb...@gmail.com> wrote: > >>>> CLEANUP() seems to be removing more characters than the Book would > >>>> suggest. > > >>>> "It just removes all characters whose decimal ASCII codes are not in the > >>>> list [10, 13, 32-127]" > > >>>> However the regex is '[^ \n\w]' which I think is more like alphanumeric > >>>> plus > >>>> underscore. Is that right?