On Sun, 22 Jul 2007 05:34:17 -0300, "Gabriel Genellina" <[EMAIL PROTECTED]> wrote: >Try to avoid using ".*" and ".+" (even the non greedy forms); in this >case, I think you want the scan to stop when it reaches the ending </span> >or any other tag, so use: [^<]* instead. > >BTW, better to use a raw string to represent the pattern: pattern = >r"...\d+..."
Thanks everyone for the help. It did improve things significantly :-) -- http://mail.python.org/mailman/listinfo/python-list