Error with long running web spider

2007-08-22 Thread Josh Volz
Hi everyone:

I have a spider that is relatively long running (somewhere between
12-24 hours).  My problem is that I keep having an issue where the
program appears to freeze.  Once this freezing happens the activity of
the program drops to zero.  No exception is thrown or caught.  The
program simply stops doing anything.  It even stops printing out its
activity to stdout.  The program itself appears to run in about 14
megs of memory.  Basically, the program looks up pages on a particular
website, and then reads the HTML of those pages, parses it (lots of
long regular expressions are used), and saves the found information to
an object (which is later translated to SQL and the SQL is written to
a file).

I've actually had this same problem with several long running Python
programs.  Any ideas?

Thanks in advance.

-- 
http://mail.python.org/mailman/listinfo/python-list


Re: Error with long running web spider

2007-08-22 Thread Josh Volz
On Aug 22, 10:58 am, Josh Volz <[EMAIL PROTECTED]> wrote:

I'm running this program on Windows XP, using Python 2.5.  I'm using
Active State Komodo IDE 4.0 as the run environment.

Thanks,
J.


> Hi everyone:
>
> I have a spider that is relatively long running (somewhere between
> 12-24 hours).  My problem is that I keep having an issue where the
> program appears to freeze.  Once this freezing happens the activity of
> the program drops to zero.  No exception is thrown or caught.  The
> program simply stops doing anything.  It even stops printing out its
> activity to stdout.  The program itself appears to run in about 14
> megs of memory.  Basically, the program looks up pages on a particular
> website, and then reads the HTML of those pages, parses it (lots of
> long regular expressions are used), and saves the found information to
> an object (which is later translated to SQL and the SQL is written to
> a file).
>
> I've actually had this same problem with several long running Python
> programs.  Any ideas?
>
> Thanks in advance.


-- 
http://mail.python.org/mailman/listinfo/python-list