Re: Threads not Improving Performance in Program

2009-03-19 Thread Ryan Rosario
On Mar 19, 10:35 am, Jean-Paul Calderone wrote: > On Thu, 19 Mar 2009 09:50:51 -0700, Ryan Rosario > wrote: > >I have a parser that needs to process 7 million files. After running > >for 2 days, it had only processed 1.5 million. I want this script to > >parse severa

Threads not Improving Performance in Program

2009-03-19 Thread Ryan Rosario
I have a parser that needs to process 7 million files. After running for 2 days, it had only processed 1.5 million. I want this script to parse several files at once by using multiple threads: one for each file currently being analyzed. My code iterates through all of the directories within a dire

Re: Trying to fix Invalid CSV File

2008-08-05 Thread Ryan Rosario
On Aug 4, 1:56 pm, Larry Bates <[EMAIL PROTECTED]> wrote: > Ryan Rosario wrote: > > On Aug 4, 8:30 am, Emile van Sebille <[EMAIL PROTECTED]> wrote: > >> John Machin wrote: > >>> On Aug 4, 6:15 pm, Ryan Rosario <[EMAIL PROTECTED]> wrote: > &g

Re: Trying to fix Invalid CSV File

2008-08-04 Thread Ryan Rosario
On Aug 4, 8:30 am, Emile van Sebille <[EMAIL PROTECTED]> wrote: > John Machin wrote: > > On Aug 4, 6:15 pm, Ryan Rosario <[EMAIL PROTECTED]> wrote: > >> On Aug 4, 1:01 am, John Machin <[EMAIL PROTECTED]> wrote: > > >>> On Aug 4, 5:49 pm, Ryan Ros

Re: Trying to fix Invalid CSV File

2008-08-04 Thread Ryan Rosario
On Aug 4, 1:01 am, John Machin <[EMAIL PROTECTED]> wrote: > On Aug 4, 5:49 pm, Ryan Rosario <[EMAIL PROTECTED]> wrote: > > > > > Thanks Emile! Works almost perfectly, but is there some way I can > > adapt this to quote fields that contain a comma in them? &g

Re: Trying to fix Invalid CSV File

2008-08-04 Thread Ryan Rosario
On Aug 3, 10:38 pm, Emile van Sebille <[EMAIL PROTECTED]> wrote: > Ryan Rosario wrote: > > I have a very large CSV file that contains double quoted fields (since > > they contain commas). Unfortunately, some of these fields also contain > > other double quotes and I

Trying to fix Invalid CSV File

2008-08-03 Thread Ryan Rosario
I have a very large CSV file that contains double quoted fields (since they contain commas). Unfortunately, some of these fields also contain other double quotes and I made the painful mistake of forgetting to escape or double the quotes inside the field: 123,"Here is some, text "and some quoted t

Reading a Directory of Emails - Problems

2007-07-24 Thread Ryan Rosario
Hi, I have a directory that contains a bunch of email messages and I would like to parse them using the email and mailbox packages. The emails were exported from Apple Mail. From what I gather, I need to use MHMailbox, but I can't get it to do anything useful and I cannot find any examples of how