On Thu, 29 Jan 2009, LuKreme wrote:

On 28-Jan-2009, at 14:43, Karsten Bräckelmann wrote:

You still should not have to split the mbox files. :)

True enough.

If sa-learn is mis-behaving on large mbox files for you, it's worth
investigating the cause. And either fix your system or sa-learn, if it
turns out to be a bug.

Any ideas for, essentially, a non-programmer?

How big is the corpus file that sa-learn is choking on? How many messages are in it? What is the largest message? Are all of the messages properly formed? Can you open that file in a MUA (e.g. PINE) and read every message successfully?

Run sa-learn against that corpus file with debugging enables and look at the last few lines before it dies. Anything suggestive? Post the debug log somewhere we can take a look at it.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  Gun Control laws cannot reduce violent crime, because gun control
  laws focus obsessively on a tool a criminal might use to commit a
  crime rather than the criminal himself and his act of violence.
-----------------------------------------------------------------------
 3 days until the 6th anniversary of the loss of STS-107 Columbia

Reply via email to