On Mar 13, 2008, at 11:03 AM, John Wang wrote:

Yes, but usually it's a good idea to add documents in batch and not having
to reinstantiate the writer for every document and then closing it.

It would be nice if one can specify to the writer which analyzer to use.

PerfieldAnalyzer wouldn't work because different analyzers may apply on the
same field depending on the doc, e.g.


Also, I don't know that it is wise to put different langs in the same field. I can't prove it definitively, but it seems to me your corpus statistics could be skewed by terms that are spelled the same but have different meanings across languages.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to