On Mar 13, 2008, at 11:03 AM, John Wang wrote:
Yes, but usually it's a good idea to add documents in batch and not
having
to reinstantiate the writer for every document and then closing it.
It would be nice if one can specify to the writer which analyzer to
use.
PerfieldAnalyzer wouldn't work because different analyzers may apply
on the
same field depending on the doc, e.g.
Also, I don't know that it is wise to put different langs in the same
field. I can't prove it definitively, but it seems to me your corpus
statistics could be skewed by terms that are spelled the same but have
different meanings across languages.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]