On 1 Nov 2005, at 11:02, Malcolm wrote:
Hi, I've been reading my new project bible 'Lucene in Action'
Amen! ;)
about Analysis in Chapter 4 and wondered what others are doing for indexing XML(if anyone else is, that is!). Are you folks just writing your own or utilising the current Lucene analysis libraries?
Analyzers are at a per-field granularity, and more than likely your XML data contains what you would want treated as multiple fields. So while an analyzer _could_ directly deal with XML, it really is unlikely to be the appropriate layer to do so. The majority of scenarios would have XML parsed separately and then the individual separated text fed to Lucene fields for analysis.
Erik --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]