Thanks a lot Daniel :)
   
  Regards,
  Maureen

Daniel Noll <[EMAIL PROTECTED]> wrote:
  maureen tanuwidjaja wrote:

> Before implementing this search engine,I have designed to build the
> index in such a way that every XML tag is converted using binary
> value,in order to reduce the size index and perhaps for faster
> searching.To illustrate:
>
> article will be converted to 0
> article/body will be converted to 0.0
> article/body/section will be converted to 0.0.0
> article/body/p will be converted to 0.0.1
> 
> Now,because of using lucene for the implementation,i wonder wheter
> such conversion will still be useful for efficiency..I wonder wheter
> inside the lucene index itself, such kind of conversion or perhaps even
> further optimization is already done in order to reduce the size index
> or for faster searching.

Indeed you don't need to do this because each field stores its name as 
an integer lookup already. See SegmentTermEnum / FieldInfos.

Daniel

-- 
Daniel Noll

Nuix Pty Ltd
Suite 79, 89 Jones St, Ultimo NSW 2007, Australia Ph: +61 2 9280 0699
Web: http://nuix.com/ Fax: +61 2 9212 6902

This message is intended only for the named recipient. If you are not
the intended recipient you are notified that disclosing, copying,
distributing or taking any action in reliance on the contents of this
message or attachment is strictly prohibited.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



 
---------------------------------
Bored stiff? Loosen up...
Download and play hundreds of games for free on Yahoo! Games.

Reply via email to