Re: Searching by bit masks

2006-11-28 Thread Erick Erickson
You could store a value for each flag then be careful about what analyzers you use. For instance, using WhitespaceAnalyzer (index AND search) and doing your own casing. That is, make sure you lowercase as necessary (NOTE: operators AND, OR NOT must not be lowercased if you send them through queryp

Re: Searching by bit masks

2006-11-28 Thread Biggy
The background of this is also separating content according to domains Example: - pictureA (marked as a "joke" #flag :1) - pictureB (marked as a "adult picture" #flag: 2) Site1: Users allowed to view everything (pictureA, pictureB ) Site2: Users allowed to view everything except pictureB (no adu

Re: Searching by bit masks

2006-11-28 Thread Erick Erickson
Lucene will automatically separate tokens during index and search if you use the right analyzer. See the various classes that implement Analyzer. I don't know if you really wanted to use the numeric literals, but I wouldn't. The analyzers that do the most for you (automatically break up on spaces,

Re: Searching by bit masks

2006-11-28 Thread Biggy
OK here what i've come up with - After reading your suggestions - bit set from DB stays untouched - only one field shall be used to store interest field bits in the document: "interest". Saves disk space. - The bits shall be not be converted to readable string but added as values separated by spa

Re: Searching by bit masks

2006-11-27 Thread Daniel Noll
Erick Erickson wrote: Well, you really have the code already . From the top... 1> there's no good way to support searching bitfields If you wanted, you could probably store it as a small integer and then search on it, but that's waaay too complicated than you want. 2> Add the fields like you

Re: Searching by bit masks

2006-11-27 Thread Erick Erickson
Well, you really have the code already . From the top... 1> there's no good way to support searching bitfields If you wanted, you could probably store it as a small integer and then search on it, but that's waaay too complicated than you want. 2> Add the fields like you have the snippet from, so

Re: Searching by bit masks

2006-11-27 Thread Biggy
i have the same problem here. I have an interest bit field, which i receive from the applciation backend. I have control over how the docuemtns are built. To be specific, the field looks like this: ID: interest 1 : sport 2 : music 4 : film 8 : clubs So someone interested in sports and music ca

RE: Searching by bit masks

2006-11-10 Thread Larry Taylor
To: java-user@lucene.apache.org Subject: Re: Searching by bit masks Erick Erickson wrote: > Something like > Document doc = new Document(); > doc.add("flag1", "Y"); > doc.add("flag2", "Y"); > IndexWriter.add(doc); Fields have overheads. It

Re: Searching by bit masks

2006-11-10 Thread Doug Cutting
Erick Erickson wrote: Something like Document doc = new Document(); doc.add("flag1", "Y"); doc.add("flag2", "Y"); IndexWriter.add(doc); Fields have overheads. It would be more efficient to implement this as a single field with a different value for each boolean flag (as others have suggested

Re: Searching by bit masks

2006-11-10 Thread John Haxby
Larry Taylor wrote: What we need to do is to be able to store a bit mask specifying various filter flags for a document in the index and then search this field by specifying another bit mask with desired filters, returning documents that have any of the specified flags set. In other words, we are

Re: Searching by bit masks

2006-11-09 Thread Erick Erickson
Let me see if I have a clue what you're trying to do. Warning: I'm a bit confused since "filter" has a very specific meaning in Lucene, so when you talk about filters I'm assuming that you're NOT talking about Lucene filters, but rather just a set of flags you're associating with each document, an