Is it possible to get only one Field from a Document?

2008-06-11 Thread Marcelo Schneider
I have an environment where we have indexed a DB with about 6 million entries with Lucene, and each row has 25 columns. 20 columns hold integer codes used as filters (indexed/unstored), and the other 5 hold (very) large texts (also indexed/unstored). Currently the search I'm doing is like this: Hits hi

Re: Is it possible to get only one Field from a Document?

2008-06-11 Thread Marcelo Schneider
From: Marcelo Schneider [mailto:[EMAIL PROTECTED] Sent: Wednesday, June 11, 2008 13:29 To: java-user@lucene.apache.org Subject: Is it possible to get only one Field from a Document? I have an environment where we have indexed a DB with about 6 million entries with Lucene, and each row h

Re: My own analyzer in Lucene

2008-07-09 Thread Marcelo Schneider
starz10de wrote: Hi All, I am new to Lucene! I am trying to write my own analyzer (myAnalyzer) in Lucene. I wrote it and compiled it, then I added myAnalyzer.class to the folder \org\apache\lucene\analysis and then I created new jar files which contain myAnalyzer and the other files, then I

Re: Ignoring XML tags when Indexing

2008-07-24 Thread Marcelo Schneider
Do you just want to ignore them and store everything in one field? If you know in advance which tags are used, I guess you could set up a stop words list with them. If not, you could write an "XMLAnalyzer" that simply ignores everything inside '<>'... If you want to split the xml content in separate fields, y
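
To illustrate the "ignore everything inside '<>'" idea from the reply above, here is a minimal sketch in plain Java. It is not a Lucene Analyzer and is not taken from the thread: the names TagStripper and stripTags are made up for this example, and it assumes simple markup (no comments, no CDATA sections, no '>' inside attribute values).

```java
class TagStripper {
    // Hypothetical helper: drops every character from '<' up to the
    // matching '>' and leaves a single space where the tag was, so
    // that words from neighbouring elements do not run together.
    static String stripTags(String xml) {
        StringBuilder out = new StringBuilder();
        boolean inTag = false;
        for (int i = 0; i < xml.length(); i++) {
            char c = xml.charAt(i);
            if (c == '<') {
                inTag = true;                 // start skipping
            } else if (c == '>' && inTag) {
                inTag = false;                // tag closed
                out.append(' ');
            } else if (!inTag) {
                out.append(c);                // ordinary text
            }
        }
        // Collapse the runs of spaces left behind by adjacent tags.
        return out.toString().trim().replaceAll("\\s+", " ");
    }
}
```

The resulting string could then be indexed as a single field with any ordinary analyzer; whether that is enough depends on how regular the XML really is.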

Re: Ignoring XML tags when Indexing

2008-07-25 Thread Marcelo Schneider
Daniel Noll wrote: What makes more sense (at least the way I see it) is to implement a Reader which returns the text you need from the XML. This sort of thing is relatively simple to do with the newer StAX API. You can have your reader return even small chunks of text, and it should perform
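
The Reader-over-StAX approach described in the quoted message could look roughly like the sketch below. This is only one way to do it under stated assumptions, not code from the thread: the class name XmlTextReader is hypothetical, it uses the JDK's javax.xml.stream API, and appending a space after each text chunk is my own choice to keep words from adjacent elements apart.

```java
import java.io.IOException;
import java.io.Reader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical Reader that exposes only the character data of an XML stream.
class XmlTextReader extends Reader {
    private final XMLStreamReader xml;
    private String chunk = "";   // text of the current CHARACTERS event
    private int pos = 0;         // read position inside chunk

    XmlTextReader(Reader source) throws XMLStreamException {
        XMLInputFactory factory = XMLInputFactory.newInstance();
        // Ask the parser to merge adjacent text events into one.
        factory.setProperty(XMLInputFactory.IS_COALESCING, Boolean.TRUE);
        this.xml = factory.createXMLStreamReader(source);
    }

    // Advance to the next text event; returns false at end of document.
    private boolean fill() throws IOException {
        try {
            while (pos >= chunk.length()) {
                if (!xml.hasNext()) {
                    return false;
                }
                int event = xml.next();
                if (event == XMLStreamConstants.CHARACTERS
                        || event == XMLStreamConstants.CDATA) {
                    chunk = xml.getText() + " ";  // assumption: space as separator
                    pos = 0;
                }
            }
            return true;
        } catch (XMLStreamException e) {
            throw new IOException(e);
        }
    }

    @Override
    public int read(char[] buf, int off, int len) throws IOException {
        if (len == 0) {
            return 0;
        }
        if (!fill()) {
            return -1;
        }
        int n = Math.min(len, chunk.length() - pos);
        chunk.getChars(pos, pos + n, buf, off);
        pos += n;
        return n;   // may be a small chunk; callers simply read again
    }

    @Override
    public void close() throws IOException {
        try {
            xml.close();
        } catch (XMLStreamException e) {
            throw new IOException(e);
        }
    }
}
```

Because it is an ordinary java.io.Reader, an instance can be passed to anything that consumes a Reader, and the tags never reach the tokenizer.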