I have a environment where we have indexed a DB with about 6mil entries
with Lucene, and each row has 25 columns. 20 cols have integer codes
used as filters (indexed/unstored), and the other 5 have (very) large
texts (also indexed/unstored). Currently the search I'm doing is like this:
Hits hi
ssage-
From: Marcelo Schneider [mailto:[EMAIL PROTECTED]
Sent: Wednesday, June 11, 2008 13:29
To: java-user@lucene.apache.org
Subject: Is it possible to get only one Field from a Document?
I have a environment where we have indexed a DB with about 6mil entries
with Lucene, and each row h
starz10de escreveu:
Hi All,
I am new in lucene!
I am trying to do my own nalyzer (myAnalyzer) in lucene. I worte it and I
compile it, then i add myAnlayzer.class to the folder
\org\apache\lucene\analysis and then i create new jar files which
contains myAnalyzer and the other files, then i
Do you just want to ignore them and store all in one field? If you know
the used tags previously, I guess you could set up a stop words list
with them. If not, you could do an "XMLAnalyzer" that simply ignores
everything inside '<>'...
If you want to split the xml content in separate fields, y
Daniel Noll wrote:
What makes more sense (at least the way I see it) is to implement a
Reader which returns the text you need from the XML. This sort of
thing is relatively simple to do with the newer StAX API. You can
have your reader return even small chunks of text, and it should
perform