Re: Why are tokens not being indexed?

Erik Hatcher Wed, 30 Nov 2005 05:48:05 -0800

What Analyzer are you using? Have a look at the Analyzer demo withLucene in Action's code, or from my java.net article so you cananalyze your analyzer. Also try out Luke, it really is handy forseeing inside your index.

And is your text really long (> 10,000 terms)? If so, you'll needto set the max. field length higher on IndexWriter (I always useInteger.MAX_VALUE).


        Erik



On 30 Nov 2005, at 08:04, Combs, Craig wrote:

I have a body of text which is being added to a document asunstored. All
the words in the body text are coming through in the token stream for
analyzing. For some reason I can search on some of tokens andothers I can
not.

Take the following string:
"L'amministrazione di Uniface View consente un elevato grado diflessibilitàe può essere realizzata in base ai requisiti del proprio sito. NelmanualeGuida all'amministrazione di Uniface View vengono descritte leprocedure diamministrazione del sistema. Utilizzare le informazioni riportatedi seguito
per creare il portale più adatto alle proprie esigenze."
If I search for "consente" or "elevato" no results are found. If Isearchfor "view" or "grado" results are found. I find an explanation forthisbehavior. These tokens are being returned to the reader but don'tseem to
be making it into the index.
Any ideas on how to debugs or an explanation why this might behappening?
Regards,
Craig Combs
The contents of this e-mail are intended for the named addresseeonly. Itcontains information that may be confidential. Unless you are thenamedaddressee or an authorized designee, you may not copy or use it, ordiscloseit to anyone else. If you received it in error please notify usimmediately
and then destroy it.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Why are tokens not being indexed?

Reply via email to