; see if "this letter sequence occur(s)" in it? I'm thinking I'm missing
> something because that seems no different than using wildcards. Or am I
> missing a subtle difference?
>
> Thank you.
>
> -----Original Message-----
> From: Erick Erickson [mailto:erick
On 07/01/2013 12:22 PM, Erick Erickson wrote:
> WordDelimiterFilter(Factory if you're experimenting with
> Solr as Jack suggests) will fix a number of your cases since
> it splits on case change and numeric/alpha changes.
If WordDelimiterFilter doesn't help, maybe you could take a look at
n-gram t
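
To make the n-gram idea concrete, here is a minimal plain-Java sketch of what character n-grams of a term look like. It is only an illustration under stated assumptions: the class name NGramSketch and the fixed gram size of 3 are made up for this example, and no Lucene classes are used. Once a term is indexed as its overlapping fragments, a "does this letter sequence occur in it" query can become an ordinary term match on a fragment instead of a wildcard scan.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of Lucene.
class NGramSketch {
    // Return every contiguous substring of length n, left to right.
    static List<String> ngrams(String term, int n) {
        List<String> grams = new ArrayList<>();
        for (int i = 0; i + n <= term.length(); i++) {
            grams.add(term.substring(i, i + n));
        }
        return grams;
    }

    public static void main(String[] args) {
        // Prints [abc, bc1, c12, 123]
        System.out.println(ngrams("abc123", 3));
    }
}
```

The trade-off is index size: every term contributes many fragments, so the gram sizes are worth experimenting with.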
Thank you.
-----Original Message-----
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: Monday, July 01, 2013 5:23 AM
To: java-user
Subject: Re: Questions about doing a full text search with numeric values
WordDelimiterFilter(Factory if you're experimenting with
Solr as Jack suggests) will fix a number of your cases since
it splits on case change and numeric/alpha changes. There
are a bunch of ways to recombine things so be aware that
it'll take some fiddling with the parameters. As Jack
suggests, us
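
As a rough illustration of the splitting rule described above (case change and numeric/alpha change), the following plain-Java sketch mimics that behaviour on a single string. It is a simplification and not Lucene's WordDelimiterFilter: the class name DelimiterSplitSketch is hypothetical, only a lower-to-upper transition counts as a case change, anything that is neither a digit nor a cased letter is treated as a delimiter, and none of the real filter's recombination options are modelled.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for illustration; the real filter has many more options.
class DelimiterSplitSketch {
    // Character classes; a change of class may mark a sub-word boundary.
    private static final int DIGIT = 0, UPPER = 1, LOWER = 2, OTHER = 3;

    private static int kind(char c) {
        if (Character.isDigit(c)) return DIGIT;
        if (Character.isUpperCase(c)) return UPPER;
        if (Character.isLowerCase(c)) return LOWER;
        return OTHER; // punctuation, whitespace, uncased letters: treated as delimiters here
    }

    static List<String> split(String text) {
        List<String> parts = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int prev = OTHER;
        for (char c : text.toCharArray()) {
            int k = kind(c);
            boolean boundary = k == OTHER
                    || (prev == DIGIT) != (k == DIGIT)   // letter/digit change
                    || (prev == LOWER && k == UPPER);    // lower-to-upper case change
            if (boundary && current.length() > 0) {
                parts.add(current.toString());
                current.setLength(0);
            }
            if (k != OTHER) current.append(c); // delimiters are dropped
            prev = k;
        }
        if (current.length() > 0) parts.add(current.toString());
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(split("PowerShot500")); // [Power, Shot, 500]
        System.out.println(split("AB-123x"));      // [AB, 123, x]
    }
}
```

Seeing the parts this way can help when deciding which recombination parameters to try on the real filter.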
Do continue to experiment with Solr as a "testbed" - all of the analysis
filters used by Solr are... part of Lucene, so once you figure things out in
Solr (using the Solr Admin UI analysis page), you can mechanically translate
to raw Lucene API calls.
Look at the standard tokenizer, it should