Re: Questions about doing a full text search with numeric values

2013-07-06 Thread Erick Erickson
; see if "this letter sequence occur(s)" in it? I'm thinking I'm missing > something because that seems no different than using wildcards. Or am I > missing a subtle difference? > > Thank you. > > -----Original Message- > From: Erick Erickson [mailto:erick

Re: Questions about doing a full text search with numeric values

2013-07-03 Thread Ivan Krišto
On 07/01/2013 12:22 PM, Erick Erickson wrote: > WordDelimiterFilter(Factory if you're experimenting with > Solr as Jack suggests) will fix a number of your cases since > it splits on case change and numeric/alpha changes. If WordDelimiterFilter doesn't help, maybe you could take a look at n-gram t

RE: Questions about doing a full text search with numeric values

2013-07-01 Thread Todd Hunt
Thank you. -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Monday, July 01, 2013 5:23 AM To: java-user Subject: Re: Questions about doing a full text search with numeric values WordDelimiterFilter(Factory if you're experimenting with Solr as Jack sugges

Re: Questions about doing a full text search with numeric values

2013-07-01 Thread Erick Erickson
WordDelimiterFilter(Factory if you're experimenting with Solr as Jack suggests) will fix a number of your cases since it splits on case change and numeric/alpha changes. There are a bunch of ways to recombine things so be aware that it'll take some fiddling with the parameters. As Jack suggests, us

Re: Questions about doing a full text search with numeric values

2013-06-27 Thread Jack Krupansky
Do continue to experiment with Solr as a "testbed" - all of the analysis filters used by Solr are... part of Lucene, so once you figure things out in Solr (using the Solr Admin UI analysis page), you can mechanically translate to raw Lucene API calls. Look at the standard tokenizer, it should