subject:"Re\: Search in non\-linguistic text"

RE: Search in non-linguistic text

2009-07-16 Thread Digy

: http://people.apache.org/~digy/SingleCharAnalyzer.cs DIGY -Original Message- From: Matthew Hall [mailto:mh...@informatics.jax.org] Sent: Thursday, July 16, 2009 4:36 PM To: java-user@lucene.apache.org Subject: Re: Search in non-linguistic text Assuming your dataset isn't incredibly

Re: Search in non-linguistic text

2009-07-16 Thread Matthew Hall

Assuming your dataset isn't incredibly large, I think you could.. cheat here, and optimize your data for searching. Am I correct in assuming that BC, should also match on ABCD? If so, then yes your current thoughts on the problems that you face are correct, and everything you do will be turnin

Re: Search in non-linguistic text

2009-07-16 Thread Robert Muir

take a look at WordDelimiterFilter from Solr [you can use it in your lucene app too] On Thu, Jul 16, 2009 at 9:04 AM, JesL wrote: > > Hello, > Are there any suggestions / best practices for using Lucene for searching > non-linguistic text? What I mean by non-linguistic is that it's not English >

Re: Search in non-linguistic text

2009-07-16 Thread Anshum

Hi Jes,Good to see you here. You could try something like an n'gram analyzer. You'd have to explore, though 'm assuming it'd be helpful for you. -- Anshum Gupta Naukri Labs! http://ai-cafe.blogspot.com The facts expressed here belong to everybody, the opinions to me. The distinction is yours to d