Strange StopFilter and stop words behaviour

2011-07-26 Thread SBS
to remove any words (or so I believe). How can I achieve a StandardAnalyzer where every word in the query is significant? Thanks, -sbs -- View this message in context: http://lucene.472066.n3.nabble.com/Strange-StopFilter-and-stop-words-b

Searching for words containing accents or umlauts?

2011-08-10 Thread SBS
How can I use Lucene Java to search for words containing accents or umlauts such as Hürthle, Sézary or Waldenström? I don't get any matches when I use the plain characters without accents. Is it a configuration issue? Thanks, -sbs -- View this message in context: http://lucene.4720

Re: Searching for words containing accents or umlauts?

2011-08-16 Thread SBS
Thanks, ASCIIFoldingFilter works well. -- View this message in context: http://lucene.472066.n3.nabble.com/Searching-for-words-containing-accents-or-umlauts-tp3244774p3259979.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --

Overriding default handling of '/' and '-'

2011-08-16 Thread SBS
text where processing of the slashes and dashes in this manner is probably expected and desirable. I guess if I turn off this default behaviour then I will lose it for normal words but that is probably acceptable and unavoidable. Thanks, -sbs -- View this message in context: http://lucene.47206

Searching behaviour with content containing decimal points

2011-08-22 Thread SBS
is document? I am using StandardAnalyzer and QueryParser at the moment in Lucene Java 3.2. Thanks, -sbs -- View this message in context: http://lucene.472066.n3.nabble.com/Searching-behaviour-with-content-containing-decimal-points-tp3276878p3276878.html Sent from the Lucene -

Re: Searching behaviour with content containing decimal points

2011-08-25 Thread SBS
Can anyone help me with this? Do you require further information? This has become a serious issue for us. Thanks, -sbs -- View this message in context: http://lucene.472066.n3.nabble.com/Searching-behaviour-with-content-containing-decimal-points-tp3276878p3285423.html Sent from the Lucene

Re: Searching behaviour with content containing decimal points

2011-08-25 Thread SBS
o getting such a query to match this document? I am > using StandardAnalyzer and > QueryParser at the moment in Lucene Java 3.2. I have included the previous post as I realise not everyone accesses this content via the web. -sbs -- View this message in context: http://lucene.472066.n3.nab

How to ignore apostrophes in indexes and queries?

2011-09-11 Thread SBS
cal results and the highlighter would tell me that the word /*Alzheimer's*/ needs to be highlighted in both cases for that document. Is this possible? What's the best way of implementing this? Thanks, -sbs -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-igno

How to ignore apostrophes in indexes and queries?

2011-09-11 Thread SBS
nd in both cases the highlighter would indicate that the word /*Alzheimer's*/ needs to be highlighted in the displayed content. Is this possible? What's the best way to achieve this? I am using Lucene 3.2 with StandardAnalyzer and QueryParser at present. Thanks, -sbs -- View this messa

How to ignore apostrophes in indexes and queries?

2011-09-11 Thread SBS
nd in both cases the highlighter would indicate that the word /*Alzheimer's*/ needs to be highlighted in the displayed content. Is this possible? What's the best way to achieve this? I am using Lucene 3.2 with StandardAnalyzer and QueryParser at present. Thanks, -sbs -- View this messa

How to ignore apostrophes in indexes and queries?

2011-09-11 Thread SBS
nd in both cases the highlighter would indicate that the word /*Alzheimer's*/ needs to be highlighted in the displayed content. Is this possible? What's the best way to achieve this? I am using Lucene 3.2 with StandardAnalyzer and QueryParser at present. Thanks, -sbs -- View this messa

Apologies for the multiple posts

2011-09-11 Thread SBS
Each time I posted I received a "500 Internal Server Error" message and assumed that the post had not been accepted. Obviously this was not the case! -- View this message in context: http://lucene.472066.n3.nabble.com/Apologies-for-the-multiple-posts-tp3328645p3328645.html Sent from the Lucene -

Re: How to ignore apostrophes in indexes and queries?

2011-09-13 Thread SBS
fragments and this is a serious problem. Is that because the highlighter looks for the sanitised tokens in the text instead of the raw tokens? Perhaps your solution would not have this problem but I am unsure how to implement it. Thanks, -sbs -- View this message in context: http://lucene

Enabling indexing of hyphenated terms sans the hyphen

2011-09-19 Thread SBS
finduced" to be indexed. How would I go about implementing this? We use Lucene Java 3.2. Thanks, -sbs -- View this message in context: http://lucene.472066.n3.nabble.com/Enabling-indexing-of-hyphenated-terms-sans-the-hyphen-tp3350008p3350008.html Sent from the Lucene - Java

RE: Enabling indexing of hyphenated terms sans the hyphen

2011-09-20 Thread SBS
> Solr's WordDelimiterFilterFactory does what you want. Thanks Steve, that's exactly what I was looking for and it is now working. -sbs -- View this message in context: http://lucene.472066.n3.nabble.com/Enabling-indexing-of-hyphenated-terms-sans-the-hyphen-tp3350008p3353208.html

PorterStemFilter causes wildcard searches to not work

2011-11-21 Thread SBS
ch but instead I get "sensitisation" and not "appendicitis". If I remove the PorterStemFilter then things behave as I would have expected and desired. Why is this happening? Is there a way to apply a PorterStemFilter and still be able to use wildcards? I am using Lucene

Re: PorterStemFilter causes wildcard searches to not work

2011-11-28 Thread SBS
I am applying the PorterStemFilter at both indexing and search time. As for schema, I have 3 fields: title, subtitle and notes. When the user enters a query string of */a*itis/*, my software turns this into an actual Lucene query of */title: a*itis OR subtitle: a*itis OR notes: a*itis/* and I get

Re: PorterStemFilter causes wildcard searches to not work

2011-11-29 Thread SBS
rStemFilter from my custom analyzer (which I use at both indexing and > search time) then I get the results I want (albeit I lose the other > functionality which I also need). I hope that's a bit clearer. Any ideas to explain and/or resolve this? Thanks, -s