On 13.12.2012 18:00, Jack Krupansky wrote:
> Can you provide some examples of terms that don't work and the index
> token stream they fail on?
>
> Make sure that the Analyzer you are using doesn't do any magic on the
> indexed terms - your query term is unanalyzed. Maybe multiple, but
> distinct, index terms are analyzing to the same, but unexpected term.

Apart from the answer I've already given myself, here's another note on the issue. I was using WhitespaceAnalyzer for both indexing and query parsing, but the query parser lowercases expanded terms (such as prefix and wildcard terms) by default, while WhitespaceAnalyzer does not change case at all. The lowercased query terms therefore never match index terms that contain uppercase letters. Calling QueryParser.setLowercaseExpandedTerms(false) is necessary to get the same results on both sides.

Best,
Carsten

-- 
Institut für Deutsche Sprache | http://www.ids-mannheim.de
Projekt KorAP | http://korap.ids-mannheim.de
Tel. +49-(0)621-43740789 | schno...@ids-mannheim.de
Next Generation Corpus Analysis Platform
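
To illustrate the mismatch described above, here is a minimal sketch in plain Java. It does not use Lucene. The class and method names (LowercaseExpandedTermsDemo, whitespaceTokens, prefixMatches) are hypothetical stand-ins that only mimic the behavior: index terms keep their case, while the unanalyzed prefix is optionally lowercased before matching.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Illustration only: simulates the case mismatch, not the Lucene API.
class LowercaseExpandedTermsDemo {

    // Stand-in for whitespace-only analysis: split on whitespace, keep case.
    static List<String> whitespaceTokens(String text) {
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return new ArrayList<>();
        }
        return new ArrayList<>(Arrays.asList(trimmed.split("\\s+")));
    }

    // Stand-in for prefix expansion: the prefix is not analyzed, but it is
    // lowercased first when lowercaseExpandedTerms is true (assumed to mirror
    // the parser default mentioned above).
    static List<String> prefixMatches(List<String> indexedTerms,
                                      String prefix,
                                      boolean lowercaseExpandedTerms) {
        String effective = lowercaseExpandedTerms
                ? prefix.toLowerCase(Locale.ROOT)
                : prefix;
        List<String> hits = new ArrayList<>();
        for (String term : indexedTerms) {
            if (term.startsWith(effective)) {
                hits.add(term);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<String> index = whitespaceTokens("Korpus Analyse korap");
        // Lowercasing on: "Kor" becomes "kor" and misses "Korpus".
        System.out.println(prefixMatches(index, "Kor", true));
        // Lowercasing off: "Kor" matches the case-preserved term "Korpus".
        System.out.println(prefixMatches(index, "Kor", false));
    }
}
```

In the real setup, the second case corresponds to calling setLowercaseExpandedTerms(false) on the query parser, so that query terms and case-preserved index terms line up.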