Hi Luke,
Thank you for your work and for sharing this information. From my point of view,
lemmatization is just one use case of text token annotation. I have been
working with Lucene since 2006 to index lexicographic and linguistic
data and I always miss the fact that (1) token attributes are not
search
Thank you for your reply.
I am using my own FrenchAnalyzer for lexical analysis. It extends
org.apache.lucene.analysis.Analyzer and my stopwords set is empty.
Benoit
On 2011-01-12 23:05, Robert Muir wrote:
On Wed, Jan 12, 2011 at 10:38 PM, Benoit Mercier
wrote:
Hi,
I have been happily using
Hi,
I have been happily using Lucene for several years to offer French lexical
analysis tools to university researchers. Today, one of them decided
to analyze the use of the French word "or" (meaning "gold") in
one of my Lucene-powered corpora... And, as you have probably already
guessed,
Thank you, Grant. I will try the approach you suggested. It confirms
that I wasn't too lost ;-)
mercibe
Grant Ingersoll wrote:
On Mar 23, 2010, at 12:58 AM, Benoit Mercier wrote:
Hi,
I would like to write a query composed of a BooleanQuery (several clauses) and
a Span
Hi,
I would like to write a query composed of a BooleanQuery (several
clauses) and a SpanQuery (SpanNearQuery), where both are mandatory.
It sounds simple, but I need to work on the spans returned by this query.
I know that I could use a Filter, but my goal is to get the spans from
the « combined