Luchess: searching chess games with Lucene

2014-05-17 Thread Yann-Erwan Perio
Hello, I've just finished a little side project of mine, and wanted to let you know about it. Basically, it's a Java web application, which lets your search chess games using Lucene - hence its cheeky name "Luchess". The code is nothing complicated, as I've simply made a standard use of the API.

Re: Custom Tokenizer/Analyzer

2014-02-20 Thread Yann-Erwan Perio
On Thu, Feb 20, 2014 at 10:46 AM, Geet Gangwar wrote: Hi, > My requirement is it should have capabilities to match multiple words as > one token. for example. When user passes String as International Business > machine logo or IBM logo it should return International Business Machine as > one tok

Re: Tokenization and PrefixQuery

2014-02-14 Thread Yann-Erwan Perio
On Fri, Feb 14, 2014 at 1:11 PM, Yann-Erwan Perio wrote: > On Fri, Feb 14, 2014 at 12:33 PM, Michael McCandless > wrote: Hi again, >> That should not be the case: it should match all terms with that >> prefix regardless of the term's length. Try to boil it down to a &g

Re: Tokenization and PrefixQuery

2014-02-14 Thread Yann-Erwan Perio
On Fri, Feb 14, 2014 at 12:33 PM, Michael McCandless wrote: > This is similar to PathHierarchyTokenizer, I think. Ah, yes, very much. I'll check it out and see if I can make something of it. I am not sure to what extent it'll be reusable though, as my tokenizer also sets payloads (the next comin

Tokenization and PrefixQuery

2014-02-14 Thread Yann-Erwan Perio
Hello, I am designing a system with documents having one field containing values such as "Ae1 Br2 Cy8 ...", i.e. a sequence of items made of letters and numbers (max=7 per item), all separated by a space, possibly 200 items per field, with no limit upon the number of documents (although I would no

Re: BytesRef equals() method

2014-01-22 Thread Yann-Erwan Perio
On Wed, Jan 22, 2014 at 12:09 PM, Michael McCandless wrote: Hi, > DocsAndPositionsEnum.getPayload() is allowed to re-use the returned > BytesRef under the hood. Ah, I am starting to get it. The BytesRef would be directly stored in the key set of the map, but since its properties can change, the

Re: BytesRef equals() method

2014-01-21 Thread Yann-Erwan Perio
On Tue, Jan 21, 2014 at 7:54 PM, Steven Schlansker wrote: Hi, Firstly, thanks to all of you for your insights. > How can two byte arrays be equal if they have different lengths? > Same way as two Strings with differing lengths can never be equal, two > byte arrays with different lengths will ne

BytesRef equals() method

2014-01-21 Thread Yann-Erwan Perio
Hello, I have been working a bit with BytesRef recently, and I wonder whether the content of the equals() method, and more specifically the content of the bytesEquals(BytesRef other) method, is the intended one. Here is my use case. I work with Lucene 4.6.0. During indexing, using a custom tokeni

Re: Storing Documents in Lucene

2013-03-28 Thread Yann-Erwan Perio
On Thu, Mar 28, 2013 at 11:06 PM, Paul wrote: Hello, > Some of the stuff I've read suggests that Lucene is not especially > well-suited to storing the documents. It's supposed to be great at > indexing those documents, but not so great at storing the docs themselves. If you think about navigati

Re: Filter based on the sum of values of two fields

2013-03-26 Thread Yann-Erwan Perio
On Sun, Mar 24, 2013 at 10:46 AM, Wei Wang wrote: Hi, > For example, assume we have fields F1 and F2, we would like to find > all documents with condition F1+F2 > 5.0. This filter may be combined > with other filters to form a BooleanFilter. > > The question is, is there any way to construct an

Re: CustomScoreQuery + Collector + Scoring

2013-01-10 Thread Yann-Erwan Perio
On Thu, Jan 10, 2013 at 11:22 AM, Uwe Schindler wrote: Hi Uwe, > The best way to do this ist o wrap the standard Lucene > TopScoreDocCollector by your own collector (passing all > calls to the collector also down to the top-docs collector). > Then you don't have to take care of sorting the resul

CustomScoreQuery + Collector + Scoring

2013-01-10 Thread Yann-Erwan Perio
Hello, I am using Lucene 4.0.0, trying to put together a CustomQuery and a Collector, and have a problem with the calculation of scores. My context is as follows. I have a big BooleanQuery which works fine, but I also want to calculate some statistics during the search (i.e. perform aggregation o

Re: Beginning with Lucene

2012-12-04 Thread Yann-Erwan Perio
On Tue, Dec 4, 2012 at 4:32 PM, Mohammad Tariq wrote: Hi, > Thank you so much for the quick response guys. Got it working. I had to add > two more jars along with lucene-core-4.0.0.jar. Thanks again. There were some API and packaging changes when the v4 came out, so possibly the tutorial that y

Re: Custom Analyzer Not Called When Indexing

2012-09-09 Thread Yann-Erwan Perio
se TextField! > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > >> -Original Message- >> From: Yann-Erwan Perio [mailto:ye.pe...@gmail.com] >> Sent: Sunday, September 09, 2012

Custom Analyzer Not Called When Indexing

2012-09-09 Thread Yann-Erwan Perio
Hello, This is my first time writing to the list. I am a Java developer, writing a personal project using Lucene, and so far have been very happy with the library (v4BETA). However, I have recently decided to build and use a custom analyzer, and could not make it work with IndexWriter. I must be m