Re: Issue upgrading from lucene 2.3.2 to 2.4 (moving from bitset to docidset)

2008-12-09 Thread Michael Stoppelman
Yeah looks similar to what we've implemented for ourselves (although I haven't looked at the implementation). We've got quite a custom version of lucene at this point. Using Solr at this point really isn't a viable option, but thanks for pointing this out. M On Tue, Dec 9, 2008 at 1:47 AM, Michae

Taxonomy in Lucene

2008-12-09 Thread Kesarkar, Dipak
Hi I want to include Taxonomy feature in my search. Does Lucene support Taxonomy? How? If not, is there in different way to add Taxonomy feature in the Lucene search? Regards, Dipak This message contains information that may be privileged or confidential and is the property o

Re: Chinese Analyzer evaluation

2008-12-09 Thread Cooper Geng
I found these libraries from the google engine. But I have no experience on using these classes. Do you any suggestion on Asian languages Analyzers? Specially for Chinese Thanks in advance. On Wed, Dec 10, 2008 at 9:17 AM, John Wang <[EMAIL PROTECTED]> wrote: > Hi Cooper: >Where are thes

Re: Chinese Analyzer evaluation

2008-12-09 Thread John Wang
Hi Cooper: Where are these classes? Thanks -John On Tue, Dec 9, 2008 at 2:27 AM, Cooper Geng <[EMAIL PROTECTED]> wrote: > Hi all, > > My application will provide Chinese search engine. I got some analyzer on > Chinese language. > Any suggestion about these: > > IK_CAnalyzer > IKAnalyzer > >

Re: Has anyone written SpanFuzzyQuery?

2008-12-09 Thread Mark Miller
http://issues.apache.org/jira/browse/LUCENE-522 note the bugs mentioned at the bottom. - Mark - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Has anyone written SpanFuzzyQuery?

2008-12-09 Thread Teruhiko Kurosaka
I'd like to search docs that has terms that are close to those supplied in the query in that order in the search field. I was thinking of combining the fuzzy query with SpanNearQuery, but the constructor of SpanNearQuery takes an array of SpanQuery objects. So I would need a fuzzy query implemen

Re: Problem with PorterStemFilter

2008-12-09 Thread Erick Erickson
Thanks for reporting back, I learned something new today... Best Erick On Tue, Dec 9, 2008 at 1:02 PM, Preetam Rao <[EMAIL PROTECTED]> wrote: > Thanks Eric. Looking at Luke output helped. > > The problem was that I had overridden tokenStream() of the StandardAnalyzer > but did not override the

Re: Problem with PorterStemFilter

2008-12-09 Thread Preetam Rao
Thanks Eric. Looking at Luke output helped. The problem was that I had overridden tokenStream() of the StandardAnalyzer but did not override the reusableTokenStream(). The IndexWriter was using reusableTokenStream() and QueryParser was using tokenStream() and hence the mismatch. So looks like one

Looking for a way to customize how StandardAnalyzer handles punctuation

2008-12-09 Thread Greg Shackles
Hey everyone, I'm running into a problem where some punctuation that I would actually want to keep gets thrown out because they don't get tokenized. By far the most common case for this is ampersand, but it does happen with others as well. My concern isn't even so much in that I need to be able t

Re: Lucene SpellChecker returns no suggetions after changing Server

2008-12-09 Thread Matthias W.
I tested it with the lucene-spellchecker-2.4.0.jar... Then I get the queryString as suggestion if it exists in the index. But there's no chance to get other suggestions neither with LevensteinDistance nor with JaroWinklerDistance. The setAccuracy() method has no effect on the result. I think this

Chinese Analyzer evaluation

2008-12-09 Thread Cooper Geng
Hi all, My application will provide Chinese search engine. I got some analyzer on Chinese language. Any suggestion about these: IK_CAnalyzer IKAnalyzer or more? -- Best Regards Cooper Geng

Re: Issue upgrading from lucene 2.3.2 to 2.4 (moving from bitset to docidset)

2008-12-09 Thread Michael McCandless
This use case sounds alot like faceted navigation, which Solr provides. Mike Michael Stoppelman wrote: Hi all, I'm working on upgrading to Lucene 2.4.0 from 2.3.2 and was trying to integrate the new DodIdSet changes since o.a.l.search.Filter#bits() method is now depreciated. For our app we