Re: How to search special characters in LUcene

2009-04-22 Thread uday kumar maddigatla
Hi, Here are the details about my goals. 1. I want to use Lucene for mixed languages. 2. I want to index documents that are in English, Danish, etc. I'm attaching my IndexFiles.java file. When I'm searching I'm giving the index path location as well as the documents folder.

Re: Faceting, Sort and DocIDSet

2009-04-22 Thread John Wang
Karsten: Yes, you kinda need that for faceting to work. Take a look at FacetDataCache class. -John On Wed, Apr 22, 2009 at 3:06 AM, Karsten F. wrote: > > Hi Dave, > > facets: > in your case a solution with one > int[IndexReader.maxDoc()] > fits. For each document number you can store an inte

Re: Why is CustomScoreQuery limited to ValueSourceQuery type?

2009-04-22 Thread Doron Cohen
Steve, I added a patch in https://issues.apache.org/jira/browse/LUCENE-1608, which allows wrapping any query in a value source and then creating a value source query out of it. Let us know how this works for you... Doron On Wed, Apr 22, 2009 at 2:07 AM, Steven Bethard wrote: > On 4/21/2009 10:09 A

Question around LOM | Lucene Ontology

2009-04-22 Thread Rangan Gupta
Hi, I am a newbie to Lucene, hence this question about how to implement ontology-based search using Lucene (LOM). It would be useful to be pointed to any books, white papers, etc. detailing this. Thanks R

RE: Boolean Logic inside a QueryWrapperFilter

2009-04-22 Thread Steven A Rowe
Hi David, On 4/22/2009 at 4:15 PM, David Seltzer wrote: > I have some code that dynamically creates a Boolean query designed to > work as a filter. After the query runs I end up with this filter. > > Filter: QueryWrapperFilter(+(-SourceID:100) > +spanNear([ArticleContent:nuclear, ArticleContent:p

Boolean Logic inside a QueryWrapperFilter

2009-04-22 Thread David Seltzer
Hi Everyone, I have some code that dynamically creates a Boolean query designed to work as a filter. After the query runs I end up with this filter. Filter: QueryWrapperFilter(+(-SourceID:100) +spanNear([ArticleContent:nuclear, ArticleContent:proliferation], 30, false)) My expectation is that

Re: Using Payloads

2009-04-22 Thread Murat Yakici
No, just checking. I will let everyone know if I see one. Cheers, Murat > > On Apr 21, 2009, at 4:39 AM, Murat Yakici wrote: > >> Hi, >> I started playing with the experimental payload functionality. I >> have written an analyzer which adds a payload (some sort of a score/ >> boost) for each te

Re: Using Payloads

2009-04-22 Thread Grant Ingersoll
On Apr 21, 2009, at 4:39 AM, Murat Yakici wrote: Hi, I started playing with the experimental payload functionality. I have written an analyzer which adds a payload (some sort of a score/boost) for each term occurrence. The payload/score for each term is dependent on the document that the t

Fwd: ApacheCon

2009-04-22 Thread Grant Ingersoll
Begin forwarded message: We are having a discussion on gene...@lucene.apache.org about Lucene-related content for ApacheCon. You may want to check it out and see http://wiki.apache.org/jakarta-lucene/LuceneAtApacheConUs2009 -- Grant Ingersoll http://www.lucidimagi

Re: changing term freq in indexing time

2009-04-22 Thread liat oren
The reason I am searching "3 3 2 1" and not "3 2 1" is the reason I asked the question: it is important to take these frequencies into account as well when generating these scores. Think of it this way: if a word appears more frequently in a text, it is more important. I managed to make the boosting

Re: changing term freq in indexing time

2009-04-22 Thread Eran Sevi
Hi, I'm no expert on the subject but it seems like you're searching for one term that should be "3 3 2 1" (why do you write "3" two times anyway?). I think you should try a regular boolean query where each sub-query is a BoostingTermQuery on one term only. These queries should be used with Occur.M

Re: How to search special characters in LUcene

2009-04-22 Thread Erick Erickson
Are you *also* using the DutchAnalyzer for your *query*? Please show us the index and search code (simplified as much as possible), then we'll be able to provide better suggestions. Also, tell us a bit more about your goals here. Is this an index entirely of Dutch documents? Or is it a mixed-lang

Re: Appropriate analyzer

2009-04-22 Thread Erick Erickson
*If* your terms are simple (that is, not wildcarded), you may get some joy from TermEnum. The idea here would be to find the longest term *already in your index* that satisfies your need and use that to form a simple TermQuery. Essentially using TermEnum.skipTo on successively shorter strings u
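
A rough sketch of the "successively shorter strings" idea, using only the JDK. This is not Lucene code: a sorted TreeSet stands in for the index's term dictionary, and TreeSet.ceiling plays the role that TermEnum.skipTo plays in the message (both position at the first term at or after the target). The class name, method name, and sample terms are made up for illustration, and the stopping rule (exact match of a prefix) is an assumption, since the original message is cut off.

```java
import java.util.Arrays;
import java.util.Optional;
import java.util.TreeSet;

// Hypothetical sketch: a TreeSet stands in for the sorted term dictionary.
final class LongestIndexedTermSketch {

    // Returns the longest prefix of `input` that exists as a term in the set.
    static Optional<String> longestIndexedPrefix(TreeSet<String> indexedTerms, String input) {
        for (int len = input.length(); len > 0; len--) {
            String candidate = input.substring(0, len);
            // ceiling() returns the first term >= candidate, or null if none,
            // which is the same positioning a skipTo-style seek gives you.
            String atOrAfter = indexedTerms.ceiling(candidate);
            if (candidate.equals(atOrAfter)) {
                return Optional.of(candidate);
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        TreeSet<String> terms =
                new TreeSet<>(Arrays.asList("nuclear", "nuke", "proliferation"));
        System.out.println(longestIndexedPrefix(terms, "nuclearpower"));
        System.out.println(longestIndexedPrefix(terms, "xyz"));
    }
}
```

With a real index the matched string would then be used to build the simple TermQuery mentioned above.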

Re: changing term freq in indexing time

2009-04-22 Thread liat oren
Thanks Eran, I tried it, adding the classes I copied below and tried to run the following code: [Also I have below a question about the usage of synonyms and BooleanQuery.] DoubleMap wordMap = new DoubleMap(); wordMap.insert("1", 1, 5); // for word "1" we have the word 1, 5 times wordMap.i

RE: Faceting, Sort and DocIDSet

2009-04-22 Thread Karsten F.
Hi Dave, facets: in your case a solution with one int[IndexReader.maxDoc()] fits. For each document number you can store an integer which represents the facet value. This is what org.apache.solr.request.UnInvertedField will store in your case. (*John* : is there something similar in com.browseeng
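
A minimal sketch of the one-int-per-document idea, in plain Java with no Lucene dependency. The array length stands in for IndexReader.maxDoc(), each slot holds an integer (here an ordinal into a list of facet values), and counting facets for a result set is one array lookup per hit. The class name, method name, and sample data are invented for illustration; how the array gets filled from a real index is not shown.

```java
// Hypothetical sketch: plain arrays stand in for a Lucene index.
final class FacetCountSketch {

    // facetOrdinalByDoc[docId] holds the facet ordinal for that document;
    // its length plays the role of IndexReader.maxDoc().
    static int[] countFacets(int[] facetOrdinalByDoc, int[] hitDocIds, int facetCount) {
        int[] counts = new int[facetCount];
        for (int docId : hitDocIds) {
            counts[facetOrdinalByDoc[docId]]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        String[] facetValues = {"news", "blog", "wiki"}; // made-up facet values
        int[] facetOrdinalByDoc = {0, 1, 1, 2, 0, 1};    // six documents
        int[] hits = {1, 2, 4, 5};                       // doc ids matched by some query
        int[] counts = countFacets(facetOrdinalByDoc, hits, facetValues.length);
        for (int i = 0; i < counts.length; i++) {
            System.out.println(facetValues[i] + ": " + counts[i]);
        }
    }
}
```

The cost is one int per document regardless of how many hits a query has, which is why this layout only fits when each document has a single facet value.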

SpellChecker locks folder

2009-04-22 Thread John Cherouvim
Hello, After I call the SpellChecker.indexDictionary method the directory which contained the Lucene index is locked. I cannot rename or delete the folder (Windows). In the source of SpellChecker lines 352-353 I see that after the indexing is done the index is reopened: searcher.close(); sea