Re: How to manage resource out of index?

2010-07-06 Thread Li Li
thank you. 2010/7/7 Rebecca Watson : > hi li, > > i looked at doing something similar - where we only index the text > but retrieve search results / highlight from files -- we ended up giving > up because of the amount of customisation required in solr -- mainly > because we wanted the distributed

Re: How to manage resource out of index?

2010-07-06 Thread Rebecca Watson
hi li, i looked at doing something similar - where we only index the text but retrieve search results / highlight from files -- we ended up giving up because of the amount of customisation required in solr -- mainly because we wanted the distributed search functionality in solr which meant making

How to manage resource out of index?

2010-07-06 Thread Li Li
I used to store full text into lucene index. But I found it's very slow when merging index because when merging 2 segments it copy the fdt files into a new one. So I want to only index full text. But When searching I need the full text for applications such as hightlight and view full text. I can s

Issue Lucene-2421 and NativeFSLockFactory.clearLock behaviour?

2010-07-06 Thread Ted McFadden
Hi, For Lucene 3.0.2, issue LUCENE-2421 ( https://issues.apache.org/jira/browse/LUCENE-2421) changed NativeFSLock.release to not raise an exception if a write.lock file could not be deleted since the presence of the file itself does not mean a lock is held. Should NativeFSLockFactory.clearLock al

Re: index format error because disk full

2010-07-06 Thread jg lin
老子喜欢用汉语,竟敢骂我,这就是洋人的素质,鄙视。有些人就是自卑到骨子里了。。。 2010/7/7 > poopy > Sent via BlackBerry from T-Mobile > > -Original Message- > From: Li Li > Date: Wed, 7 Jul 2010 10:48:53 > To: > Reply-To: java-user@lucene.apache.org > Subject: Re: index format error because disk full > > 会 > 在 2010年7月7日 上午1

Re: index format error because disk full

2010-07-06 Thread baleksan
poopy Sent via BlackBerry from T-Mobile -Original Message- From: Li Li Date: Wed, 7 Jul 2010 10:48:53 To: Reply-To: java-user@lucene.apache.org Subject: Re: index format error because disk full 会 在 2010年7月7日 上午10:46,jg lin 写道: > 你会说汉语吗(⊙_⊙)? > > 2010/7/7 Li Li > >> -- Forward

Re: index format error because disk full

2010-07-06 Thread luby
这种问题,你两个单独交流就行了。你这么一搞全世界都知道了。 于 2010-7-7 11:03, Li Li 写道: > 谢谢 > > 在 2010年7月7日 上午10:53,jg lin 写道: > >> 加个QQ群问问18038594,你的问题我不会。 >> >> Li Li 於 2010年7月7日上午10:48 ��道: >> >> >>> 会 >>> 在 2010年7月7日 上午10:46,jg lin 写道: >>> > 你会说汉语吗(⊙_⊙)? >>> 2010/7/7 Li Li >

Re: index format error because disk full

2010-07-06 Thread Li Li
谢谢 在 2010年7月7日 上午10:53,jg lin 写道: > 加个QQ群问问18038594,你的问题我不会。 > > Li Li 於 2010年7月7日上午10:48 ��道: > >> 会 >> 在 2010年7月7日 上午10:46,jg lin 写道: >> > 你会说汉语吗(⊙_⊙)? >> > >> > 2010/7/7 Li Li >> > >> >> -- Forwarded message -- >> >> From: Li Li >> >> Date: 2010/7/7 >> >> Subject: index fo

Re: about contrib instantiated

2010-07-06 Thread jg lin
用汉语几句话就搞定的东西,非得叽叽歪歪整这么一段。洋文一个字,烂。。。 2010/7/3 Karl Wettin > > 2 jul 2010 kl. 08.32 skrev Li Li: > > > I have an index of >> about 8,000,000 document and the current index size is about 30GB. Is >> it possbile to use this contrib to speed up my search? I have enough >> memory for it. >> > > > In

Re: index format error because disk full

2010-07-06 Thread jg lin
加个QQ群问问18038594,你的问题我不会。 Li Li 於 2010年7月7日上午10:48 ��道: > 会 > 在 2010年7月7日 上午10:46,jg lin 写道: > > 你会说汉语吗(⊙_⊙)? > > > > 2010/7/7 Li Li > > > >> -- Forwarded message -- > >> From: Li Li > >> Date: 2010/7/7 > >> Subject: index format error because disk full > >> To: solr-u...@luce

Re: index format error because disk full

2010-07-06 Thread Li Li
会 在 2010年7月7日 上午10:46,jg lin 写道: > 你会说汉语吗(⊙_⊙)? > > 2010/7/7 Li Li > >> -- Forwarded message -- >> From: Li Li >> Date: 2010/7/7 >> Subject: index format error because disk full >> To: solr-u...@lucene.apache.org >> >> >> the index file is ill-formated because disk full when feed

Re: CFP for Surge Scalability Conference 2010

2010-07-06 Thread jg lin
你认识汉字吗(⊙_⊙)? 2010/7/3 Jason Dixon > A quick reminder that there's one week left to submit your abstract for > this year's Surge Scalability Conference. The event is taking place on > Sept 30 and Oct 1, 2010 in Baltimore, MD. Surge focuses on case studies > that address production failures an

Re: index format error because disk full

2010-07-06 Thread jg lin
你会说汉语吗(⊙_⊙)? 2010/7/7 Li Li > -- Forwarded message -- > From: Li Li > Date: 2010/7/7 > Subject: index format error because disk full > To: solr-u...@lucene.apache.org > > > the index file is ill-formated because disk full when feeding. Can I > roll back to last version? Is there

Re: Fielded Queries Question

2010-07-06 Thread jg lin
cao li ma. 2010/7/7 Erick Erickson > That's not the meaning of phrase query at all. Searching for a phrase > implies proximity. That is, searching for name:"Tom Jones" requires that > these two terms appear next to each other, whereas > name:Tom AND name:Jones merely requires that the two terms

Fwd: index format error because disk full

2010-07-06 Thread Li Li
-- Forwarded message -- From: Li Li Date: 2010/7/7 Subject: index format error because disk full To: solr-u...@lucene.apache.org the index file is ill-formated because disk full when feeding. Can I roll back to last version? Is there any method to avoid unexpected errors when ind

Re: Fielded Queries Question

2010-07-06 Thread Erick Erickson
That's not the meaning of phrase query at all. Searching for a phrase implies proximity. That is, searching for name:"Tom Jones" requires that these two terms appear next to each other, whereas name:Tom AND name:Jones merely requires that the two terms appear anywhere in the field, no matter how ma

Re: Write QueryParser in JavaScript and use Lucene JAVA Indexer for searching

2010-07-06 Thread Kasun Gajasinghe
This is a reminder, as the mail was sent in the weekend! Please go through it and suggest me a solution. Thanks in advance, /KasunBG On Sun, Jul 4, 2010 at 3:44 AM, Kasun Gajasinghe wrote: > Hello, > I need to add a completely client-side search mechanism to a help system that > I'm implementi

Fielded Queries Question

2010-07-06 Thread Thomas Nguyen
Hello All, Can someone explain to me how fielded queries work with phrases? My first thought is that the phrase is broken down into terms and those terms are then fielded and separated with the AND operator. An example would be the following: name:"Tom Jones" --> name:"Tom" AND name:"Jones" I

RE: Question: searching on a NumericField, with sort and filter

2010-07-06 Thread Eduardo Pierdant
Works perfectly now, thx! -Original Message- From: Uwe Schindler [mailto:u...@thetaphi.de] Sent: martes, 06 de julio de 2010 01:31 p.m. To: java-user@lucene.apache.org Subject: RE: Question: searching on a NumericField, with sort and filter Hi Eduardo, For sorting or a NumericRangeQuer

RE: Question: searching on a NumericField, with sort and filter

2010-07-06 Thread Uwe Schindler
Hi Eduardo, For sorting or a NumericRangeQuery, the field must be *indexed*. Storing has no effect at all, it's only needed, if you want to show the value in search results. That is important for *any* type of field (not only NumericField). Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 B

Question: searching on a NumericField, with sort and filter

2010-07-06 Thread Eduardo Pierdant
Hi all. I've been dealing with a small problem when searching and trying to sort and filter on a NumericField using Lucene 2.9.2; the result never comes back as expected. Here are some snippets of my code, so you might take a look and reference me or help me out to solve this problem. At doc

Re: multi-term synonym expansion

2010-07-06 Thread darren
How does the synonym filter work internally? I configured it with a very large synonym file (90,000 lines) running Solr in glassfish and it started fine, but when I queried, it hung and ran out of memory. The file wasn' big enough to exhaust the heapI never was able to get it to run smoothly.

require that top results includes a certain percentage from each source

2010-07-06 Thread Aaron Surty
Hello! I'm new to using lucene and was hoping to get advice on best practices to solve the following problem. I have a set of documents each coming from their own source. I want to require that the top K docs have a minimum of n relevant docs from certain sources. For example, lets say

Re: multi-term synonym expansion

2010-07-06 Thread Ahmet Arslan
> My custom SKOSAnalyzer already performs synonym expansion > based on the labels defined in a given SKOS model. But now I > have the problem that real-world thesauri often define > (multi terms) synonyms for mult-term words. Here is an > example that defines the abbreviation "UN" as synonym for >

multi-term synonym expansion

2010-07-06 Thread Bernhard Haslhofer
Hi, I am currently working on a Lucene module that makes use of controlled SKOS vocabularies (http://www.w3.org/TR/skos-primer/) during index and search time. It should work similar to Lucene's Wordnet contrib module, just with some extended SKOS-specific functionality (e.g., support for broade

Re: Lucene php - search client index 3.0. compatible

2010-07-06 Thread Licinio Fernández Maurelo
Sorry , the lucene index format i was talking is 2.9 El 6 de julio de 2010 14:54, Licinio Fernández Maurelo < licinio.fernan...@gmail.com> escribió: > Hi folks, > > I'm looking for a lucene php search client library lucene index 3.0 > compatible. > > I've only found info related to zend framewor

Lucene php - search client index 3.0. compatible

2010-07-06 Thread Licinio Fernández Maurelo
Hi folks, I'm looking for a lucene php search client library lucene index 3.0 compatible. I've only found info related to zend framework lucene support (2.3 lucene index compatible). Any help much appreciated -- |_ici ~Java Consultant~

Re: Search on top x results

2010-07-06 Thread liat oren
Ok, thanks, I will try that On 6 July 2010 11:57, Uwe Schindler wrote: > Both must be "must", else it makes no sense. > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: liat oren [mailto

RE: Search on top x results

2010-07-06 Thread Uwe Schindler
Both must be "must", else it makes no sense. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: liat oren [mailto:oren.l...@gmail.com] > Sent: Tuesday, July 06, 2010 10:53 AM > To: java-user@lucene.apache.o

Re: Search on top x results

2010-07-06 Thread liat oren
Thank you! So the best thing to do is to add the first boolean query as a MUST, and the second query as a should? Even if both of them query the same fields? Will it actually be faster? On 6 July 2010 11:35, Ian Lea wrote: > > http://wiki.apache.org/lucene-java/LuceneFAQ#Can_Lucene_do_a_.22sea

Re: Search on top x results

2010-07-06 Thread Ian Lea
http://wiki.apache.org/lucene-java/LuceneFAQ#Can_Lucene_do_a_.22search_within_search.22.2C_so_that_the_second_search_is_constrained_by_the_results_of_the_first_query.3F -- Ian. On Tue, Jul 6, 2010 at 9:25 AM, liat oren wrote: > Hi all, > > Is it possible to run a search over top 100,000 (for ex

Search on top x results

2010-07-06 Thread liat oren
Hi all, Is it possible to run a search over top 100,000 (for example) results of a prior search. So if the user first does the search, gets results, if pressing on the search button again, I would like it to run on the top 100,000 results. Thanks, Liat

Re: Using Lucene with a rather simplistic scoring system?

2010-07-06 Thread Marcel Bruch
Hi all. to close my thread: My requirement was to build a simple scoring system that basically reuses Lucene's index infrastructure but not its advanced scoring system, i.e., I had to replace the query and scorer infrastructure with my own implementation. In detail I had to come up with my

Re: Lucene Scoring

2010-07-06 Thread Ian Lea
You are calling the explain method incorrectly. You need something like System.out.println(indexSearcher.explain(query, 0)); See the javadocs for details. -- Ian. On Tue, Jul 6, 2010 at 7:39 AM, manjula wijewickrema wrote: > Dear Grant, > > Thanks a lot for your guidence. As you have ment