Can I run Lucene in google app engine?

2009-04-12 Thread Noble Paul നോബിള്‍ नोब्ळ्
Is it possible to run Lucene in google app engine? has anyone tried it? -- --Noble Paul - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: exponential boosts

2009-04-12 Thread Steven Bethard
On 4/10/2009 5:13 PM, Steven Bethard wrote: > On 4/10/2009 12:56 PM, Steven Bethard wrote: >> I need to have a scoring model of the form: >> >> s1(d, q)^a1 * s2(d, q)^a2 * ... * sN(d, q)^aN >> >> where "d" is a document, "q" is a query, "sK" is a scoring function, and >> "aK" is the exponential

Re: Searching a single file

2009-04-12 Thread Michael Chan
I have been trying to use grep, but my file is way too big (~300gb). Could Lucene search through it more efficiently than grep? Thanks, Michael On Sun, Apr 12, 2009 at 7:53 PM, Shashi Kant wrote: > Not sure what the business-case for this is and why you cannot use > RegEx for this. But you cou

Re: Searching a single file

2009-04-12 Thread Shashi Kant
Not sure what the business-case for this is and why you cannot use RegEx for this. But you could consider chopping up the document into (sub) documents and adding them to the Lucene index. For example, chop by paragraph or line-break. HTH, Shashi On Sun, Apr 12, 2009 at 1:51 PM, wrote: > Hi, >

Searching a single file

2009-04-12 Thread dayzman
Hi, Can Lucene be used to return all matches of a query (perhaps using regex) in a single document? If so, is it just as efficient if it was to match multiple documents? How can that be done? Thanks Michael

Re: RangeFilter performance problem using MultiReader

2009-04-12 Thread Yonik Seeley
Hmmm, something is wrong range queries over many terms should definitely be faster. There are some other oddities in your results... - the "consolidated index" shows to be slower 295ms vs 602ms... but patch 1596 doesn't touch that code path (a single segment index). - TEST2 (using searcher.sear

Re: Sequential match query

2009-04-12 Thread Erik Hatcher
On Apr 12, 2009, at 8:15 AM, Tim Williams wrote: On Sun, Apr 12, 2009 at 5:56 AM, Erik Hatcher wrote: On Apr 11, 2009, at 9:11 PM, Tim Williams wrote: On Sat, Apr 11, 2009 at 12:25 PM, Erick Erickson wrote: That'll teach me to scan a post. The link I sent you is still relevant, but wild

Re: Sequential match query

2009-04-12 Thread Tim Williams
On Sun, Apr 12, 2009 at 5:56 AM, Erik Hatcher wrote: > > On Apr 11, 2009, at 9:11 PM, Tim Williams wrote: > >> On Sat, Apr 11, 2009 at 12:25 PM, Erick Erickson >> wrote: >>> >>> That'll teach me to scan a post. The link I sent you >>> is still relevant, but wildcards are NOT intended to be used t

Re: Sequential match query

2009-04-12 Thread Erik Hatcher
On Apr 11, 2009, at 9:11 PM, Tim Williams wrote: On Sat, Apr 11, 2009 at 12:25 PM, Erick Erickson wrote: That'll teach me to scan a post. The link I sent you is still relevant, but wildcards are NOT intended to be used to concatenate terms. You want a phrase query or a span query for that. i.

Re: RangeFilter performance problem using MultiReader

2009-04-12 Thread Raf
I am sorry, but after applying this patch, the performance on my tests are worse than those on lucene-2.9-dev trunk. TEST1: using *filter.getDocIdSet(reader)*; *Test *results* (Num docs = 2,940,738) using lucene-core-2.9-dev trunk** 1 Original index (12 collections * 6 months = 72 indexes)*