date:20090511

RE: RegexQuery Incomplete Results

2009-05-11 Thread Huntsman84

This is the code for searching: String index = "index"; String field = "contents"; IndexReader reader = IndexReader.open(index); Searcher searcher = new IndexSearcher(reader); System.out.println("Enter query: "); String line = ".IN.";//in jakarta regexp this is like * IN * RegexQuery rxquery = n

Re: RegexQuery Incomplete Results

2009-05-11 Thread Ian Lea

The default regex package is java.util.regex and I can't see anywhere that you tell it to use the Jakarta regexp package. So I don't think that ".in" will match. Also, you are storing your contents field as NOT_ANALYZED so you will need to be wary of case sensitivity. Maybe this is what you want

Downloading Lucene 2.9 from Maven

2009-05-11 Thread Paul Taylor

Hi I'm working on an opensrc project that needs Lucene 2.9 and uses Maven, it would be great if Lucene 2.9 could be found automatically so that other developers can easily run and build the project. First of all I changed the version field from 2.4 to 2.9 in my pom.xml but lucene couldn't find

Re: 'problem with indexformat and luke

2009-05-11 Thread Grant Ingersoll

It's not bad, the fix is simply to link your new JARs with the Luke one. As for why you use 2.9, I don't know. It's not officially released, but that doesn't stop a lot of people from using it. Trunk is generally stable, so if it works for you, I wouldn't worry too much about it. On M

Re: Stemming

2009-05-11 Thread Hannu Väisänen

On Fri, May 08, 2009 at 08:57:59AM -0400, Matthew Hall wrote: > process your > words into a more base form before they go into the stemmed Malaga (http://home.arcor.de/bjoern-beutel/malaga/) can be used to make a program that converts words to a base form. --

Re: Boosting query - debuging

2009-05-11 Thread Grant Ingersoll

On May 10, 2009, at 5:59 AM, liat oren wrote: The output is the following: *finlin, score: 19.366615* 19.366615 = (MATCH) fieldWeight(worlds:6621468^3.0 in 35433), product of: 4.2426405 = (MATCH) btq, product of: 0.70710677 = tf(phraseFreq=0.5) 6.0 = scorePayload(...) 7.3036084 = idf

Re: Downloading Lucene 2.9 from Maven

2009-05-11 Thread Michael McCandless

Lucene's trunk build (2.9-dev) is supposed to be available, as 2.9-SNAPSHOT. But something is wrong: we seem to be building 2.9 trunk but attempting to publish it (incorrectly) as 2.4-SNAPSHOT. But then, 2.4-SNAPSHOT is not published correctly... I only see the maven-metadata.xml, and no artifact

I can't found the package org.apache.lucene.index .memory.AnalyzerUtil

2009-05-11 Thread Kamal Najib

Hi all, can someone tell me where can i find the package org.apache.lucene.index.memory.AnalyzerUtil in the lucene api?I can't found it. many thanks. Kamal. -- - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.or

thanks, i have got it!(the package :org.apache.l ucene.index.memory.AnalyzerUtil)

2009-05-11 Thread Kamal Najib

I have found it. thanks. Kamal. -- - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: RegexQuery Incomplete Results

2009-05-11 Thread Huntsman84

The RegexQuery class uses that package, and for that reason the expression matches. If my records contained only one word each, this code would work, but I need to apply that regular expression to a phrase... Ian Lea wrote: > > The default regex package is java.util.regex and I can't see anywh

[ no subject ]

2009-05-11 Thread Kamal Najib

will the anlyzer now do stemming, if i do the folow: analyzer = new StandardAnalyzer(); analyzer=AnalyzerUtil.getPorterStemmerAnalyzer(analyzer); thanks. Kamal. -- - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.or

how to do stemming?

2009-05-11 Thread Kamal Najib

will the anlyzer now do stemming, if i do the folow: analyzer = new StandardAnalyzer(); analyzer=AnalyzerUtil.getPorterStemmerAnalyzer(analyzer); thanks. Kamal. -- -- - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apa

Re: I can't found the package org.apache.lucene.index.memory.AnalyzerUtil

2009-05-11 Thread Erick Erickson

The class is contained in org.apache.lucene.index.memory.AnalyzerUtil Assuming you've installed 2.4, it's in... which is located in the contrib area. Try looking in /contrib/memory/lucene-memory-2.4.0.jar Best Erick 2009/5/11 Kamal Najib > Hi all, > can someone tell me where can i find the p

Re: Re: I can't found the package org.apache.luc ene.index.memory.AnalyzerUtil

2009-05-11 Thread Kamal Najib

Thank you. kamal. Original Message: The class is contained in org.apache.lucene.index.memory.AnalyzerUtil Assuming you've installed 2.4, it's in... which is located in the contrib area. Try looking in /contrib/memory/lucene-memory-2.4.0.jar Best Erick 2009/5/11 Kamal Najib > Hi all, > can

Re: how to do stemming?

2009-05-11 Thread Ian Lea

Yep, I reckon so. btw a Google search for something like lucene stemming gets hits, including a couple of articles about stemming. Might be worth a look. -- Ian. On Mon, May 11, 2009 at 2:08 PM, Kamal Najib wrote: > will the anlyzer now do stemming, if i do the folow: > analyzer = new Standar

Re: Re: how to do stemming?

2009-05-11 Thread Kamal Najib

Thank you Ian. Kamal Original Message: Yep, I reckon so. btw a Google search for something like lucene stemming gets hits, including a couple of articles about stemming. Might be worth a look. -- Ian. On Mon, May 11, 2009 at 2:08 PM, Kamal Najib wrote: > will the anlyzer now do stemming, if

Re: RegexQuery Incomplete Results

2009-05-11 Thread Ian Lea

The little self-contained program below runs regex queries for a few regexps against a few phrases for both the java.util and jakarta regexp packages. Output when run with lucene 2.4.1 and jakarta-regexp 1.5 is Added Knowing yourself Added Old clinic Added INSIDE Added Not INSIDE Default regexc

IndexReader.Terms - internals

2009-05-11 Thread Ian Vink

IndexReader rdr = IndexReader.Open(myFolder); TermEnum terms = rdr.Terms((new Term(myTermName, ""))); (from .NET land, but it's all the same) This code works great, I can loop thru the terms nicely, but after it returns all the myTermName terms, it goes into all other term

Re: IndexReader.Terms - internals

2009-05-11 Thread Ian Lea

I believe not. Just get out when term.field() != myTermName, interned. -- Ian. On Mon, May 11, 2009 at 4:35 PM, Ian Vink wrote: > IndexReader rdr = IndexReader.Open(myFolder); > TermEnum terms = rdr.Terms((new Term(myTermName, ""))); > > (from .NET land, but it's all the

Re: RegexQuery Incomplete Results

2009-05-11 Thread Huntsman84

That's it!!! The problem was with the regular expression, the one I need is ".*IN"!! Thank you so much, I was turning mad... =) Ian Lea wrote: > > The little self-contained program below runs regex queries for a few > regexps against a few phrases for both the java.util and jakarta > regexp p

Re: IndexReader.Terms - internals

2009-05-11 Thread David Causse

Hi, We noticed this behaviour also, so we do like this : Map result = new HashMap(); TermEnum all; if(matcher.fullScan()) { all = reader.terms(new Term(field)); } else { all = reader.terms(new Term(field, matcher.prefix())); } if(all == null) return result; Term t; do { t = a

RE: IndexReader.Terms - internals

2009-05-11 Thread Uwe Schindler

No, there is no other way to do this. And if you think, the TermEnum takes too much RAM when returning all terms and also from different, you can be sure, that there is no wasted memory, as the term enum does not allocate the whole terms (like normal Java iterators). The term enum is iterated on di

Re: Lucene Index Encryption

2009-05-11 Thread Babak Farhang

On Mon, May 11, 2009 at 12:19 AM, Andrzej Bialecki wrote: > > Unfortunately, current Lucene IndexWriter implementation uses seek / > overwrite when writing term info dictionary. This is described in more > detail here: > > https://issues.apache.org/jira/browse/LUCENE-532 > Thanks for the enlight

Re: IndexReader.Terms - internals

2009-05-11 Thread Ian Vink

Thanks guys, Here's what I built: http://BahaiResearch.com It allows any language speaker to read about another person's religion in any language. Helps promote unity in diversity. It's open source. Ian On Mon, May 11, 2009 at 1:39 PM, Uwe Schindler wrote: > No, there is no other way to do

Re: Lucene Index Encryption

2009-05-11 Thread Michael McCandless

On Mon, May 11, 2009 at 2:06 PM, Babak Farhang wrote: > I am not familiar with the details of CFS, but I didn't interpret > Michael's comment to mean that there is actually any rewriting going > on here. The problem here appears to be one of translating the > encrypted/compressed file position to

pagination search results

2009-05-11 Thread Daniel Susanto

Hi, Please help, how to do pagination search results? note: i'm using Java Server Pages. thx. Daniel Susanto http://susantodaniel.wordpress.com

RE: RegexQuery Incomplete Results

Re: RegexQuery Incomplete Results

Downloading Lucene 2.9 from Maven

Re: 'problem with indexformat and luke

Re: Stemming

Re: Boosting query - debuging

Re: Downloading Lucene 2.9 from Maven

I can't found the package org.apache.lucene.index .memory.AnalyzerUtil

thanks, i have got it!(the package :org.apache.l ucene.index.memory.AnalyzerUtil)

Re: RegexQuery Incomplete Results

[ no subject ]

how to do stemming?

Re: I can't found the package org.apache.lucene.index.memory.AnalyzerUtil

Re: Re: I can't found the package org.apache.luc ene.index.memory.AnalyzerUtil

Re: how to do stemming?

Re: Re: how to do stemming?

Re: RegexQuery Incomplete Results

IndexReader.Terms - internals

Re: IndexReader.Terms - internals

Re: RegexQuery Incomplete Results

Re: IndexReader.Terms - internals

RE: IndexReader.Terms - internals

Re: Lucene Index Encryption

Re: IndexReader.Terms - internals

Re: Lucene Index Encryption

pagination search results

26 matches

Site Navigation

Mail list logo

Footer information