This is the code for searching:
String index = "index";
String field = "contents";
IndexReader reader = IndexReader.open(index);
Searcher searcher = new IndexSearcher(reader);
System.out.println("Enter query: ");
String line = ".IN.";//in jakarta regexp this is like * IN *
RegexQuery rxquery = n
The default regex package is java.util.regex and I can't see anywhere
that you tell it to use the Jakarta regexp package. So I don't think
that ".in" will match. Also, you are storing your contents field as
NOT_ANALYZED so you will need to be wary of case sensitivity. Maybe
this is what you want
Hi
I'm working on an opensrc project that needs Lucene 2.9 and uses Maven,
it would be great if Lucene 2.9 could be found automatically so that
other developers can easily run and build the project.
First of all I changed the version field from 2.4 to 2.9 in my pom.xml
but lucene couldn't find
It's not bad, the fix is simply to link your new JARs with the Luke
one. As for why you use 2.9, I don't know. It's not officially
released, but that doesn't stop a lot of people from using it. Trunk
is generally stable, so if it works for you, I wouldn't worry too much
about it.
On M
On Fri, May 08, 2009 at 08:57:59AM -0400, Matthew Hall wrote:
> process your
> words into a more base form before they go into the stemmed
Malaga (http://home.arcor.de/bjoern-beutel/malaga/) can be used to
make a program that converts words to a base form.
--
On May 10, 2009, at 5:59 AM, liat oren wrote:
The output is the following:
*finlin, score: 19.366615*
19.366615 = (MATCH) fieldWeight(worlds:6621468^3.0 in 35433),
product of:
4.2426405 = (MATCH) btq, product of:
0.70710677 = tf(phraseFreq=0.5)
6.0 = scorePayload(...)
7.3036084 = idf
Lucene's trunk build (2.9-dev) is supposed to be available, as
2.9-SNAPSHOT. But something is wrong: we seem to be building 2.9
trunk but attempting to publish it (incorrectly) as 2.4-SNAPSHOT.
But then, 2.4-SNAPSHOT is not published correctly... I only see the
maven-metadata.xml, and no artifact
Hi all,
can someone tell me where can i find the package
org.apache.lucene.index.memory.AnalyzerUtil in the lucene api?I can't found it.
many thanks.
Kamal.
--
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.or
I have found it.
thanks.
Kamal.
--
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
The RegexQuery class uses that package, and for that reason the expression
matches.
If my records contained only one word each, this code would work, but I need
to apply that regular expression to a phrase...
Ian Lea wrote:
>
> The default regex package is java.util.regex and I can't see anywh
will the anlyzer now do stemming, if i do the folow:
analyzer = new StandardAnalyzer();
analyzer=AnalyzerUtil.getPorterStemmerAnalyzer(analyzer);
thanks.
Kamal.
--
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.or
will the anlyzer now do stemming, if i do the folow:
analyzer = new StandardAnalyzer();
analyzer=AnalyzerUtil.getPorterStemmerAnalyzer(analyzer);
thanks.
Kamal.
--
--
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apa
The class is contained in
org.apache.lucene.index.memory.AnalyzerUtil
Assuming you've installed 2.4, it's in...
which is located in the contrib area. Try looking in /contrib/memory/lucene-memory-2.4.0.jar
Best
Erick
2009/5/11 Kamal Najib
> Hi all,
> can someone tell me where can i find the p
Thank you.
kamal.
Original Message:
The class is contained in
org.apache.lucene.index.memory.AnalyzerUtil
Assuming you've installed 2.4, it's in...
which is located in the contrib area. Try looking in
/contrib/memory/lucene-memory-2.4.0.jar
Best
Erick
2009/5/11 Kamal Najib
> Hi all,
> can
Yep, I reckon so.
btw a Google search for something like lucene stemming gets hits,
including a couple of articles about stemming. Might be worth a look.
--
Ian.
On Mon, May 11, 2009 at 2:08 PM, Kamal Najib wrote:
> will the anlyzer now do stemming, if i do the folow:
> analyzer = new Standar
Thank you Ian.
Kamal
Original Message:
Yep, I reckon so.
btw a Google search for something like lucene stemming gets hits,
including a couple of articles about stemming. Might be worth a look.
--
Ian.
On Mon, May 11, 2009 at 2:08 PM, Kamal Najib wrote:
> will the anlyzer now do stemming, if
The little self-contained program below runs regex queries for a few
regexps against a few phrases for both the java.util and jakarta
regexp packages.
Output when run with lucene 2.4.1 and jakarta-regexp 1.5 is
Added Knowing yourself
Added Old clinic
Added INSIDE
Added Not INSIDE
Default
regexc
IndexReader rdr = IndexReader.Open(myFolder);
TermEnum terms = rdr.Terms((new Term(myTermName, "")));
(from .NET land, but it's all the same)
This code works great, I can loop thru the terms nicely, but after it
returns all the myTermName terms, it goes into all other term
I believe not. Just get out when term.field() != myTermName, interned.
--
Ian.
On Mon, May 11, 2009 at 4:35 PM, Ian Vink wrote:
> IndexReader rdr = IndexReader.Open(myFolder);
> TermEnum terms = rdr.Terms((new Term(myTermName, "")));
>
> (from .NET land, but it's all the
That's it!!!
The problem was with the regular expression, the one I need is ".*IN"!!
Thank you so much, I was turning mad... =)
Ian Lea wrote:
>
> The little self-contained program below runs regex queries for a few
> regexps against a few phrases for both the java.util and jakarta
> regexp p
Hi,
We noticed this behaviour also, so we do like this :
Map result = new HashMap();
TermEnum all;
if(matcher.fullScan()) {
all = reader.terms(new Term(field));
} else {
all = reader.terms(new Term(field, matcher.prefix()));
}
if(all == null) return result;
Term t;
do {
t = a
No, there is no other way to do this. And if you think, the TermEnum takes
too much RAM when returning all terms and also from different, you can be
sure, that there is no wasted memory, as the term enum does not allocate the
whole terms (like normal Java iterators). The term enum is iterated on di
On Mon, May 11, 2009 at 12:19 AM, Andrzej Bialecki wrote:
>
> Unfortunately, current Lucene IndexWriter implementation uses seek /
> overwrite when writing term info dictionary. This is described in more
> detail here:
>
> https://issues.apache.org/jira/browse/LUCENE-532
>
Thanks for the enlight
Thanks guys,
Here's what I built:
http://BahaiResearch.com
It allows any language speaker to read about another person's religion in
any language. Helps promote unity in diversity. It's open source.
Ian
On Mon, May 11, 2009 at 1:39 PM, Uwe Schindler wrote:
> No, there is no other way to do
On Mon, May 11, 2009 at 2:06 PM, Babak Farhang wrote:
> I am not familiar with the details of CFS, but I didn't interpret
> Michael's comment to mean that there is actually any rewriting going
> on here. The problem here appears to be one of translating the
> encrypted/compressed file position to
Hi,
Please help, how to do pagination search results?
note: i'm using Java Server Pages.
thx.
Daniel Susanto
http://susantodaniel.wordpress.com
26 matches
Mail list logo