Re: Analyzer don't work with wildcard queries, snowball analyzer.

2005-03-31 Thread Morus Walter
Ernesto De Santis writes: > Hi Erik > > Ok, in PrefixQuery cases, non analyze is right. > It creates the same problems. 'example*' should find 'example' but does not if 'example' is stemmed to 'exampl' and you don't analyze the prefix query. > > You search "example" and obtain x results. > You

Re: using different analyzer for searching

2005-03-31 Thread pashupathinath
hi erik, i'm creating a blogger application, where the users can create blogs, upload pictures and post comments etc etc. i'm storing all the information using mysql database. i'm indexing the database contents and searching on this index.i'm using lucene to implement this feature. i give

Re: using different analyzer for searching

2005-03-31 Thread Erik Hatcher
On Mar 31, 2005, at 11:44 AM, pashupathinath wrote: is it possible to index using a predefined analyzer and search using a custom analyzer ?? Yes, its perfectly fine to do so with the caveat that you end up searching for the terms exactly as they were indexed. I end up doing this in most applic

Re: Analyzer don't work with wildcard queries, snowball analyzer.

2005-03-31 Thread Erik Hatcher
On Mar 31, 2005, at 12:26 PM, Ernesto De Santis wrote: Hi Erik Finally, my name spelled correctly. :)) Ok, in PrefixQuery cases, non analyze is right. But you think that non analyze in WildcardQuery is right? Do I think its right? That's just the way it is. Whether that is right or not I don

Re: HTML pages highlighter

2005-03-31 Thread Erik Hatcher
On Mar 31, 2005, at 6:36 PM, Yagnesh Shah wrote: try { fis = new FileInputStream(f); HTMLParser parser = new HTMLParser(fis); // Add the tag-stripped contents as a Reader-valued Text field so it will // get tokenized and indexed. // doc.add(new Field("contents", pa

RE: HTML pages highlighter

2005-03-31 Thread Yagnesh Shah
Hi! Eric, I have modified HTMLDocument.java try section to used doc.add(Field.Text("contents", l)); I am able to compile with following warning about depricated API. But I am still unable to see any value. compile warning # compile-demo: [javac] Compiling 1 sourc

RE: error when query contains numbers

2005-03-31 Thread Omar Didi
I did index new documents and they are fine. thanks for your help Eric. -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Thursday, March 31, 2005 5:20 PM To: java-user@lucene.apache.org Subject: Re: error when query contains numbers On Mar 31, 2005, at 5:00 PM, Omar

Re: error when query contains numbers

2005-03-31 Thread Erik Hatcher
On Mar 31, 2005, at 5:00 PM, Omar Didi wrote: CyAnalyzer: [year] [2005] [yahoo123] the analyzer I am using doesn t discard numbers but I can t find them. any ideas?? If that is the analyzer you are using for indexing and you are using a tokenized field type then those tokens will be indexed

RE: error when query contains numbers

2005-03-31 Thread Omar Didi
I have used luke to look inside the index and i haven't found any numbers except for the fields that were indexed as keywords. i have used the AnalyzerDemo class in LIA, I got the following output: Analyzing "welcome to the year 2005 yahoo123" WhitespaceAnalyzer: [welcome] [to] [the] [year

Re: Newbie question

2005-03-31 Thread Otis Gospodnetic
It's as easy as replacing a Jar, if you are upgrading from a Lucene version with compatible index format. Some versions changed the index format. See http://cvs.apache.org/viewcvs.cgi/*checkout*/jakarta-lucene/CHANGES.txt?rev=1.96.2.6 for index format change information. If the index format chan

Re: new bie ..

2005-03-31 Thread Otis Gospodnetic
Hello, For 1) you'll need to write your own Analyzer that knows that you want to be able to search on "abc", "123", and probably "abc123". WhitespaceAnalyzer just breaks input on spaces. 2) I think I answered that in the Lucene in Action forum. Otis --- pashupathinath <[EMAIL PROTECTED]> wrote

Re: Analyzer don't work with wildcard queries, snowball analyzer.

2005-03-31 Thread Ernesto De Santis
Hi Erik Ok, in PrefixQuery cases, non analyze is right. But you think that non analyze in WildcardQuery is right? You search "example" and obtain x results. You search "ex?mple" and don't obtain any result. This is correct for you? It is difficult to analyze wildcard queries in lucene code? Ernesto

using different analyzer for searching

2005-03-31 Thread pashupathinath
is it possible to index using a predefined analyzer and search using a custom analyzer ?? i'm searching using the built in whitespace analyser. the problem is when i'm searching for a part of a string the search results are zero. i'm using white space analyzer. for example if the statement is

Re: Analyzer don't work with wildcard queries, snowball analyzer.

2005-03-31 Thread Erik Hatcher
Wildcard terms simply are not analyzed. How could it be possible to do this? What if I search for "a*" - how could you stem that? Erik On Mar 31, 2005, at 9:51 AM, Ernesto De Santis wrote: Hi I get an unexpected behavior when use wildcards in my queries. I use a EnglishAnalyzer develope

Analyzer don't work with wildcard queries, snowball analyzer.

2005-03-31 Thread Ernesto De Santis
Hi I get an unexpected behavior when use wildcards in my queries. I use a EnglishAnalyzer developed with SnowballAnalyzer. version 1.1_dev from Lucene in Action lib. Analysis case: When use wildcards in the middle of one word, the word in not analyzed. Examples: QueryParser qp = new Qu

Re: pre computing possible search results narrowing and hit counts on those

2005-03-31 Thread mark harwood
As suggested before, the real killer for performance in Lucene is when you have to read a stored field. It doesn't matter how small the one field you want is, Lucene will read *all* fields for that document off the disk. If you have a large "body" field that will get read too when you try read the