Job Opening for Technical Operations Manager / Systems Engineer

2005-06-23 Thread jobs
JOB TITLE: Technical Operations Manager / Systems Engineer LOCATION: Mountain View, California, USA Want to build the next generation of search from scratch? In this role as Technical Operations Manager / System Engineer, you will be asked to hit the ground running to build and scale a technica

Re: weight score based on a fields value

2005-06-23 Thread Chris Hostetter
Your question can be read two ways... 1) score documents in which field "FFF" has value "VVV" slightly higher then documents in which field FFF does not contain value "VVV". (or any value) 2) score documents in which field "FFF" has some numeric value in such a way that the score of hte

Re: Updateing Documents:

2005-06-23 Thread Chris Hostetter
: 3. Does anyone know a project based on lucene that offers a complete : solution for a Intranetsearch? nutch... http://lucene.apache.org/nutch/ -Hoss - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-ma

Re: Is a field in use?

2005-06-23 Thread Avi Drissman
On Jun 23, 2005, at 2:37 PM, Dan Armbrust wrote: In my indexes where the available fields vary by document, I maintain an additional field that lists out what fields are in used per document. Excellent idea! Thanks. Avi -- Avi 'rlwimi' Drissman [EMAIL PROTECTED] Argh! This darn mail serve

Re: Is a field in use?

2005-06-23 Thread Dan Armbrust
In my indexes where the available fields vary by document, I maintain an additional field that lists out what fields are in used per document. That way, I can query for all documents that contain field "foo", or all documents that contain a field "foo", and don't contain "bar"... etc. Avi

Re: Is a field in use?

2005-06-23 Thread Avi Drissman
On Jun 23, 2005, at 2:19 PM, Daniel Naber wrote: Maybe IndexReader.getFieldNames()? But will that not list fields that were in use at one time but are no longer? Avi -- Avi 'rlwimi' Drissman [EMAIL PROTECTED] Argh! This darn mail server is trunca -

Re: Is a field in use?

2005-06-23 Thread Daniel Naber
On Thursday 23 June 2005 17:09, Avi Drissman wrote: > Suppose I have an index with a varying number of fields in a   > document. Some documents have some fields, and some documents have   > others. Is there a quick way to query the index to see if a   > particular field name is in use at all by an

Re: getting text-snippets

2005-06-23 Thread Erik Hatcher
On Jun 23, 2005, at 10:17 AM, Ulrich Schinz wrote: Field.Text(String, Reader) is not a stored field. This is why doc.get("contents") is empty. ok, i read that in javadoc of lucene... in dont understand what Field.Text(String,Reader,boolean) does... if i set boolean to true, what is the

Re: How lucene and nucth work together?

2005-06-23 Thread Andrzej Bialecki
Giovanni Dima wrote: Andrzej, I'm so sorry about still disturbing you! I followed your suggest and i have found this code: try { searcher = new IndexSearcher( IndexReader.open(indexName) //create an indexSearcher for our page ); } catch (Exception e) { IndexSearcher is a class of Lucene API,

Re:Re:How lucene and nucth work together?

2005-06-23 Thread Giovanni Dima
Andrzej, I'm so sorry about still disturbing you! I followed your suggest and i have found this code: try { searcher = new IndexSearcher( IndexReader.open(indexName) //create an indexSearcher for our page ); } catch (Exception e

Is a field in use?

2005-06-23 Thread Avi Drissman
Suppose I have an index with a varying number of fields in a document. Some documents have some fields, and some documents have others. Is there a quick way to query the index to see if a particular field name is in use at all by any document in the index? For example, if document 1 has fie

Re: getting text-snippets

2005-06-23 Thread Ulrich Schinz
Field.Text(String, Reader) is not a stored field. This is why doc.get("contents") is empty. ok, i read that in javadoc of lucene... in dont understand what Field.Text(String,Reader,boolean) does... if i set boolean to true, what is the stortermvector?? You have some options... change t

Re: getting text-snippets

2005-06-23 Thread Erik Hatcher
On Jun 23, 2005, at 6:41 AM, Ulrich Schinz wrote: hi there! first of all: im new here in the list, my name is uli. hello to all ! im quite new in using lucene. i created different indices, some with GermanAnalyzer some with StandardAnalyzer... i added Fields to my Documents with doc.add(Fiel

RE: Weird Problem with Lucene

2005-06-23 Thread Urs Eichmann
First, sorry for the double post. I had problems with subscribing to the mailing list and thought my first message didn't go through. Thanks you Chris and the others for your valuable tips. It was indeed a problem with the Analyzer. I used the SimpleAnalyzer and thought from the doc's that it will

getting text-snippets

2005-06-23 Thread Ulrich Schinz
hi there! first of all: im new here in the list, my name is uli. hello to all ! im quite new in using lucene. i created different indices, some with GermanAnalyzer some with StandardAnalyzer... i added Fields to my Documents with doc.add(Field.Text("contents",new FileReader(f)); and doc.add(

Re: Question for Wildcard Search:

2005-06-23 Thread Volodymyr Bychkoviak
Hello about 3 months ago I posted some idea about wildcard searching. main idea was to index every character of input as separate term. and then search using PhraseQuery. for example word "12345" would be indexed as "1" "2" "3" "4" "5". to find "*23*" you can use PhraseQuery with this two term

Re: lucene highlighter

2005-06-23 Thread mark harwood
Hi Johan, To avoid selecting fragments see here: http://www.mail-archive.com/java-user@lucene.apache.org/msg01097.html Be aware though that the highlighter is really designed to decorate plain-text by adding highlight tags - if your text already includes any HTML mark-up it becomes hard to correct

Re: Question for Wildcard Search:

2005-06-23 Thread Dave Kor
Quoting Dave Kor <[EMAIL PROTECTED]>: > Quoting Erik Hatcher <[EMAIL PROTECTED]>: > > > Anyone tried this technique with Lucene? > > Actually, the problem is that the wildcard code has to search over a large > subset of terms because the list of terms is, well, a linear structure. > > If, for exam

lucene highlighter

2005-06-23 Thread johan duflost
Dear list, I try to use the Term Highlighter in my webapp but I have a problem. I want to highlight the terms in a text without extracting the most relevant sections. The highlighting works but the last characters are trimmed ! Here is a portion of my code : Analyzer analyzer = new Standa

Re: Question for Wildcard Search:

2005-06-23 Thread Dave Kor
Quoting Erik Hatcher <[EMAIL PROTECTED]>: > Anyone tried this technique with Lucene? Actually, the problem is that the wildcard code has to search over a large subset of terms because the list of terms is, well, a linear structure. If, for example, all terms in the index is arranged as a suffix