RE: Lucene or DB?

2005-11-02 Thread Pasha Bizhan
o you need a search? What is your typical search query? What are you going to do with the search results, and how are you going to use them? Also, what are your hardware requirements? Do you already have a database administrator? And so on. Pasha Bizhan

RE: Wildcard

2005-12-02 Thread Pasha Bizhan
/www.mail-archive.com/lucene-user@jakarta.apache.org/msg12254.html
and http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg12268.html
> If I have an item 108585-123
> And another 332323-123
> How can I look for all the -123 family of it

RE: Boolean Query

2005-12-28 Thread Pasha Bizhan
(t2 OR t3) now. BooleanQuery q2 = new BooleanQuery(); q2.add(t1, true, false); q2.add(q1, true, false); q2 is (t1 AND q1), I guess. Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: best practices

2006-01-17 Thread Pasha Bizhan
org/jakarta-lucene/HowTo Pasha Bizhan

RE: Duplicates recods in index

2006-02-08 Thread Pasha Bizhan
re Just perform the search with the given id and name values: String query = "+id:(" + doc.get("id") + ") +name:(" + doc.get("name") + ")"; Pasha Bizhan
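A minimal sketch of building the query string above, in plain Java with no Lucene dependency. The class name `DuplicateQuery` is hypothetical, and a `Map` stands in for the stored fields of the document being checked; neither comes from the original message. Values that contain query-syntax characters would need escaping before being embedded, which this sketch does not attempt.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: a Map stands in for the stored fields of a document.
final class DuplicateQuery {

    // Builds "+id:(...) +name:(...)". The leading '+' marks each clause as
    // required, so only documents matching BOTH values are returned.
    static String build(Map<String, String> doc) {
        return "+id:(" + doc.get("id") + ") +name:(" + doc.get("name") + ")";
    }

    public static void main(String[] args) {
        Map<String, String> doc = new LinkedHashMap<>();
        doc.put("id", "42");
        doc.put("name", "widget");
        System.out.println(build(doc)); // prints +id:(42) +name:(widget)
    }
}
```

If the search for this string returns any hit, the document is presumably already in the index and can be skipped.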

RE: Duplicates recods in index

2006-02-09 Thread Pasha Bizhan
o not need to reopen the index for every single document if the new data doesn't contain dupes. In any case, we use the following algorithm:
- remove dupes from the new data (it's possible)
- open writer, open reader/searcher
- use reader/searcher to check for dupes
- use writer to add new docume
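The steps above can be sketched in plain Java. This is an illustration under stated assumptions, not the poster's code: a `Set` of keys stands in for the index, and the name `DedupBatch.addNew` is hypothetical. The point it shows is that once the incoming batch is free of internal duplicates, one snapshot of the existing index is enough for the whole batch.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in: 'index' plays the role of the existing index,
// identified only by document keys.
final class DedupBatch {

    // Returns the keys that were actually added to the index.
    static List<String> addNew(Set<String> index, List<String> incoming) {
        // Step 1: remove duplicates inside the new data (order preserved).
        Set<String> batch = new LinkedHashSet<>(incoming);

        // Steps 2-3: check every new key against the index as it was
        // when the batch started (the "reader" view).
        List<String> added = new ArrayList<>();
        for (String key : batch) {
            if (!index.contains(key)) {
                added.add(key);
            }
        }

        // Step 4: add the surviving keys in one pass (the "writer" side).
        index.addAll(added);
        return added;
    }
}
```

For example, with an index holding `a` and `b` and incoming keys `b, c, c, d`, only `c` and `d` are added.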

RE: Question

2006-03-07 Thread Pasha Bizhan
ist of
> 10 possible words in term "name" starting with "Ab*". So I
> don't need the whole document and I need this information really fast.
It was implemented by David Spencer. See http://searchmorph.com/experiments.
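One way such a prefix lookup could work, sketched in plain Java. This is not the referenced implementation: a sorted `TreeSet` stands in for the index's sorted term dictionary, and the name `PrefixTerms.first` is hypothetical. Because the terms are sorted, the scan can start at the prefix and stop at the first term that no longer matches, so no documents are loaded.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

// Hypothetical stand-in for a sorted term dictionary of one field.
final class PrefixTerms {

    // Returns at most 'limit' terms that start with 'prefix', in sorted order.
    static List<String> first(TreeSet<String> terms, String prefix, int limit) {
        List<String> out = new ArrayList<>();
        // tailSet(prefix) begins at the first term >= prefix.
        for (String term : terms.tailSet(prefix)) {
            // Stop once enough terms are collected or the prefix no longer matches.
            if (out.size() >= limit || !term.startsWith(prefix)) {
                break;
            }
            out.add(term);
        }
        return out;
    }
}
```

With the terms `Aaron, Abby, Abel, Abigail, Adam`, asking for prefix `Ab` with a limit of 10 returns `Abby, Abel, Abigail`.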

RE: Question

2006-03-07 Thread Pasha Bizhan
or use web.archive: http://web.archive.org/web/20050306065912/http://www.searchmorph.com/weblog/index.php?id=26 Pasha Bizhan

RE: Lucene Scoring

2006-03-08 Thread Pasha Bizhan
://lucene.apache.org/java/docs/api/org/apache/lucene/search/Similarity.html and "Lucene Book: 3.3: Understanding Lucene scoring" http://lucenebook.com/search?query=scoring Pasha Bizhan

RE: TooManyClauses exception in Lucene (1.4)

2006-03-16 Thread Pasha Bizhan
RangeQuery allows the results to be highlighted. A Filter does not. Pasha Bizhan

RE: Lucene Seaches VS. Relational database Queries

2006-04-13 Thread Pasha Bizhan
ry. For example, the query: select titles, year, etc from titles where 'Assistant Producer' like '%%. is a full-text search query, and Lucene can help you in this case. Pasha Bizhan

RE: return document name as null Please help

2006-05-30 Thread Pasha Bizhan
Stored (Store.YES)? Pasha Bizhan

RE: return document name as null Please help

2006-05-30 Thread Pasha Bizhan
Hi,
> From: Amaresh Kumar Yadav [mailto:[EMAIL PROTECTED]
>
> do we need some setting in any jsp or other file for document ???
You need to specify the Stored (Store.YES) attribute for the field during indexing. Pas

RE: return document name as null Please help

2006-05-30 Thread Pasha Bizhan
Field(fieldName, fieldValue, Field.Store.YES, Field.Index.UN_TOKENIZED)); Or could you please provide your source code? How do you index your data? Pasha Bizhan

RE: return document name as null Please help

2006-05-30 Thread Pasha Bizhan
l) || doctitle.equals("")) //use the url if it has no title ... Also, use Luke to inspect your index. Pasha Bizhan

RE: how to craete index with particular ID

2006-05-31 Thread Pasha Bizhan
on title of document which is stored in
> the text file.
>
> After searching I want to display the corresponding path from the index file.
>
> How should I create such an index file?
>
> How should I proceed?
See http://wiki.apache.org/jakarta-lucene

RE: Problems indexing large documents

2006-06-09 Thread Pasha Bizhan
a/docs/api/org/apache/lucene/index/IndexWriter.html#setMaxFieldLength(int) Pasha Bizhan

RE: lucene based frameworks/servers: solr, nutch, compass - which one is for what????

2006-09-20 Thread Pasha Bizhan
: http://www.amazon.com/Building-Search-Applications-Lucene-Nutch/dp/1590596870/sr=8-2/qid=1158770477/ref=pd_bbs_2/002-1809958-7770417?ie=UTF8&s=books
http://apress.com/book/bookDisplay.html?bID=10125
Pasha Bizhan

RE: new added documents not showing

2005-03-17 Thread Pasha Bizhan
xReader. Or reopen the IndexReader. See TestIndexReader.java: private void deleteReaderWriterConflict(boolean optimize) throws IOException for more info. Pasha Bizhan http://lucenedotnet.com

RE: new added documents not showing

2005-03-21 Thread Pasha Bizhan
Hi,
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
> I just realized that the way I described our process was off
> a little bit.
>
> Here's the process again:
>
> I apologize for the amount of code below.
When do you open the index writer? Where is th

RE: new added documents not showing

2005-03-21 Thread Pasha Bizhan
r.close(); } And then you execute the search (using mainindex) and you don't see the new documents, yes? Pasha Bizhan http://lucenedotnet.com

RE: How to include a multi-word synonym to a word when indexing?

2005-04-11 Thread Pasha Bizhan
crement of them to zero so that they overlay the
> original term.
What about putting all the synonyms into the index? Foo Bar Corp, FooBar Corp, FooBarCorp, cyber cafe, cybercafe, etc.? In that case we do not need to analyze the input query. Pasha Bizhan

RE: getting the number of occurrences within a document

2005-04-14 Thread Pasha Bizhan
> Is it possible to do that?
See IndexReader.TermDocs(Term t) and TermDocs.Freq(). Pasha Bizhan

RE: Wildcard searching with Highlight support ?

2005-04-19 Thread Pasha Bizhan
Hi,
> From: Eric Chow [mailto:[EMAIL PROTECTED]
>
> I mean if I use wildcard query, it cannot highlight any terms ?
>
> Any idea to do this or any existing example ?
Try rewriting the query before highlighting.

RE: Search Theory Book

2005-05-12 Thread Pasha Bizhan
Hi, Managing Gigabytes
http://www.amazon.com/exec/obidos/tg/detail/-/1558605703/qid=1115898416/sr=8-1/ref=pd_csp_1/104-0210366-8377506?v=glance&s=books&n=507846
Pasha Bizhan http://lucenedotnet.com
> -Original Message-
> From: Anna Bing [mailto:[EMAIL PROTECTED]
> Se

RE: Displaying relevant text with Search results

2005-06-13 Thread Pasha Bizhan
> give any help is achieving this?
You need the Highlighter package.
http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/contrib/highlighter/src/java/org/apache/lucene/search/highlight/package.html?view=markup
http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/c

RE: Indexes auto creation

2005-06-13 Thread Pasha Bizhan
createIndex = !IndexReader.indexExists(directory); writer = new IndexWriter(directory, analyzer, createIndex); Pasha Bizhan http://lucenedotnet.com

RE: Document ID

2005-06-25 Thread Pasha Bizhan
ermFreqVector(document)? reader.delete(document); Etc. This is not a complaint; I just don't know a good unified solution for this problem :( Pasha Bizhan

RE: Extending 'sealed' classes & usage in comercial solution.

2005-10-25 Thread Pasha Bizhan
See http://apache.org/foundation/licence-FAQ.html for more details. Pasha Bizhan