Re: How to make Lucene effective for video retrieval?

2011-08-18 Thread Li Li
if there are only text information, your "video search" is just normal full text search. but I think you should consider more on ranking, facet search etc. On Fri, Aug 19, 2011 at 1:05 PM, Lei Pang wrote: > Hi everyone, I want to use Lucene to retrieve videos through their meta > data: title, de

How to make Lucene effective for video retrieval?

2011-08-18 Thread Lei Pang
Hi everyone, I want to use Lucene to retrieve videos through their meta data: title, description, tags and comments. Which query type should I use? How can I make Lucene effective? Any help would be great appreciated. Best Wishes Yours Sincerely, Pang Lei

Re: Strange change to query parser behaviour in recent versions

2011-08-18 Thread Chris Hostetter
See LUCENE-2458 for the backstory. the argument was that while phrase queries were historicly generated by the query parser when a single (white space deliminated) "chunk" of query parser input produced multiple tokens, that logic didn't make sense in CJK type langauges where whitespace is not

Multiple fields derived from same source text?

2011-08-18 Thread Graham Sugden
Hi, I am just beginning to implement text indexation for an application I am building and am not quite sure of a few things. The documents indexed will be in various languages, ranging mostly from short notes to ~20 page articles (with the occaisional book length). And so my plan is to have separa

RE: What kind of System Resources are required to index 625 million row table...???

2011-08-18 Thread Uwe Schindler
Hi, The answer is simple: It does make no sense for numeric fields to index coarser granularity in addition to finer granularity fields. Because NumericField already indexes a lot of additional terms in coarser granularity (if precisionStep = 1..63) to speed up range queries. This would simply mak

Re: What kind of System Resources are required to index 625 million row table...???

2011-08-18 Thread Erick Erickson
Uwe: Thanks, I guess my mind is still stuck on the really old versions of Solr! Quick clarification, which part "won't work"? I'm assuming it's the splitting up of the dates into year, month, and date. Or are you talking about indexing the dates with coarser granularity? Or both? Thanks again, E

RE: Strange behavior of the StandardAnalyzer

2011-08-18 Thread Alain Sahli
Hi, Yes I use the same Analyzer for both. I use (and must use) the version 2.4 of Lucene. I changed nothing to the configuration of the StandardAnalyzer. For the moment, to resolve that issue, I split the words by hyphen before I run the search... Thanks for your help, Alain --