Re: Passage retrieval with Lucene-based application

2011-05-25 Thread Sujit Pal
Hi Leroy, Would it make sense to index the unit to be searched as a Lucene document? So if you want paragraphs to be shown in search results, you could parse the source document into paragraphs during indexing and index each one as a separate Lucene document. -sujit On Wed, 2011-05-25 at 15:46 -0400,
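
The paragraph-per-document idea above can be sketched without any Lucene calls. The following is a minimal illustration only, assuming paragraphs are separated by blank lines; the class name `ParagraphSplitter` and its `split` method are hypothetical and not part of Lucene. Each returned string would then be added to the index as its own document, typically alongside a stored field identifying the parent file.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: splits source text into paragraph-sized units,
// so that each unit can be indexed as a separate document.
final class ParagraphSplitter {

    // Assumption: a paragraph boundary is one or more blank lines.
    static List<String> split(String text) {
        List<String> paragraphs = new ArrayList<>();
        // \R matches any line break; two of them with optional
        // whitespace in between mark a blank-line boundary.
        for (String chunk : text.split("\\R\\s*\\R")) {
            String trimmed = chunk.trim();
            if (!trimmed.isEmpty()) {
                paragraphs.add(trimmed);
            }
        }
        return paragraphs;
    }

    public static void main(String[] args) {
        String source = "First paragraph.\n\nSecond paragraph,\nstill second.\n\n\nThird.";
        List<String> units = split(source);
        for (int i = 0; i < units.size(); i++) {
            // In a real indexer, each unit would become one document here.
            System.out.println("unit " + i + ": " + units.get(i));
        }
    }
}
```

A hit on one of these units then points at the matching paragraph rather than at the whole source file.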

Re: Passage retrieval with Lucene-based application

2011-05-25 Thread Shashi Kant
https://issues.apache.org/jira/browse/LUCENE-1522 On Wed, May 25, 2011 at 3:46 PM, Leroy Stone wrote: > document ("paragraphs") that contain my search phrase, rather than simply > pointers to the whole document. In searching among applications based upon > Lucene, I have found only one that

Passage retrieval with Lucene-based application

2011-05-25 Thread Leroy Stone
Hello! I purchased "Lucene in Action", 2nd Ed., and posted the question below at the Manning Forum. Mike McCandless suggested that I send it to you. Thanks in advance for your attention. the question I posted ___ I would like the search program to return with segm

JobClient.runJob(job) in Fetcher.java

2011-05-25 Thread Cheng
Hi, I notice that there are a few run() methods in Fetcher.java and that the following statement in Crawler.java calls the JobClient.runJob(job) in Fetcher.java. fetcher.fetch(segs[0], threads, org.apache.nutch.fetcher.Fetcher.isParsing(conf)); I would like to know which run() in Fetcher.java has

Re: Is there a limit on the size of the text for a single field?

2011-05-25 Thread Cheng Zhou
Thanks, Ian. On Wed, May 25, 2011 at 11:44 PM, Ian Lea wrote: > Sure. See the javadocs for IndexWriter.setMaxFieldLength or > LimitTokenCountAnalyzer if you are using 3.1.0. > > > -- > Ian. > > > On Wed, May 25, 2011 at 4:24 PM, Cheng Zhou > wrote: > > Hi, I wonder if I can associate a text str

Re: Is there a limit on the size of the text for a single field?

2011-05-25 Thread Ian Lea
Sure. See the javadocs for IndexWriter.setMaxFieldLength or LimitTokenCountAnalyzer if you are using 3.1.0. -- Ian. On Wed, May 25, 2011 at 4:24 PM, Cheng Zhou wrote: > Hi, I wonder if I can associate a text string of over 5MB with a single > field. > > Thanks. >
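
Both settings named above cap a field by number of tokens, not by bytes, so a 5MB string can be indexed but only its first N tokens are kept once a limit applies (IndexWriter's historical default was 10,000 terms per field). The effect can be illustrated with plain Java. This is a sketch of the concept only, assuming whitespace tokenization; `TokenLimitSketch` and `limitTokens` are hypothetical names and not Lucene API.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of a per-field token cap:
// tokens beyond the limit are silently dropped.
final class TokenLimitSketch {

    // Assumption: tokens are whitespace-separated words.
    static List<String> limitTokens(String text, int maxTokens) {
        List<String> kept = new ArrayList<>();
        for (String token : text.trim().split("\\s+")) {
            if (kept.size() >= maxTokens) {
                break; // everything after this point would not be searchable
            }
            if (!token.isEmpty()) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        String fieldText = "alpha beta gamma delta epsilon";
        // With a cap of 3, "delta" and "epsilon" never reach the index.
        System.out.println(limitTokens(fieldText, 3));
    }
}
```

The practical consequence is that terms near the end of a very long field may not match any query unless the limit is raised.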

Is there a limit on the size of the text for a single field?

2011-05-25 Thread Cheng Zhou
Hi, I wonder if I can associate a text string of over 5MB with a single field. Thanks.

Re: is OpenBitSet / SortedVIntList compressed bit map index?

2011-05-25 Thread ai114
First Last wrote: > > Are there any other compressed bitmap index implementations which offer > bit > map compression at a decent performance assuming filters are sparse? > Have a look at EWAH by Daniel Lemire google: http://code.google.com/p/javaewah/ resea

Re: how to search multiple fields

2011-05-25 Thread Ian Lea
> In the first link you presented, there is one comment that "Note that terms > which occur in short fields have a higher effect on the result ranking." > > What does "short fields" mean? This is a short sentence. This is a somewhat longer sentence that may get lower scores when matched by terms
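
The "short fields" effect comes from length normalization. In the Lucene 2.x/3.x DefaultSimilarity, the length norm of a field is 1/sqrt(number of terms), so the same match counts for more in a shorter field. The sketch below reproduces only that formula in plain Java; the class and method names are hypothetical, and real Lucene also stores the norm in a single byte, which loses precision.

```java
// Hypothetical illustration of length normalization:
// fewer terms in a field means a larger norm, hence a higher score.
final class LengthNormSketch {

    // Mirrors the 1/sqrt(numTerms) formula; not a Lucene call.
    static double lengthNorm(int numTerms) {
        return 1.0 / Math.sqrt(numTerms);
    }

    public static void main(String[] args) {
        int shortField = "This is a short sentence".split("\\s+").length;
        int longField = 16; // assumed term count for a longer field
        System.out.println("short field norm: " + lengthNorm(shortField));
        System.out.println("long field norm:  " + lengthNorm(longField));
    }
}
```

A term matched in the five-term field therefore gets a larger multiplier than the same term matched in the sixteen-term field, all else being equal.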

Re: how to search multiple fields

2011-05-25 Thread Cheng Zhou
Hi Ian, thanks. Still two questions. In the first link you presented, there is one comment that "Note that terms which occur in short fields have a higher effect on the result ranking." What does "short fields" mean? What are the differences between the impact of the short fields and that of the

dynamic frag size - highlighter

2011-05-25 Thread dan sutton
Hi, I'd like to make highlighting work as follows: length(all snippets) approx. 200 chars hl.snippets = 2 (2 snippets) e.g. if there is only 1 snippet available, length <= 200 chars e.g. if there is >1 snippet, length each snippet == 100 chars, so I take the first 2 and get 200 chars Is this pos

Re: how to search multiple fields

2011-05-25 Thread Ian Lea
> Quite a few Lucene examples online show how to insert multiple fields > into a Document and how to query the indexed file with certain fields and > queried text. I would like to know: > > 1. How to do a cross-field search? http://wiki.apache.org/lucene-java/LuceneFAQ#How_can_I_search_ov

how to search multiple fields

2011-05-25 Thread zhoucheng2008
Hi, Quite a few Lucene examples online show how to insert multiple fields into a Document and how to query the indexed file with certain fields and queried text. I would like to know: 1. How to do a cross-field search? 2. How to specify some key fields as well as some less i

Re: Apache Lucene v2.3.2

2011-05-25 Thread Ian Lea
Probably depends on what you mean by supported. If you mean messages on this list, then yes, although be prepared for suggestions that you upgrade. If you mean bug fixes/code changes, I'd guess not. You really should upgrade ... -- Ian. On Tue, May 24, 2011 at 5:03 PM, Garry S Ditzler wrote