Hi Leroy,
Would it make sense to index as Lucene documents the unit to be
searched? So if you want paragraphs to be shown in search results, you
could parse the source document during indexing into paragraphs and
index them as separate Lucene documents.
-sujit
On Wed, 2011-05-25 at 15:46 -0400,
https://issues.apache.org/jira/browse/LUCENE-1522
On Wed, May 25, 2011 at 3:46 PM, Leroy Stone wrote:
> document ("paragraphs") that contain my search phrase, rather than simply
> pointers to the whole document. in searching among applications based upon
> the Lucene, I have found only one that
Hello!
I am purchased "Lucene in Action", 2nd Ed., and posted the
question below at the Manning Forum. Mike MCCandless suggested that I
send it to you.
Thanks in advance for your attention.
the question I posted ___
I would like the search program to return with segm
Hi, I notice that there are a few run() methods in Fetcher.java and that the
following statement in Crawler.java calls the JobClient.runJob(job) in
Fetcher.java.
fetcher.fetch(segs[0], threads,
org.apache.nutch.fetcher.Fetcher.isParsing(conf));
I would like to know which run() in Fetcher.java has
thanks lan.
On Wed, May 25, 2011 at 11:44 PM, Ian Lea wrote:
> Sure. See the javadocs for IndexWriter.setMaxFieldLength or
> LimitTokenCountAnalyzer if you are using 3.1.0.
>
>
> --
> Ian.
>
>
> On Wed, May 25, 2011 at 4:24 PM, Cheng Zhou
> wrote:
> > Hi, I wonder if I can associate a text str
Sure. See the javadocs for IndexWriter.setMaxFieldLength or
LimitTokenCountAnalyzer if you are using 3.1.0.
--
Ian.
On Wed, May 25, 2011 at 4:24 PM, Cheng Zhou wrote:
> Hi, I wonder if I can associate a text string of over 5MB with a single
> field.
>
> Thanks.
>
Hi, I wonder if I can associate a text string of over 5MB with a single
field.
Thanks.
First Last wrote:
>
> Are there any other compressed bitmap index implementations which offer
> bit
> map compression at a decent performance assuming filters are sparse?
>
Have a look at EWAH by Daniel Lemire
google: http://code.google.com/p/javaewah/
http://code.google.com/p/javaewah/
resea
> In the first link you presented, there is one comment that "Note that terms
> which occur in short fields have a higher effect on the result ranking."
>
> What does "short fields" mean?
This is a short sentence.
This is a somewhat longer sentence that may get lower scores when
matched by terms
Hi lan, thanks. Still two questions.
In the first link you presented, there is one comment that "Note that terms
which occur in short fields have a higher effect on the result ranking."
What does "short fields" mean? What are the differences between the impact
of the short fields and that of the
Hi,
I'd like to make highlighting work as follows:
length(all snippits) approx. 200 chars
hl.snippits = 2 (2 snippits)
e.g. if there is only 1 snippet available, length <= 200chars
e.g. if there is >1 snippet, length each snippet == 100chars, so I
take the first 2 and get 200 chars
Is this pos
Hi,
I'd like to make highlighting work as follows:
length(all snippits) approx. 200 chars
hl.snippits = 2 (2 snippits)
e.g. if there is only 1 snippet available, length <= 200chars
e.g. if there is >1 snippet, length each snippet == 100chars, so I
take the first 2 and get 200 chars
Is this pos
> Quite a few Lucene examples on lines shows how to insert multiple fields
> into a Document and how to query the indexed file with certain fields and
> queried text. I would like to know:
>
> 1. How to do a cross-field search?
http://wiki.apache.org/lucene-java/LuceneFAQ#How_can_I_search_ov
Hi,
Quite a few Lucene examples on lines shows how to insert multiple fields
into a Document and how to query the indexed file with certain fields and
queried text. I would like to know:
1. How to do a cross-field search?
2. How to specify some key fields as well as some less i
Probably depends on what you mean by supported. If you mean messages
on this list, then yes, although be prepared for suggestions that you
upgrade. If you mean bug fixes/code changes, I'd guess not.
You really should upgrade ...
--
Ian.
On Tue, May 24, 2011 at 5:03 PM, Garry S Ditzler wrote
15 matches
Mail list logo