Re: [ANNOUNCE] Web Crawler

2011-05-27 Thread Dominique Bejean
Hi, Sorry for the delay, but I haven't been checking the mailing list for a long time. Crawl-anywhere includes 3 piece of software : a crawler, a pipeline and a solr indexer. There is a default Solr schema used by Crawl-anywhere, tested with Solr 1.4.1 and Solr 3.1.0. But, you can config

Re: FastVectorHighlighter.getBestFragments returning null

2011-05-27 Thread Koji Sekiguchi
(11/05/27 19:57), Joel Halbert wrote: Hi, I'm using Lucene 3.0.3. I'm extracting snippets using FastVectorHighlighter, for some snippets (I think always when searching for exact matches, quoted) the fragment is null. Code looks like: query = QueryParser.escape(query);

Re: FastVectorHighlighter.getBestFragments returning null

2011-05-27 Thread Koji Sekiguchi
(11/05/27 20:56), Pierre GOSSE wrote: Hi, Maybe is it related to : https://issues.apache.org/jira/browse/LUCENE-3087 No, because Joel's problem is FastVectorHighlighter, but LUCENE-3087 is for Highlighter. koji -- http://www.rondhuit.com/en/ --

RE: FastVectorHighlighter.getBestFragments returning null

2011-05-27 Thread Pierre GOSSE
Actually, this second issue was opened since Highlight seams to ignore positions and treats WITH_POSITIONS_OFFSETS like it was WITH_OFFSETS. https://issues.apache.org/jira/browse/LUCENE-3091 As far as I remember, the trouble is that to trust positions in the tokenstream built from termvector, y

RE: FastVectorHighlighter.getBestFragments returning null

2011-05-27 Thread Joel Halbert
Hi Pierre, Thanks for the pointer. So if I understand correctly this bug definitely applies to fields with TermVector.WITH_OFFSETS. My field uses TermVector.WITH_POSITIONS_OFFSETS) I wasn't sure from the bug report if it applies to WITH_POSITIONS_OFFSETS as well? It looks like it might? - Joel

RE: FastVectorHighlighter.getBestFragments returning null

2011-05-27 Thread Pierre GOSSE
Hi, Maybe is it related to : https://issues.apache.org/jira/browse/LUCENE-3087 Pierre -Message d'origine- De : Joel Halbert [mailto:j...@su3analytics.com] Envoyé : vendredi 27 mai 2011 12:57 À : lucene users Objet : FastVectorHighlighter.getBestFragments returning null Hi, I'm using L

FastVectorHighlighter.getBestFragments returning null

2011-05-27 Thread Joel Halbert
Hi, I'm using Lucene 3.0.3. I'm extracting snippets using FastVectorHighlighter, for some snippets (I think always when searching for exact matches, quoted) the fragment is null. Code looks like: query = QueryParser.escape(query); if (exact) {