Re: dataImportHandler: delta query fetching data, not just ids?

2012-03-27 Thread janne mattila
How did it work before SOLR-811 update? I don't understand. Did it fetch delta data with two queries (1. gets ids, 2. gets data per each id) or did it fetch all delta data with a single query? On Tue, Mar 27, 2012 at 5:45 PM, Ahmet Arslan wrote: >> 2. If not - what's the reason delta import is im

Re: Auto-complete phrase

2012-03-27 Thread William Bell
I am also very confused at the use case for the Suggester component. With collate on, it will try to combine random words together not the actual phrases that are there. I get better mileage out of EDGE grams and tokenize on whitespace... Left to right... Since that is how most people think. Howe

Re: StreamingUpdateSolrServer - exceptions not propagated

2012-03-27 Thread Mike Sokolov
On 3/27/2012 11:14 AM, Mark Miller wrote: On Mar 27, 2012, at 10:51 AM, Shawn Heisey wrote: On 3/26/2012 6:43 PM, Mark Miller wrote: It doesn't get thrown because that logic needs to continue - you don't necessarily want one bad document to stop all the following documents from being added.

Solr with UIMA

2012-03-27 Thread chris3001
I am having a hard time integrating UIMA with Solr. I have downloaded the Solr 3.5 dist and have it successfully running with nutch and tika on windows 7 using solrcell and curl via cygwin. To begin, I copied the 6 jars from solr/contrib/uima/lib to the working /lib in solr. Next, I read the readme

RE: how to store file path in Solr when using TikaEntityProcessor

2012-03-27 Thread ZHANG Liang F
Could you please show me how to get those values inside TikaEntityProcessor? -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: 2012年3月27日 22:43 To: solr-user@lucene.apache.org Subject: Re: how to store file path in Solr when using TikaEntityProcessor > I am using DI

Re: Why my highlights are wrong(one character offset)?

2012-03-27 Thread Koji Sekiguchi
How does your sequence field look like in schema.xml, fieldType and field? And what version are you using? koji -- Query Log Visualizer for Apache Solr http://soleami.com/ (12/03/27 13:06), neosky wrote: all of my highlights has one character mistake in the offset,some fragments from my respons

Re: Why my highlights are wrong(one character offset)?

2012-03-27 Thread neosky
My current version is solr 3.5. It should be the most updated. -- View this message in context: http://lucene.472066.n3.nabble.com/Why-my-highlights-are-wrong-one-character-offset-tp3860286p3862872.html Sent from the Solr - User mailing list archive at Nabble.com.

Unload(true) doesn't delele Index file when unloading a core

2012-03-27 Thread vybe3142
>From what I understand, isn't the index file deletion an expected result? Thanks public int drop(, boolean removeIndex) ===> removeIndex passed in as true throws Exception { String coreName = . Unload req = new Unload(removeIndex); req.setCore

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread Dyer, James
Assuming you're just using this field for spellcheck and not for queries, then it doesn't matter. But the correct way to do it is to have it in both places. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: geeky2 [mailto:gee...@hotmail.com] Se

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread geeky2
hello, should i apply the StopFilterFactory at index time or query time. right now - per the schema below - i am applying it at BOTH index time and query time. is this correct? thank you, mark // snipped from schema.xml

Re: Why my highlights are wrong(one character offset)?

2012-03-27 Thread Ahmet Arslan
Can you reproduce the problem with latest trunk? > Does anyone know it is a bug or not? > I use Ngram in my index. > > class="solr.TextField" > positionIncrementGap="100"> > > minGramSize="5" > maxGramSize="5"/> > > > > > > > > class="solr.TextField" > positionIncrementGap="100"> >

Re: Why my highlights are wrong(one character offset)?

2012-03-27 Thread neosky
Does anyone know it is a bug or not? I use Ngram in my index. ... -- View this message in context: http://lucene.472066.n3.nabble.com/Why-my-highlights-are-wrong-one-character-offset-tp3860286p3862326.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: First steps with Solr

2012-03-27 Thread Erik Hatcher
Note that the VelocityResponseWriter puts a tool in the context to escape various things. See the "Velocity Context" section here: . That'll take you to this You can do

Re: Using the ids parameter

2012-03-27 Thread Jamie Johnson
Yes, sorry for the delay, we now do q=key:("key1" "key2"...) and that works properly. On Tue, Mar 27, 2012 at 3:53 AM, Dmitry Kan wrote: > So I solved it by using key:(id1 OR ... idn). > > On Tue, Mar 27, 2012 at 9:14 AM, Dmitry Kan wrote: > >> Hi, >> >> Actually we ran into the same issue with

Auto-complete phrase

2012-03-27 Thread Rémy Loubradou
Hello, I am working on creating a auto-complete functionality for my field merchant_name present all over my documents. I am using the version 3.4 of Solr and I am trying to take advantage of the Suggester functionality. Unfortunately so far I didn't figure out how to make it works as I expected.

Re: First steps with Solr

2012-03-27 Thread Marcelo Carvalho Fernandes
I've had the same problem and my solution was to... #set($pName = "#field('name')") #set($pName = $pName.trim()) Marcelo Carvalho Fernandes +55 21 8272-7970 +55 21 2205-2786 On Mon, Mar 26, 2012 at 3:24 PM, henri.gour...@laposte.net < henri.gour...@laposte.net> wrote: > trying to play wit

Re: SolrCloud with Tomcat and external Zookeeper, does it work?

2012-03-27 Thread jerry.min...@gmail.com
Hi Vadim, I too am experimenting with SolrCloud and need help with setting it up using Tomcat as the java servlet container. While searching for help on this question, I found another thread in the solr-mailing-list that is helpful. In case you haven't seen this thread that I found, please search

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread geeky2
thank you very much for the info ;) -- View this message in context: http://lucene.472066.n3.nabble.com/preventing-words-from-being-indexed-in-spellcheck-dictionary-tp3861472p3861987.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: StreamingUpdateSolrServer - exceptions not propagated

2012-03-27 Thread Erick Erickson
https://issues.apache.org/jira/browse/SOLR-445 This JIRA reflects the slightly different case of wanting better reporting of *which* document failed in a multi-document packet, it doesn't specifically address SUSS. But it might serve to give you some ideas if you tackle this. On Tue, Mar 27, 2012

Re: StreamingUpdateSolrServer - exceptions not propagated

2012-03-27 Thread Mark Miller
On Mar 27, 2012, at 10:51 AM, Shawn Heisey wrote: > On 3/26/2012 6:43 PM, Mark Miller wrote: >> It doesn't get thrown because that logic needs to continue - you don't >> necessarily want one bad document to stop all the following documents from >> being added. So the exception is sent to that m

RE: preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread Dyer, James
If the list of words isn't very long, you can add a StopFilter to the analysis for "itemDescSpell" and put the words you don't want in the stop list. If you want to prevent low-occuring words from being sued as corrections, use the "thresholdTokenFrequency" in your spellcheck configuration. Ja

RE: possible spellcheck bug in 3.5 causing erroneous suggestions

2012-03-27 Thread Dyer, James
It might be easier to know what's going on if you provide some snippets from solrconfig.xml and schema.xml. But my guess is that in your solrconfig.xml, under the spellcheck "searchComponent" either the "queryAnalyzerFieldType" or the "fieldType" (one level down) is set to a field that is remov

Re: StreamingUpdateSolrServer - exceptions not propagated

2012-03-27 Thread Shawn Heisey
On 3/26/2012 6:43 PM, Mark Miller wrote: It doesn't get thrown because that logic needs to continue - you don't necessarily want one bad document to stop all the following documents from being added. So the exception is sent to that method with the idea that you can override and do what you wo

Re: dataImportHandler: delta query fetching data, not just ids?

2012-03-27 Thread Ahmet Arslan
> 2. If not - what's the reason delta import is implemented > like it is? > Why split it in two queries? I would think having a single > delta query > that fetches the data would be kind of an "obvious" design > unless > there's something that calls for 2 separate queries...? I think this is it? h

Re: how to store file path in Solr when using TikaEntityProcessor

2012-03-27 Thread Ahmet Arslan
> I am using DIH to index local file system. But the file > path, size and lastmodified field were not stored. in the > schema.xml I defined: > > >     indexed="true" stored="true"/> >     indexed="true" stored="true" /> >     >     indexed="true" stored="true" /> >     indexed="true" stored="t

dataImportHandler: delta query fetching data, not just ids?

2012-03-27 Thread janne mattila
It seems that delta import works in 2 steps, first query fetches the ids of the modified entries, then second query fetches the actual data. I am aware that there's a workaround: http://wiki.apache.org/

preventing words from being indexed in spellcheck dictionary?

2012-03-27 Thread geeky2
hello all, i am creating a spellcheck dictionary from the itemDescSpell field in my schema. is there a way to prevent certain words from entering the dictionary - as the dictionary is being built? thanks for any help mark // snipped from solarconfig.xml default itemDescSpell

Re: Solr cores issue

2012-03-27 Thread Erick Erickson
It might be administratively easier to have multiple webapps, but it shouldn't really matter as far as I know... Best Erick On Tue, Mar 27, 2012 at 12:22 AM, Sujatha Arun wrote: > yes ,I must have mis-copied and yes, i do have the conf folder per core > with schema etc ... > > Because of this is

Re: document inside document?

2012-03-27 Thread Erick Erickson
For your tagging, think about using multiValued="true" with an increment gap of, say, 100. Then your searches on this field can be phrase queries with a smaller slop e.g. "tall woman"~90 would match, but "purse gucci"~90 would not because "purse" and "gucci" are not within 90 tokens of each other.

Re: StreamingUpdateSolrServer - exceptions not propagated

2012-03-27 Thread Mark Miller
Like I said, you have to extend the class and override the error method. Sent from my iPhone On Mar 27, 2012, at 2:29 AM, Shawn Heisey wrote: > On 3/26/2012 10:25 PM, Shawn Heisey wrote: >> The problem is that I currently have no way (that I know of so far) to >> detect that a problem happene

CLOSE_WAIT connections

2012-03-27 Thread Bernd Fehling
Hi list, I have looked into the CLOSE_WAIT problem and created an issue with a patch to fix this. A search for CLOSE_WAIT shows that there are many Apache projects hit by this problem. https://issues.apache.org/jira/browse/SOLR-3280 Can someone recheck the patch (it belongs to SnapPuller) and

Re: Client-side failover with SolrJ

2012-03-27 Thread darul
I rediscover the world every day, thanks for this. -- View this message in context: http://lucene.472066.n3.nabble.com/Client-side-failover-with-SolrJ-tp3858461p3860700.html Sent from the Solr - User mailing list archive at Nabble.com.

how to store file path in Solr when using TikaEntityProcessor

2012-03-27 Thread ZHANG Liang F
Hi, I am using DIH to index local file system. But the file path, size and lastmodified field were not stored. in the schema.xml I defined: And also defined tika-data-config.xml:

Re: Using the ids parameter

2012-03-27 Thread Dmitry Kan
So I solved it by using key:(id1 OR ... idn). On Tue, Mar 27, 2012 at 9:14 AM, Dmitry Kan wrote: > Hi, > > Actually we ran into the same issue with using ids parameter, in the solr > front with shards architecture (exception throws in the solr front). Were > you able to solve it by using the key

Re: possible spellcheck bug in 3.5 causing erroneous suggestions

2012-03-27 Thread tom
so any one has a clue what's (might be) going wrong ? or do i have to debug and myself and post a jira issue? PS: unfortunately i cant give anyone the index for testing due to NDA. cheers On 22.03.2012 10:17, tom wrote: same On 22.03.2012 10:00, Markus Jelsma wrote: Can you try spellcheck.q