Re: Field for 'species' data?

2014-07-03 Thread Gora Mohanty
On 3 July 2014 21:40, Dan Bolser wrote: > > Hi, > > Does anyone on the list have experience with hierarchical facets, > specifically for species data? [...] Maybe not specifically for species data, but hierarchical faceting works pretty well with Solr. Please see http://wiki.apache.org/solr/Hiera

How to get related facets using Solr query ?

2014-07-03 Thread Shamik Bandopadhyay
Hi, I've trying construct a facet query to organize related facets in the response. Let me illustrate a sample. Let's say I've the following documents indexed in Solr. 1. Doc A --> Facet:AB Facet:MNO 2. Doc B --> Facet:CD

Passing Solr data directory to DataImportHandler?

2014-07-03 Thread Brian Panulla
I'm trying to use the new propertyWriter configuration directive for DataImportHandler ( https://wiki.apache.org/solr/DataImportHandler#Configuring_The_Property_Writer). I'd like to tell Solr to write the dataimport.properties to the data directory for the corresponding Core on our multicore server

Re: java.net.SocketException: Connection reset

2014-07-03 Thread heaven
Hello, usually the loading is not high at all: We're using bundled jetty and writing in batches by 50-100 documents and only using soft and auto commits. About clients, we have 4 processes and each could run up to 5 threads. At

Re: Listening on SolrCloud events

2014-07-03 Thread Jeff Wartes
If you¹re using SolrJ, CloudSolrServer exposes the information you need directly, although you¹d have to poll it for changes. Specifically, this code path will get you a snapshot of the clusterstate: http://lucene.apache.org/solr/4_5_0/solr-solrj/org/apache/solr/client/solrj /impl/CloudSolrServer.h

Re: Confusion about location of + and - ?

2014-07-03 Thread Jack Krupansky
I gave the relevant Solr Jiras on a previous fork of this thread. Here is the most relevant current Lucene Jira: https://issues.apache.org/jira/browse/LUCENE-3451 "Remove special handling of pure negative Filters in BooleanFilter, disallow pure negative queries in BooleanQuery" That patch is s

Re: java.net.SocketException: Connection reset

2014-07-03 Thread Michael Della Bitta
What's the %system load on your nodes? What servlet container are you using? Are you writing a single document per update, or in batches? How many clients are attached to your cloud? Michael Della Bitta Applications Developer o: +1 646 532 3062 appinions inc. “The Science of Influence Market

Re: Solr Map Reduce Indexer Tool GoLive to SolrCloud with index on local file system

2014-07-03 Thread Erick Erickson
Hmmm, interesting, I actually hadn't thought of doing it that way. I don't know the internals well enough to comment on it but I do know someone who does. I'll check with them Erick On Thu, Jul 3, 2014 at 9:18 AM, Tom Chen wrote: > Hi, > > In the GoLive stage, the MRIT sends the MERGEINDEXES

Re: Strategy for removing an active shard from zookeeper

2014-07-03 Thread Jeff Wartes
To expand on that, the Collections API DELETEREPLICA command is availible in Solr >= 4.6, but will not have the ability wipe the disk until Solr 4.10. Note that whether or not it deletes anything from disk, DELETEREPLICA will remove that replica from your cluster state in ZK, so even in 4.10, reb

Re: Disable Regular Expression Support

2014-07-03 Thread Jack Krupansky
Unfortunately, not - the syntax is hard-wired into the grammar. Feel free to file a Jira though. I would be in favor of having a query parser config option to disable features like regex and leading wildcard as well. -- Jack Krupansky -Original Message- From: Markus Schuch Sent: Th

Disable Regular Expression Support

2014-07-03 Thread Markus Schuch
Hi Solr Community, we migrate from solr 1.4 to 4.3 and found out, that solr 4.x invented regular expression support for the query parser. Is it possible to disable this feature to get back to the 1.4 behavior of the query parser? Many thanks in advance, Markus Schuch

RE: Disable Regular Expression Support

2014-07-03 Thread Markus Jelsma
Hi, you can escape the surrounding slashes in your front-end. Markus -Original message- > From:Markus Schuch > Sent: Thursday 3rd July 2014 20:53 > To: solr-user@lucene.apache.org > Subject: Disable Regular Expression Support > > Hi Solr Community, > > we migrate from solr 1.4 to 4.

java.net.SocketException: Connection reset

2014-07-03 Thread heaven
Hi, trying DigitalOcean for Solr, everything seems well, except sometimes I see these errors: java.net.SocketException: Connection reset at java.net.SocketInputStream.read(SocketInputStream.java:196) at java.net.SocketInputStream.read(SocketInputStream.java:122) at org.apach

Re: SolrCloud copy the index to another cluster.

2014-07-03 Thread heaven
Hi, sorry for the delay. Yes, we thought to simply copy the index over but this sounds risky and time consuming. Our index is too big to copy it over the internet quickly. We decided to re-index our data and then switch and re-index again. It's a pity there's no way to do this like with mysql :)

Re: Restriction on type of uniqueKey field?

2014-07-03 Thread Shawn Heisey
On 7/1/2014 8:43 AM, Koji Sekiguchi wrote: > In addition, KeywordTokenizer can be seemingly used but it should be > avoided > for unique key field. One of my customers that used it and they had got OOM > during a long term indexing. As it was difficult to find the problem, > I'd like to share my ex

Re: Confusion about location of + and - ?

2014-07-03 Thread Shawn Heisey
On 7/1/2014 2:06 PM, Brett Hoerner wrote: > Also, does anyone have the Solr or Lucene bug # for this? It's not actually a bug. It's an inherent truth -- you can't subtract from nothing. You must tell Lucene what to start with before you can subtract the 123456789 value. The fact that a bare que

Re: Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
Aha, you are right wrdrvf! The query is forwarded to any of the active shards (I saw the query alternate between both of mine). Nice feature. Also, looking at 'ClusterStateAwarePingRequestHandler' (which I downloaded from www.manning.com/SolrinAction), it is checking zookeeper to see if the logical

Re: MLT weird behaviour in Solrcloud

2014-07-03 Thread shamik
Anyone ? -- View this message in context: http://lucene.472066.n3.nabble.com/MLT-weird-behaviour-in-Solrcloud-tp4145066p4145502.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Listening on SolrCloud events

2014-07-03 Thread Shawn Heisey
On 7/3/2014 7:49 AM, Ugo Matrangolo wrote: > I would like to be informed as soon as a cluster event happens like a node > dropping and/or starting a recovery process. > > What is the best way (if any) to listening on SolrCloud events ? I don't know how it's done, but if you are using SolrJ and Cl

Re: Solr Map Reduce Indexer Tool GoLive to SolrCloud with index on local file system

2014-07-03 Thread Tom Chen
Hi, In the GoLive stage, the MRIT sends the MERGEINDEXES requests to Solr instances. The request has a indexDir parameter with a hdfs path to the index generated on HDFS, as shown in the MRIT log: 2014-07-02 15:03:55,123 DEBUG org.apache.http.impl.conn.DefaultClientConnection: Sending request: GE

Field for 'species' data?

2014-07-03 Thread Dan Bolser
Hi, Does anyone on the list have experience with hierarchical facets, specifically for species data? I have a variety of 'messy' species names that I'd like to tidy up at analysis time and use as the basis for taxonomically guided hierarchical facets at query time. I was wondering if there's som

Re: Question about solrcloud recovery process

2014-07-03 Thread wrdrvr
Try querying the recovering core with distrib=false, you should get the count of docs in it. Most likely, since the replica is recovering it is forwarding all queries to the active replica, this can be verified in the core logs. -- View this message in context: http://lucene.472066.n3.nabble

Re: Streaming large updates with SolrJ

2014-07-03 Thread Joe Gresock
Thanks Hoss, that's a good explanation. I don't have much experience with the non-sugar parts of the API, so this was a good summary. I suppose I can at least help out the client heap size this way. On Wed, Jul 2, 2014 at 10:14 PM, Chris Hostetter wrote: > > : Now that I think about it, thoug

Re: Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
No, we're not doing NRT. The search clients aren't using CloudSolrServer and they are behind an AWS load balancer, which calls the Solr ping handler (implemented with ClusterStateAwarePingRequestHandler) to determine when the node is active. This ping handler also responds during the index copy, wh

Listening on SolrCloud events

2014-07-03 Thread Ugo Matrangolo
Hi, I would like to be informed as soon as a cluster event happens like a node dropping and/or starting a recovery process. What is the best way (if any) to listening on SolrCloud events ? Ugo

Re: Question about solrcloud recovery process

2014-07-03 Thread Mark Miller
I don’t know offhand about the num docs issue - are you doing NRT? As far as being able to query the replica, I’m not sure anyone ever got to making that fail if you directly query a node that is not active. It certainly came up, but I have no memory of anyone tackling it. Of course in many othe

Re: Dynamic field doesnt work

2014-07-03 Thread Anton Gavazuk
Jack thank you very much! So obvious - damn, Ive mixed my codebase with solr shema xml :) 2014-07-03 14:23 GMT+02:00 Jack Krupansky : > s/dynamic_field/dynamicField/

Re: Dynamic field doesnt work

2014-07-03 Thread Alexandre Rafalovitch
So, I was right about misspelling, but totally missed the one staring me in the face :-) Hopefully, that was all. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency On Thu, Jul 3, 2014 at 7:23 PM, Jack Krupansky wrote

Question about solrcloud recovery process

2014-07-03 Thread Peter Keegan
I bring up a new Solr node with no index and watch the index being replicated from the leader. The index size is 12G and the replication takes about 6 minutes, according to the replica log (from 'Starting recovery process' to 'Finished recovery process). However, shortly after the replication begin

Re: Query with many search items

2014-07-03 Thread Toke Eskildsen
On Thu, 2014-07-03 at 13:10 +0200, Matt Stunfield wrote: [Sections of text < 200 words, dictionary of terms & phrases] > Task: find if terms occur in single section. Found terms must be > distinguished. In query result there is (highlighted?) found terms > positions. I agree with Alexandre that

Re: Dynamic field doesnt work

2014-07-03 Thread Jack Krupansky
s/dynamic_field/dynamicField/ -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Thursday, July 3, 2014 5:45 AM To: solr-user@lucene.apache.org Subject: Re: Dynamic field doesnt work I would say something is misspelt somewhere. Put a dynamic field called '*' and se

Re: Query with many search items

2014-07-03 Thread Alexandre Rafalovitch
Luwak may be relevant here (https://github.com/flaxsearch/luwak)? Or it may help to describe difference from Luwak's solution to further fine-tune your requirement. Regards, Alex. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your So

Query with many search items

2014-07-03 Thread Matt Stunfield
Hi, I'm new to Lucene/SOLR and I'm researching if SOLR would fit to our case requirements. I would be very happy if You could help me :). Environment: We have a database storing some (mostly) text information. There are elements containing multiple sections of information. Each section is stored

Re: Dynamic field doesnt work

2014-07-03 Thread Alexandre Rafalovitch
I would say something is misspelt somewhere. Put a dynamic field called '*' and see what you end up getting in the schema. Obviously, on a separate test instance. Regards, Alex. Personal website: http://www.outerthoughts.com/ Current project: http://www.solr-start.com/ - Accelerating your Solr

Dynamic field doesnt work

2014-07-03 Thread Anton Gavazuk
Hi dear SOLR community, I'm having a problem with a dynamic field in my schema - basically when I'm adding any document with key matching dynamic field definition I'm getting back an error which says something like unknown field 'test_exattrs' The schema contains such definition SOLR version 4.

Re: Customise score

2014-07-03 Thread rachun
Hi Jack, I just found out that the sort it doesn't work right for the next page. The response result show that "numFound":23850. So I tried this .../select?q=MacBook&sort=sum(base_score,query($q,0))+desc&wt=json&indent=true&start=100 The summary of base_score and document sort at the next page s

Re: OCR - Saving multi-term position

2014-07-03 Thread Charlie Hull
On 02/07/2014 15:19, Manuel Le Normand wrote: Hello, Many of our indexed documents are scanned and OCR'ed documents. Unfortunately we were not able to improve much the OCR quality (less than 80% word accuracy) for various reasons, a fact which badly hurts the retrieval quality. As we use an open

Re: External File Field eating memory

2014-07-03 Thread Kamal Kishore Aggarwal
With the above replication configuration, the eff file is getting replicated at core/conf/data/external_eff_views (new dir data is being created in conf dir) location, but it is not getting replicated at core/data/external_eff_views on slave. Please help. On Thu, Jul 3, 2014 at 12:21 PM, Kamal K

Consultez mon profil LinkedIn

2014-07-03 Thread benjelloun
LinkedIn Daniel, J'aimerais vous inviter à rejoindre mon réseau professionnel en ligne, sur le site LinkedIn. anass anass benjelloun Stagiaire chez Document numérique Région de Dijon , France Veuillez confirmer que vous connaissez anass benjelloun : https://www.linkedin.com/e/-

J'aimerais vous inviter à rejoindre mon réseau professionnel sur LinkedIn.

2014-07-03 Thread benjelloun
Bonjour Walter, J'aimerais que vous rejoigniez mon réseau pour partager des nouvelles et rester en contact. anass Accepter : https://www.linkedin.com/e/v2?e=f7uf5h-hx5qhjgx-4c&a=preRegInvite&tracking=eml-guest-invite-cta&ek=invite_guest&invitationID=5890360658842640384&sharedKey=ln36HRyT Vou

J'aimerais vous inviter à rejoindre mon réseau professionnel sur LinkedIn.

2014-07-03 Thread benjelloun
Bonjour Alexandre, J'aimerais que vous rejoigniez mon réseau pour partager des nouvelles et rester en contact. anass Accepter : https://www.linkedin.com/e/v2?e=ylp2ee-hx5qhjfy-2p&a=preRegInvite&tracking=eml-guest-invite-cta&ek=invite_guest&invitationID=5890360657898921984&sharedKey=aNAb4H5v

Rejoignez mon réseau sur LinkedIn

2014-07-03 Thread benjelloun
Bonjour Alexandre, J'aimerais que vous rejoigniez mon réseau pour partager des nouvelles et rester en contact. anass Accepter : https://www.linkedin.com/e/v2?e=1p1dkj-hx5qhjh3-5k&a=preRegInvite&tracking=eml-guest-invite-cta&ek=invite_guest&invitationID=5890360658481917952&sharedKey=iwQz292V

I use LinkedIn and you should use it too.

2014-07-03 Thread benjelloun
Bonjour Steve, J'aimerais que vous rejoigniez mon réseau pour partager des nouvelles et rester en contact. anass Accepter : https://www.linkedin.com/e/v2?e=-k0w1w5-hx5qhjfn-3j&a=preRegInvite&tracking=eml-guest-invite-cta&ek=invite_guest&invitationID=5890360657726947328&sharedKey=v4df5Q_O Vou

J'aimerais vous inviter à rejoindre mon réseau professionnel sur LinkedIn.

2014-07-03 Thread benjelloun
Bonjour Erick, J'aimerais que vous rejoigniez mon réseau pour partager des nouvelles et rester en contact. anass Accepter : https://www.linkedin.com/e/v2?e=-u5fqpj-hx5qhjg5-70&a=preRegInvite&tracking=eml-guest-invite-cta&ek=invite_guest&invitationID=5890360658087665665&sharedKey=CJxCbD_4 Vou

Je suis sur LinkedIn et je vous recommande ce site également.

2014-07-03 Thread benjelloun
Bonjour Alexandre, J'aimerais que vous rejoigniez mon réseau pour partager des nouvelles et rester en contact. anass Accepter : https://www.linkedin.com/e/v2?e=z6nfen-hx5qhjgq-5r&a=preRegInvite&tracking=eml-guest-invite-cta&ek=invite_guest&invitationID=5890360658662281216&sharedKey=eBz8oxMp