Solr on Azure - Worker roles are just recycling

2014-08-15 Thread dgtallykarim
I have a similar issue. It started showing up from yesterday. I am suspecting it to be related to some azure update but not sure. I am still investigating.

Re: Managed Schema

2014-08-15 Thread Joseph Obernberger
Thank you for the update! I will work with 4.8.1 for now - I'm able to use the schema API and waiting (about 2 seconds) between any updates. Looking forward to 4.10! Joe On Fri, Aug 15, 2014 at 7:15 PM, Erick Erickson wrote: > Ryan is talking of cutting the initial 4.10 in a week or so. As

Solr on Azure - Worker roles are just recycling

2014-08-15 Thread Sangram Rath
Hi, I followed instructions on https://github.com/MSOpenTech/Windows-Azure-Solr to implement Solr on Azure, the deployment is successful and give me the web role UI, the worker roles however are in continuous recycle mode. I then tried on Azure emulator and the issue is same. I tried SolrInstEmul

Re: Solr cloud performance degradation with billions of documents

2014-08-15 Thread Erick Erickson
Toke: bq: I would have agreed with you fully an hour ago. Well, I now disagree with myself too :) I don't mind talking to myself. I don't even mind arguing with myself. I really _do_ mind losing the arguments I have with myself though. Scott: OK, that has a much better chance of working

Re: Managed Schema

2014-08-15 Thread Erick Erickson
Ryan is talking of cutting the initial 4.10 in a week or so. As Steve mentioned it may take some time to get it out the door, so let's call it a (very) tentative date of mid September. On Fri, Aug 15, 2014 at 9:16 AM, Steve Rowe wrote: > In the Lucene/Solr 4.X line, the average minor release int

RE: Solr cloud performance degradation with billions of documents

2014-08-15 Thread Toke Eskildsen
Erick Erickson [erickerick...@gmail.com] wrote: > I guess that my main issue is that from everything I've seen so far, > this project is doomed. You simply cannot put 7B documents in a single > shard, period. Lucene has a 2B hard limit. I would have agreed with you fully an hour ago and actually p

Re: Writing my first Solr Search Component

2014-08-15 Thread Apurv Verma
Thanks, this was very helpful :) -- Regards, Apurv Verma On Thu, Aug 14, 2014 at 9:31 AM, Tri Cao wrote: > 1. No, there is only one instance > 2. init() is called > 3. check these standard search components: > > https://github.com/apache/lucene-solr/tree/trunk/solr/core/src/java/org/apache/

RE: Solr cloud performance degradation with billions of documents

2014-08-15 Thread Toke Eskildsen
Wilburn, Scott [scott.wilb...@verizonwireless.com.INVALID] wrote: > You make some very good valid points. Let me clear a few things up, though. > We are not trying to put 7B docs into one single shard, because we are using > collections, created daily, which spread the index across the 32 shards th

Spellchecking suggestions won't collate

2014-08-15 Thread Corey Gerhardt
It must be Friday. I can't figure out why there is no collation value: { "responseHeader":{ "status":0, "QTime":31, "params":{ "spellcheck":"on", "spellcheck.collateParam.qf":"BUS_BUSINESS_NAME", "spellcheck.maxResultsForSuggest":"5", "spellcheck.maxCollations

Re: Syntax unavailable for parameter substitution Solr 3.5

2014-08-15 Thread Chris Hostetter
: but am stuck with syntax with does not allow any concatenation of params to : a prefix. there's no inherent support for anything like that in the local param parsing that would help you in building up facet params like this. you'd either need to do this in some layer betwen your client and so

Re: Syntax unavailable for parameter substitution Solr 3.5

2014-08-15 Thread deepaksshettigar
Shawn Thanks a lot your interest & quick response. The reason I am leaning towards a Solr config based solution is due to shortage of time & expertise in server level configuration. If I do not find any workable Solr solution, I would certainly try out Option #1 as suggested by you. For Option #

How to structure inner entity in the query result

2014-08-15 Thread patch_78
Hi,I have defined my entity as below when I queried a dataset which has two subjects I got the following result: "docs": [ { "title": [ "Q0916 105"], "descripti

RE: Solr cloud performance degradation with billions of documents

2014-08-15 Thread Wilburn, Scott
Erick, You make some very good valid points. Let me clear a few things up, though. We are not trying to put 7B docs into one single shard, because we are using collections, created daily, which spread the index across the 32 shards that make up the cloud/collection. Last I counted, we are puttin

Re: regexTransformer returns no results if there is no match

2014-08-15 Thread Shalin Shekhar Mangar
You are right. I just tried this. It works correctly as long as you don't use the same 'name' as the 'column' but it breaks if you use 'sourceColName'. Can you please open a Jira issue? On Tue, Aug 12, 2014 at 3:46 AM, wrote: > Hello, > > I try to construct wikipedia page url from page title u

Re: Managed Schema

2014-08-15 Thread Steve Rowe
In the Lucene/Solr 4.X line, the average minor release interval has been 10 weeks. 4.9 was released 7 weeks ago. It takes 2-3 weeks to go from starting the release process to producing and announcing a release, so I expect someone will volunteer to perform the 4.10 release within the next week

Re: Syntax unavailable for parameter substitution Solr 3.5

2014-08-15 Thread Shawn Heisey
On 8/15/2014 9:46 AM, deepaksshettigar wrote: > Another Query, is it possible to handle HTTP 413 by increasing Allowed HTTP > Request Size on Apache/Jboss I have no idea how to help you with the other idea you've got, but this is an area where I've got some expertise. You have two ways to solve t

Syntax unavailable for parameter substitution Solr 3.5

2014-08-15 Thread deepaksshettigar
Environment :- -- Solr version 3.5 Apache Web Server on Jboss AS 5.1.x === Problem statement :- -- I am using a singe request handler to handle dynamic scenarios.

Re: Managed Schema

2014-08-15 Thread Joseph Obernberger
Thank you! Any ideas when 4.10 will be released? I tried sleeping when adding fields, and that improved things, but since I'm in schema-less mode, sometimes that doesn't work either. If I managed the schema all 'myself' using the schema API; would that work? On Fri, Aug 15, 2014 at 11:34 AM, St

Re: Managed Schema

2014-08-15 Thread Steve Rowe
Hi Joseph, SOLR-6137 fixed the NPE you encountered with 4.9, as well a couple other managed schema concurrency issues - it will be included in the next release of Solr: 4.10. Steve On Aug 15, 2014, at 11:00 AM, Joseph Obernberger wrote: > Hi

Re: How to restore an index from a backup over HTTP

2014-08-15 Thread Shawn Heisey
On 8/15/2014 5:51 AM, Greg Solovyev wrote: > What I want to achieve is being able to send the backed up index to Solr > (either standalone or with ZooKeeper) in a way similar to creating a new > Collection. I.e. create a new collection and upload an exiting index directly > into that Collection.

Managed Schema

2014-08-15 Thread Joseph Obernberger
Hi - I've been using Solr Cloud in schema-less mode and am having some issues with 4.8.1 and 4.9.0 when adding lots of new fields. In 4.8.1 I'll get continuous messages that say: 134567307 [qtp968427990-2492] INFO org.apache.solr.schema.IndexSchema â Failed to persist managed schema at /configs/

Re: Solr cloud performance degradation with billions of documents

2014-08-15 Thread Erick Erickson
Toke: You make valid points. You're completely right that my reflexes are for sub-second responses so I tend to think of lots and lots of memory being a requirement. I agree that depending on the problem space the percentage of the index that has to be in memory varies widely, I've seen a large va

Highlighting Block Joins

2014-08-15 Thread StrW_dev
Hi, I am using Block in my index structure as I have many variations of documents, which have the same content. I am searching on parent level, but will be returning the child level. Example doc: This works great and fast, but doesn't work with the highlig

How to restore an index from a backup over HTTP

2014-08-15 Thread Greg Solovyev
Hello, I am looking for advice on implementing the following backup/restore scenario. We are using Solr to index email. Each mailbox has it's own Collection. We do not store emails in Solr, the emails are stored on disk in a blob store, meta data is stored in a database and Solr is used only fo

Filtering autosuggest results in Solr

2014-08-15 Thread Chakravarthy Yeleswarapu -X (cyeleswa - ASQUARE INC at Cisco)
Hi, We have following use case: Filter autosuggest results of solr_field1 based on solr_field2 values. The solr_field2 values are constants such as "source1", "source2" etc. If user types "xyz" for solr_field1, suggestions returned can match anywhere in solr_field1 value such as abcxyz, xyzabc

FuzzyLookup does not return exact matches first

2014-08-15 Thread Harun Reşit Zafer
I have the following configuration. fsuggest name="classname">org.apache.solr.spelling.suggest.Suggester name="lookupImpl">org.apache.solr.spelling.suggest.fst.FuzzyLookupFactory text_suggest text_suggest suggestio

Re: Ubuntu 14.04 & Tomcat 7.0.52 & Solr 4.9 - org.apache.solr.common.SolrException: Invalid chunk header

2014-08-15 Thread Shawn Heisey
On 8/15/2014 12:32 AM, mark12345 wrote: > I am pretty sure I have resolved this issue down to the HttpClient version > SolrJ was using. > > The SolrJ 4.9.0 Maven Library (org.apache.solr:solr-solrj) has a dependency > on HttpClient version org.apache.httpcomponents:httpclient 4.3.1, however I > ha