Admin Optimize

2015-12-18 Thread Rick Leir
Hi all, The Solr Admin new UI in 5.4 lost the Optimise button in core admin. What did it do, and is the 'Optimize' more automatic? Google is not helping me, but then perhaps I should go past the first results page! It is in the screendump here: https://cwiki.apache.org/confluence/display/solr/Core

ToParentBlockJoinQuery.java

2015-12-23 Thread Rick Leir
Hi all, This is working fine for me, searching for 'charlie': $ curl http://localhost:8983/solr/dorsetdata/query -d ' q={!parent which="content_type:parentDocument" score=total} type:page AND charlie &wt=json &rows=2 &debugQuery=true &fl=score,[child parentFilter=content_type:parentDocument childF

Re: ToParentBlockJoinQuery.java

2015-12-23 Thread Rick Leir
> If you want conditions/filters on the parent document, that's easy... > your query is already mapping to parents, so simply add another "fq" > param. That is perfect. Thanks!! Cheers -- Rick

Fwd: Block Joins

2015-11-29 Thread Rick Leir
Hi all, I am new to Block Joins, and am trying to follow https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-BlockJoinQueryParsers This page shows two forms of block join syntax for this parser q={!child of=}. example queryq={!child of="content_type:parentDoc

Re: Block Joins

2015-11-30 Thread Rick Leir
nt"} type_s:page AND charlie& wt=json&indent=true& fl=score,[child parentFilter=type_s:book childFilter="charlie" ],canonicalMaster,title,publisher,[docid]' Cheers -- Rick Re: Block Joins > 118348 by: Rick Leir > 118349 by: Mikhail Khludnev &g

Re: Block Joins

2015-12-01 Thread Rick Leir
On Mon, Nov 30, 2015 at 11:47 PM, wrote: > > I'd happy that it helps. You need to thank Varun who made [child]. There is > no performance impact for sure, it's a compromise or either nobody > considered it as an issue. You can raise a new JIRA and describe how it > should work in general and make

Re: Block Joins

2015-12-01 Thread Rick Leir
Hi all, Scoring is confusing me. Is the following correct? $ curl http://localhost:8983/solr/dorsetdata/query -d ' q={!parent which="content_type:parentDocument" *score=max*} type_s:page AND that& wt=json&indent=true& fl=score,[child parentFilter=type_s:book childFilter=*{!score=avg}*that ],canoni

Re: Block Joins

2015-12-02 Thread Rick Leir
d parentFilter=type_s:book childFilter=*{!score=avg}*that], > > This childFilter value doesn't make sense. What do you want to achieve? > > On Tue, Dec 1, 2015 at 7:28 PM, Rick Leir > wrote: > > > Hi all, > > Scoring is confusing me. Is the following correct?

Re: highlight

2015-12-02 Thread Rick Leir
For performance, if you have many large documents, you want to index the whole document but only store some identifiers. (Maybe this is not a consideration for you, stop reading now ) If you are not storing the whole document, then Solr cannot do the highlighting. You would get an id, then locate

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-04 Thread Rick Leir
On Fri, Dec 4, 2015 at 12:59 AM, wrote: > > >Just wondering if folks have any suggestions on using Schema.xml vs. > >Managed Schema going forward. > > We are using loosely typed languages (Perl and Javascript), and a loosely typed DB (CouchDB). This is consistent with running Solr in Schemaless

re: nested fields

2015-12-14 Thread Rick Leir
On Sun, Dec 13, 2015 at 8:26 PM, wrote: > > I want to define nested fileds in SOLR using schema.xml. Us too (using Solr 5.3.1). And doco is not jumping out at me. My approach is (please suggest a better way) 1/ create a blank core 2/ add a few nested docs using bin/post 3/ use the schema browse

Re: Question about content indexing with Alfresco

2016-06-09 Thread Rick Leir
Is there some reason you are using version 1.4? In the Solr admin dashboard you can load your core and do queries against it. On June 9, 2016 5:06:33 AM EDT, OTEC Jordi Florit wrote: >Hi, > >I'm using Alfresco 4.2.6 and SOLR 1.4, and I want to verify if my >content is indexing on SOLR or not. I

Re: Checking performance of plugins, queryParser, edismax, etc

2016-06-09 Thread Rick Leir
On Wed, 2016-06-08 at 11:56 +0800, Zheng Lin Edwin Yeo wrote: > Hi, > > Would like to find out, is there a way to check the performance of > the queryParser and things like edismax in Solr? > > I have tried on the debug=true, but it only show general information like > the time taken for query,

Re: Nested vs Flattened Indexes

2016-06-09 Thread Rick Leir
Can you use Tika? https://tika.apache.org/0.9/formats.html On Wed, 2016-06-08 at 10:06 -0400, Aniruddh Sharma wrote: > Hi > > I am new to use Solr. > > I am running Solr 4.10.3 on CDH 5.5. > > My use case is , I have real time data ingestion in Hadoop on which I want > to implement search. >

suggester stack overflow

2016-06-09 Thread Rick Leir
I know how to debug this, but am hoping someone can give me a tip before I dive in! Solr 6.0.0, I just started the server, hoping to build the suggester. from the log: 3625 INFO (searcherExecutor-7-thread-1-processing-x:blinkmon) [ x:blinkmon] o.a.s.s.s.SolrSuggester SolrSuggester.build(mySugge

Re: SOLR Sizing

2016-10-04 Thread Rick Leir
OCR’ed text can have large amounts of garbage such as '';,-d'." particularly when there is poor image quality or embedded graphics. Is that what is causing your huge vocabularies? I filtered the text, removing any word with fewer than 3 alphanumerics or more than 2 non-alphas. On 2016-10-03 0

Re: SOLR Sizing

2016-10-06 Thread Rick Leir
the text size. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) On Oct 4, 2016, at 7:14 AM, Rick Leir wrote: OCR’ed text can have large amounts of garbage such as '';,-d'." particularly when there is poor image quality or embedded gr

Re: (ANNOUNCEMENT) Solr Examples reading group

2016-10-07 Thread Rick Leir
Thanks for using the word bewildering, I agree. While we are talking of simplifying solrconfig.xml, may I mention YAML? I find the YAML format so much more readable than XML. I have not looked at the code which reads the config, so I do not know how big a change it is to use cfg4j and read

Re: (ANNOUNCEMENT) Solr Examples reading group

2016-10-07 Thread Rick Leir
for warping this thread so far: When you comment out a swatch of XML, you need to allow for embedded comments. That is less troublesome with some other flavours of config file. Erick On Fri, Oct 7, 2016 at 9:50 AM, Rick Leir wrote: Thanks for using the word bewildering, I agree. While we are talk

Related Search

2016-10-24 Thread Rick Leir
Hi all, There is an issue 'Create a Related Search Component' which has been open for some years now. It has a priority: major. https://issues.apache.org/jira/browse/SOLR-2080 I discovered it linked from Lucidwork's very useful blog on ecommerce: https://lucidworks.com/blog/2011/01/25/impl

Re: Poor Solr Cloud Query Performance against a Small Dataset

2016-11-02 Thread Rick Leir
Here is a wild guess. Whenever I see a 5 second delay in networking, I think DNS timeouts. YMMV, good luck. cheers -- Rick On 2016-11-01 04:18 PM, Dave Seltzer wrote: Hello! I'm trying to utilize Solr Cloud to help with a hash search problem. The record set has only 4,300 documents. When I r

Re: Apache Solr Question

2016-11-03 Thread Rick Leir
On November 3, 2016 4:49:07 AM EDT, Chien Nguyen wrote: >Hi everyone! >I'm a newbie in using Apache Solr. Welcome! > I've read some documents about it. >But i >can't answer some questions. >1. How many documents Solr can search at a moment?? I would like to say unlimited. But it depends on

Re: Field x is not multivalued and destination for multiple copyFields

2017-06-05 Thread Rick Leir
Nawab, If you have multivalued=true and you index a document with two or more name_* fields, then the name_token will have two or more values, all of which will be searchable. I think this is what you want, because names can be quite different in different languages. For example the country na

Re: Upgrading config from 4.5.0 to 6.5.1

2017-06-05 Thread Rick Leir
ble settings. For an example of >> "text_general_edge_ngram" fieldType setting, side="front" is no >longer >> valid attributes. >> >> Tony >> >> >> On Wed, May 31, 2017 at 3:53 PM, Rick Leir >wrote: >> >> > Hi N

Re: Velocity UI with Analyzing Infix Suggester?

2017-06-06 Thread Rick Leir
> typeahead solutions using a separate collection Erik, Do you use a separate collection so it can be smaller and thereby faster? Or so you can keep good performance on the the main collection server? In my mind, the performance of the as-you-type is more important than the regular search. Chee

Re: Custom Response writer

2017-06-16 Thread Rick Leir
M, You ask what is better, and that is often a matter of opinion. My guess is that you should have that name stored in the Solr doc, so it can be in the response when you have a match. Oh, and I find JSON easier to work with than XML. Cheers -- Rick On June 16, 2017 10:19:03 AM EDT, mganeshs w

Re: Solr starts without error but not working

2017-06-18 Thread Rick Leir
firewall? On 2017-06-18 01:04 AM, Nawab Zada Asad Iqbal wrote: Hi So I am deploying solr 6.5.1 using puppet to another machine (which I can ssh to) . The logs have no error but solr home page has nothing (no response from server) . Using curl also showed empty response. What could be wrong ?

Re: Give boost only if entire value is present in Query

2017-06-19 Thread Rick Leir
Aman, Use a copyfield so you can have a second field that uses a different analysis chain. In the new field you just created for the copyfield, use the lowercase type, or create a type using KeywordTokenizer in the analysis chain. Then match on the original field, and boost based on the new fiel

Re: Dynamic fields vs parent child

2017-06-28 Thread Rick Leir
Saurabh Maybe you need two fields. The first field is named "keyName" and the second is "keyValue". Give that a try, though searching with AND may be a challenge. Otherwise, use one field named "whatever" containing "key-value", assuming '-' never appears in keys or values. Search for an exa

Re: Not highlighting "and" and "or"?

2017-06-29 Thread Rick Leir
Stopwords? On June 28, 2017 5:13:43 PM EDT, Walter Underwood wrote: >Is there some special casing in the highlighter to skip query syntax >words? The words “and” and “or” don’t get highlighted. > >This is in 6.5.0. > > question > html > 440 > fastVector > 1 > >wunder

Re: Not highlighting "and" and "or"?

2017-06-29 Thread Rick Leir
rators is true. The docs don’t give the default >value for that in edismax. > >https://lucene.apache.org/solr/guide/6_6/the-extended-dismax-query-parser.html > >wunder >Walter Underwood >wun...@wunderwood.org >http://observer.wunderwood.org/ (my blog) > > >>

Re: Not highlighting "and" and "or"?

2017-06-29 Thread Rick Leir
>>>> >https://observer.wunderwood.org/2007/05/31/do-all-stopword-queries-matter/ >>>> >>>> Mostly, stopwords were a performance hack back when people ran >search >>>> engines on 16-bit machines. Neither disks nor RAM were big enough >to

Re: Not highlighting "and" and "or"?

2017-06-30 Thread Rick Leir
Created Wish SOLR-10992 . Maybe I am a troublemaker, the example configs will need to be tested after the change is made and that will be a bother. Any tips on how to easily do that? Thanks Rick On 2017-06-30 10:54 AM, Walter Underwood wrote

Re: Solr 6.5.1 crashing when too many queries with error or high memory usage are queried

2017-07-02 Thread Rick Leir
Stack trace? Memory diagnostics from top(1)? What querys? On July 2, 2017 3:00:16 AM EDT, Zheng Lin Edwin Yeo wrote: >Hi, > >I'm currently facing the issue whereby the Solr crashed when I have >issued >too many queries with error or those with high memory usage, like JSON >facet or Streaming exp

Re: behind nginx

2017-07-04 Thread Rick Leir
Hi Walid Is there any error occurring? If not, then do not change anything. Yes, that version of Solr is old, if you get the chance you would want to upgrade. Cheers -- Rick On July 4, 2017 6:11:57 AM EDT, walid wrote: >Hi, >i have a multiple solr slaves reversed by nginx, clients browser send

Re: recovery information for replica in recovering state

2017-07-06 Thread Rick Leir
Novin, How long is recovery taking for you? I assume the recovery completes correctly. Cheers-- Rick On July 6, 2017 7:59:03 AM EDT, Novin Novin wrote: >Hi Guys, > >I was just wondering is solr cloud can give information about how much >recovery has been done by replica while in it is recovering

Re: index new discovered fileds of different types

2017-07-07 Thread Rick Leir
Thaer Whoa, hold everything! You said RDF, meaning resource description framework? If so, you have exactly​ three fields: subject, predicate, and object. Maybe they are text type, or for exact matches you might want string fields. Add an ID field, which could be automatically generated by Solr,

Re: index new discovered fileds of different types

2017-07-09 Thread Rick Leir
architect Cominvent AS - www.cominvent.com 8. jul. 2017 kl. 02.36 skrev Rick Leir : Thaer Whoa, hold everything! You said RDF, meaning resource description framework? If so, you have exactly​ three fields: subject, predicate, and object. Maybe they are text type, or for exact matches you might

Re: Using HTTP and HTTPS at the same time

2017-07-12 Thread Rick Leir
Hi all, The recommended best practice is to run a web app in front of Solr, and maybe there is no benefit in SSL between the web app and Solr. In any case, if SSL is desired, you would configure the web app to always use HTTPS. Without the web app, you can have Apache promote a connection from

Re: accessing numfound value

2017-07-13 Thread Rick Leir
Steve, This is a wild guess because you are not telling us nearly enough. The guess: you are not getting any matches or hits. O. Cheers --Rick On July 12, 2017 4:29:43 PM EDT, Steve Pruitt wrote: >I'm having difficulty finding the value for numFound that is in the >response. My context is a cus

Re: Get results in multiple orders (multiple boosts)

2017-07-19 Thread Rick Leir
Luca, You can pass a sort parameter in the query. User A could sort=date%20desc and user b could sort=foofield%20asc. Maybe query functions can also help with this. Cheers -- Rick On July 19, 2017 4:39:59 AM EDT, Luca Dall'Osto wrote: >Hello,The problem of build an index is that each user has

Re: 6.6 cloud starting to eat CPU after 8+ hours

2017-07-19 Thread Rick Leir
Markus, What does iostat(1) tell you? Cheers -- Rick On July 19, 2017 5:35:32 AM EDT, Markus Jelsma wrote: >Hello, > >Another peculiarity here, our six node (2 shards / 3 replica's) cluster >is going crazy after a good part of the day has passed. It starts >eating CPU for no good reason and its

Re: DIH, multiple sources, cores and search: single core with multiple entities or single core per source with search across multiple cores?

2017-07-24 Thread Rick Leir
Giovanni, Start with your search results page and work back from there. Decide what fields you want to display in a results page, then plan for your Solr document to contain all these fields. Now you will need a program to ingest the data from whatever database, and create documents for Solr. Th

Re: FreeTextSuggester throwing error "token must not contain separator byte"

2017-07-24 Thread Rick Leir
Angel, The 20 byte is an ASCII space character, which is a separator in most contexts. Breaking the buffer at spaces, you can see 6 non-space tokens. Have a look at your analysis chain and see why you are getting this. Cheers -- Rick On July 24, 2017 4:27:00 PM EDT, Angel Todorov wrote: >Hi gu

Re: Unable to create core [collection] Caused by: null

2017-07-25 Thread Rick Leir
Lucas, What is in the log before that stackdump? The stackdump seems to indicate that Solr is trying to run with a managed schema. Looking at the cwiki, "When a ||is not explicitly declared in a |solrconfig.xml|file, Solr implicitly uses a|ManagedIndexSchemaFactory|, which is by default|"muta

Re: generate field name in query

2017-08-02 Thread Rick Leir
Peter The common setup is to use copyfield from all your fields into a 'grab bag' containing everything, and then to search on it alone. Cheers -- Rick On August 2, 2017 7:31:10 AM EDT, Peter Kirk wrote: >Hi - is it possible to create a query (or fq) which generates the field >to search on, base

Re: Solr Input and Output format

2017-08-03 Thread Rick Leir
Ranganath, I googled 'getRecordWriter solr' and came up with (among 446 results) this partial stack trace: at org.apache.solr.handler.component.HttpShardHandlerFactory.init(HttpShardHandlerFactory.java:168) at org.apache.solr.handler.component.ShardHandlerFactory.newInstance(ShardHandlerF

Re: Limiting the number of queries/updates to Solr

2017-08-03 Thread Rick Leir
On 2017-08-02 11:33 PM, Shawn Heisey wrote: On 8/2/2017 8:41 PM, S G wrote: Problem is that peak load estimates are just estimates. It would be nice to enforce them from Solr side such that if a rate higher than that is seen at any core, the core will automatically begin to reject the reques

Re: Get results in multiple orders (multiple boosts)

2017-08-04 Thread Rick Leir
>On Thursday, July 20, 2017 4:10 AM, Susheel Kumar > wrote: > > > Let me try to put an example for custom sort. > >On Wed, Jul 19, 2017 at 6:34 AM, Rick Leir wrote: > >> Luca, >> You can pass a sort parameter in the query. User A could >sort=date%20desc >

Re: Get results in multiple orders (multiple boosts)

2017-08-08 Thread Rick Leir
er (for example: >sort=category desc, source desc, /*...*/ ). >How do you set a custom sort order in solr? >Thanks > >Luca > > > > >On Friday, August 4, 2017 7:41 PM, Rick Leir >wrote: > > > Luca >I hope you have a web app in front of Solr. It cou

Re: identifying source of queries

2017-08-09 Thread Rick Leir
Suresh If you have a web app in front of Solr and it or Apache logs all requests then you should be able to match the log entries to the solr.log entries. That would tell you a source IP, but it might not help if the users are behind a nat firewall. But yes, you could look at the nat firewall lo

Re: Unable to write response, client closed connection or we are shutting down

2017-08-14 Thread Rick Leir
Nawab What test software do you use? What else is happening when the exception occurs? Cheers -- Rick On August 12, 2017 1:48:19 PM EDT, Nawab Zada Asad Iqbal wrote: >Hi, > >I am executing a query performance test against my solr 6.6 setup and I >noticed following exception every now and then. W

Re: Comparison of Solr with Sharepoint Search

2017-08-14 Thread Rick Leir
To get the content into Solr, you need to configure the SharePoint crawler and content processor so that your custom script gets called (I think that is possible, but have never done it). The script needs to post the content of each page to the Solr RESTful interface. On August 14, 2017 2:10:40

Re: Get results in multiple orders (multiple boosts)

2017-08-16 Thread Rick Leir
nother array that specify the sort type (like example, >user should ask documents ordered by DATE first, then by CATEGORY, >etc... ) >Natural sort order could be fine only for DATE and NAME, but for >CATEGORY and SOURCE I have to use the array with ids sorted by user. >Thanks! > &

Re: Fetch a binary field

2017-08-17 Thread Rick Leir
On 2017-08-12 04:19 AM, Barbet Alain wrote: Hi ! Because this field containt a zipped xml that is bigger than all others fields & I don't need it for searching, just for display. Yes it would be better if this field is outside the Lucene base, but as I have hundred of bases like that, with milli

Re: Issue with delta import

2017-08-17 Thread Rick Leir
It says here https://cwiki.apache.org/confluence/display/solr/Parallel+SQL+Interface#ParallelSQLInterface-SolrSQLSyntax that you need to escape the FROM. Did you try it without the 'false' != 'false' OR On 2017-08-16 04:07 AM, bhargava ravali koganti wrote: I'm getting error like this Exce

Re: Get results in multiple orders (multiple boosts)

2017-08-18 Thread Rick Leir
Luca Walter has got the best word on this, you should use SQL for sorting (maybe mySQL or Postgres). If you also need searching, you can create a Solr index by ingesting from the SQL database. The Solr index would be just used for searching. Cheers -- Rick -- Sorry for being brief. Alternate em

Re: FastVector does not highlight for phrase query when it contains stop word/s

2017-08-21 Thread Rick Leir
Recent discussions have recommended that you not use stop words in any case. Cheers -- Rick On August 21, 2017 11:47:11 AM EDT, Jagdish Vasani wrote: >Hi Solr Users, > >I come across issue that fast Vector highlighter does not highlight >field values when search for phrase query contains stop

Re: Get results in multiple orders (multiple boosts)

2017-08-22 Thread Rick Leir
never used Postgres) or I have to >forgot it? >Thanks! > > >Luca > > >On Saturday, August 19, 2017 1:02 AM, Rick Leir >wrote: > > > Luca >Walter has got the best word on this, you should use SQL for sorting >(maybe mySQL or Postgres). If you also ne

Re: Custom StoredFieldVisitor in Solr

2017-08-24 Thread Rick Leir
Jamie, what is the use case? Cheers -- Rick On August 23, 2017 11:30:38 AM MDT, Jamie Johnson wrote: >I thought I had asked this previously, but I can't find reference to it >now. I am interested in using a custom StoredFieldVisitor in Solr and >after spelunking through the code for a little it

Re: ClassCastException when using HighFrequencyDictionaryFactory with SearchComponent

2017-08-29 Thread Rick Leir
Craig, Just a guess, but your threshold is in a string field. Cheers -- Rick On August 29, 2017 7:37:56 AM MDT, Craig Smiles wrote: >Hi, > >I've been trying to get a suggester working with the >HighFrequencyDictionaryFactory but when setting the threshold I'm >getting a >ClassCastException when t

Re: Different ideas for querying unique and non-unique records

2017-08-30 Thread Rick Leir
Susheel, Just a guess, but carrot2.org might be useful. But it might be overkill. Cheers -- Rick On August 30, 2017 7:40:08 AM MDT, Susheel Kumar wrote: >Hello, > >I am looking for different ideas/suggestions to solve the use case am >working on. > >We have couple of fields in schema along with

Re: Indexed=false for a field,but still able to search on field.

2017-08-30 Thread Rick Leir
Ashish, Fast search depends on indexing the data. If it is not indexed, then the search becomes a full table scan which is much slower. Cheers -- Rick On August 29, 2017 11:57:44 AM MDT, AshB wrote: >Hi, > >Thanks ,got this issue is happening because of docValues=true. > >Please elaborate on "fu

Re: "What is Solr" in Google search results

2017-08-30 Thread Rick Leir
Vincenzo, This is a discussion for the wikipedia 'talk' page. My sense is that information must be verifiable, and that the popularity rating at db-engines is not transparent. Would you like to start the discussion? Cheers -- Rick On August 30, 2017 5:17:25 PM MDT, Vincenzo D'Amore wrote: >Hi

Re: ERR_SSL_VERSION_OR_CIPHER_MISMATCH

2017-09-01 Thread Rick Leir
Kent, Did you say you are using Tomcat? Solr does not use Tomcat by default, so you will need to tell us more about your configuration. But first, think of what you might have changed just before it stopped working. Cheers -- Rick On September 1, 2017 11:55:47 AM EDT, "Younge, Kent A - Norman,

Re: Solr uses lots of shared memory!

2017-09-03 Thread Rick Leir
Hi all Malloc has a lock while it is active in the heap. If there is more than one thread, and malloc finds the lock in use, then it avoids waiting on the lock by creating a new 'arena' to hold its heap. My understanding is that a process with multiple threads which are all active users of mallo

Re: Solr Poc -Help Needed

2017-09-05 Thread Rick Leir
Harshal, Look in solrconfig.xml, this needs to be configured based on one or more fields. Cheers -- Rick On September 4, 2017 11:40:29 PM EDT, "Agrawal, Harshal (GE Digital)" wrote: >Hello All, > >I am looking to index csv file/ PDF documents. I want solr to generate >unique key itself. >I tr

RE: Customizing JSON response of a query

2017-09-06 Thread Rick Leir
Sarvo, What are you trying to achieve? Describe the use case. Cheers -- Rick On September 6, 2017 12:36:08 PM EDT, "Davis, Daniel (NIH/NLM) [C]" wrote: >It should be possible with a custom response handler. > >-Original Message- >From: Sarvothaman Madhavan [mailto:relad...@gmail.com] >S

Re: Customizing JSON response of a query

2017-09-06 Thread Rick Leir
t; > > >I think the "grouped" json format will be much more intuitive to my end >users who are going to use the search > > > >P.S: Just to be clear I am not having any trouble querying >children/parent >document since I have all of this stored using fully qualified

Re: Query Boosting and sort

2017-09-08 Thread Rick Leir
Renuka, You have not told us nearly enough about your issue. What query? config? cheers -- Rick On 2017-09-08 05:42 AM, Renuka Srishti wrote: Hello All, I am trying to use sort parameter and phrase boosting together in search. But, if I use the sort parameter, it seems like Phrase Boosting d

Re: multi language search engine in solr

2017-09-10 Thread Rick Leir
Mugeesh, One important question: will the typical document have a mix of English and Bangla and Hindi? If so, you would probably have them all in one collection. Another thing to think about is the tokenizer. Are all words separated by white space? If not, then you might need to think about whic

Re: SolrCloud 5.3.1 "IndexWriter is closed"

2017-09-12 Thread Rick Leir
Frank, I assume you re-index everything after changing schema.xml? cheers -- Rick On 2017-09-11 04:38 PM, Kelly, Frank wrote: Just wondering if anyone has seen this before and might understand why this is happening When we deploy a new schema.xml adding some new search terms we get the dre

Re: Freeze Index

2017-09-13 Thread Rick Leir
Fabien, What do you see in the logfile at the time of the freeze? Cheers -- Rick On September 13, 2017 3:01:17 PM EDT, fabigol wrote: >hi, >my problem is that my index freeze several time and i don't know why. >So i >lost all the data of my index. >I have 14 million of documents from postgresql

Re: Solr - google like suggestion

2017-09-13 Thread Rick Leir
Ruby In the config sample, you can see this: 10 Is this the number of suggestions you are getting? Cheers -- Rick On September 13, 2017 4:26:16 PM EDT, ruby wrote: >So I followed following article to enable google like suggestions in my >application: >https://lucidworks.com/2015/03/04/solr-sugge

Re: Solr Spatial Index and Data

2017-09-14 Thread Rick Leir
hi Can Ezgi > First of all, i want to use spatial index for my data include polyghons and points. But solr indexed first 18 rows, other rows not indexed. Do all rows have a unique id field? Are there errors in the logfile? cheers -- Rick .

Re: Solr- Data search across multiple vores

2017-09-20 Thread Rick Leir
Harshal, You could send your Solr query to both cores but then you could have problems combining the results because the scores are not absolute: they just give a ranking in their own core. It might be ok, if you are searching on fields which are common to both cores. But I suspect that you wou

Re: Replicates not recovering after rolling restart

2017-09-22 Thread Rick Leir
Wunder, Erick $ dc 16o 1578578283947098112p 15E83C95E8D0 That is an interesting number. Is it, as a guess, machine instructions or an address pointer? It does not look like UTF-8 or ASCII. Machine code looks promising: Disassembly: 0: 15 e8 3c 95 e8 adceax,0xe8953ce8 5:

Re: SOLR terminology

2017-09-28 Thread Rick Leir
Gunalan, Solr Core (core), is one-to-one with a Solr process and its data directory. It can be a shard, or part of a replica. Collection - is one or more shards grouped together, and can be replicated for reliability, availability and performance Node - is a machine in a Zookeeper group SolrClu

Re: Time to Load a Solr Core with Hdfs Directory Factory

2017-10-04 Thread Rick Leir
Shashank, I had a quick look at: https://lucene.apache.org/solr/guide/6_6/running-solr-on-hdfs.html Did you enable the Block Cache and the solr.hdfs.nrtcachingdirectory? cheers -- Rick On 2017-10-03 09:22 PM, Shashank Pedamallu wrote: Hi, I’m trying an experiment in which, I’m loading a core

Re: ERROR ipc.AbstractRpcClient: SASL authentication failed

2017-10-04 Thread Rick Leir
Ascot, At the risk of ...   Can you disable Kerberos in Hbase? If not, then you will have to provide a password! Rick On 2017-10-04 07:32 PM, Ascot Moss wrote: Does anyone use hbase indexer in index kerberos Hbase to solr? Pls help! On Wed, Oct 4, 2017 at 10:18 PM, Ascot Moss wrote: Hi

Re: book on solr

2017-10-12 Thread Rick Leir
Jay, get info on this with a search: https://www.google.ca/search?q=solr+shard+size cheers -- Rick On 2017-10-13 01:42 AM, Jay Potharaju wrote: Any blog or documentation also that would provide some basic rules or guidelines for scaling would also be great. Thanks Jay Potharaju

Re: Appending fields to pre-existed document

2017-10-13 Thread Rick Leir
Hi Show us the solr version, field types, the handler definition, and the query you send. Any log entries? Cheers -- Rick On October 13, 2017 5:57:16 AM EDT, "Игорь Абрашин" wrote: >Hello, solr community. >We are getting strugled with updating already existing docs. For >instance, >we got inde

Re: Solr related questions

2017-10-13 Thread Rick Leir
1/ the _version_ field is necessary. 2/ there is a Solr api for editing the manged schema 3/ not having used solrnet, I suspect you can bypass it and use the solr REST api directly. Cheers -- Rick On October 13, 2017 5:40:26 AM EDT, startrekfan wrote: >Hello, > >I have some Solr related questio

Re: Several critical vulnerabilities discovered in Apache Solr (XXE & RCE)

2017-10-13 Thread Rick Leir
Hi all, What is the earliest version which was vulnerable? Thanks -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Rick Leir
On 2017-10-13 04:19 PM, Kevin Layer wrote: Amrit Sarkar wrote: Kevin, fileType => md is not recognizable format in SimplePostTool, anyway, moving on. OK, thanks. Looks like I'll have to abandon using solr for this project (or find another way to crawl the site). Thank you for all the help,

Re: Several critical vulnerabilities discovered in Apache Solr (XXE & RCE)

2017-10-15 Thread Rick Leir
nt AS - www.cominvent.com > >> 15. okt. 2017 kl. 15:29 skrev Florian Gleixner : >> >> On 13.10.2017 15:13, Rick Leir wrote: >>> Hi all, >>> What is the earliest version which was vulnerable? >>> Thanks -- Rick >>> >> >> As

Re: Deploy Solr to Production: guides, best practices

2017-10-19 Thread Rick Leir
Maximka The app server is bundled in Solr, so you do not install Tomcat or JEtty separately. Cheers -- Rick On October 19, 2017 2:01:30 AM EDT, maximka19 wrote: >Hi everyone! > >I was looking for full-text search engine and chosen Solr. Quickly >introduced with Solr. Now I'm having troubles wit

Re: Goal: reverse chronological display Methods? (1) boost, and/or (2) disable idf

2017-10-20 Thread Rick Leir
Bill, In the debug score calculations, the bf boosting does not appear at all. I would expect it to at least show up with a small value. So maybe we need to look at the query. Cheers -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Upload/update full schema and solrconfig in standalone mode

2017-10-20 Thread Rick Leir
Alessandro First, let me say that the whole idea makes me nervous. 1/ are you better off with scp? I would not want to do this via Solr API 2/ the right way to do this is with Ansible, Puppet or Docker, 3/ would you like to update a 'QA' installation, test it, then flip it into production? Cheers

Re: Upload/update full schema and solrconfig in standalone mode

2017-10-21 Thread Rick Leir
Alessandro, Scp is "secure cp" and is a part of the ssh service. So if you have ssh access then you can use scp. From Windows you would be using winscp. Many hosts provide this but not all. If you send files to the ops staff then they can coordinate the restart and any fallback planning, so th

Re: BlendedTermQuery for Solr?

2017-10-25 Thread Rick Leir
James It looks as if Markus could help: http://lucene.472066.n3.nabble.com/BlendedTermQuery-causing-negative-IDF-td4271289.html Also, ES has a query. You could look at the source there. "BlendedTermQuery forms the guts behind Elasticsearch’s cross_field search. -- Doug Turnbull Cheers -- Rick O

Re: Some problems in SOLR-6.5.1

2017-10-25 Thread Rick Leir
Klin, You need to use the new version's solrconfig.xml, with modifications as necessary. Start by looking at the current solrconfig, what was modified there? Did you re-index? If you cannot reindex then you should upgrade to 5.n then to 6.m. Cheers -- Rick On October 24, 2017 11:21:48 PM EDT, S

Re: Incomplete Index

2017-10-31 Thread Rick Leir
Dawg, I have a similar setup, and this is what works for me. I have a field which contains a timestamp. The timestamp is set to be identical for all documents added/updated in a run. Whe the run is complete and some/many documents have been overwritten then I can delete all un-updated documents

Re: Fwd: configuring Solr with Tesseract

2017-11-06 Thread Rick Leir
Anand, As Charlie says you should have a separate process for this. Also, if you go back about ten months in this mailing list you will see some discussion about how OCR can take minutes of CPU per page, and needs some preprocessing with Imagemagick or Graphicsmagick. You will want to do some fi

Re: recent utf8 problems

2017-11-06 Thread Rick Leir
Dr. Krell You could look at your /select query handler, and compare it with the /query query handler in the Admin config. Did you upgrade from a previous version of Solr? Or change your config ( no, you must have thought of that). If it is a bug related to the Java upgrade then you need to sho

Re: recent utf8 problems

2017-11-06 Thread Rick Leir
Hoss Clearly it is U+00FC ü c3 bc LATIN SMALL LETTER U WITH DIAERESIS As in Tübingen "With the Yahoo Flickr Creative Commons 100 Million (YFCC100m) dataset, a great novel dataset was introduced to the computer vision and multimedia research community." -- cool I think it is strange th

Re: recent utf8 problems

2017-11-07 Thread Rick Leir
Dr Krell Item 11): It is best to get the solrconfig.xml provided with the new version of Solr, and change it to suit your needs. Do not try to work from the old version's solrconfig.xml. I did not have time to read the other items. Look in solr.log, and compare the successful query with the un

Re: Admin Console Question

2017-11-14 Thread Rick Leir
Homer In chrome, right-click and choose 'inspect' at the bottom. Now go to the network tab then reload the page. Are you seeing errors? Tell! Thanks Rick On November 14, 2017 3:14:46 PM EST, Shawn Heisey wrote: >On 11/14/2017 11:43 AM, Webster Homer wrote: >> I am using chrome Version 62.0.3202.

Re: How to get a solr core to persist

2017-11-15 Thread Rick Leir
Hi Shawn, Amanda When we put the data under /var/lib, I feel a need to put the config under /etc. Is this recommended, and would you use a symbolic link for the conf dir? Cheers--Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: TimeZone issue

2017-11-16 Thread Rick Leir
Renuka Are your clients all in the same time zone? Solr should support clients in several timezones, and UTC conversion to local is best done in the client in my mind. Thanks -- Rick On November 16, 2017 6:54:47 AM EST, Renuka Srishti wrote: >Thanks for your response Shawn. I know it deals wit

  1   2   3   >