Re: cache monitoring tools?

2011-12-19 Thread Dmitry Kan
Thanks Justin. The reason I decided to ask is how easy it is to bootstrap a system like Munin. This of course depends on how fast one needs it. That is, if SOLR already exposes certain stats via JMX-accessible beans, that will make it easier and faster to set up a tool that can read from JMX. O

Mapping and Capture in ExtractingRequestHandler

2011-12-19 Thread Swapna Vuppala
Hi, I understand that we can specify parameters in ExtractingRequestHandler in solrconfig.xml to capture HTML tags of a particular type and map them to desired solr fields, like something below. div mysolrfield The above setting will capture content in "div" tags and copy to the solr field "m

Re: Replication Index Fetch error

2011-12-19 Thread Eric Tang
That did it. I was running 3.3 on one and 3.4 on another. Thanks! Eric On Mon, Dec 19, 2011 at 11:49 PM, Michael Ryan wrote: > According to http://lucene.apache.org/java/3_4_0/fileformats.html, the > FNMVersion changed from -2 to -3 in Lucene 3.4. Is it possible that the new > master is actua

RE: Replication Index Fetch error

2011-12-19 Thread Michael Ryan
According to http://lucene.apache.org/java/3_4_0/fileformats.html, the FNMVersion changed from -2 to -3 in Lucene 3.4. Is it possible that the new master is actually running 3.4, and the new slave is running 3.2? (This is just a wild guess.) -Michael

issues with WordDelimiterFilter

2011-12-19 Thread Steven Fuchs
Hello All, I'm having an issue with the way the WordDelimiterFilter parses compound words. My field declaration is simple, looks like this: When indexing 'fokker-planck' I do get tokens for fokker, planck, and fokker-planck. But in that case the f

Re: Exception using SolrJ

2011-12-19 Thread Shawn Heisey
On 12/16/2011 12:44 AM, Shawn Heisey wrote: I am seeing exceptions from some code I have written using SolrJ. I have placed it into a pastebin: http://pastebin.com/XnB83Jay No reply in three days, does nobody have any ideas for me? Thanks, Shawn

Re: Solr Optimization Fail

2011-12-19 Thread Chris Hostetter
: Oh, yes on windows, using java 1.6 and Solr 1.4.1. apparently no one has ever written a FAQ on this, so i just added one... https://wiki.apache.org/solr/FAQ#Why_doesn.27t_my_index_directory_get_smaller_.28immediately.29_when_i_delete_documents.3F_force_a_merge.3F_optimize.3F : > Are you on Wi

RE: Poor performance on distributed search

2011-12-19 Thread ku3ia
project2501 wrote > > I see what you are asking. This is an interesting question. It seems > inefficient for Solr to apply the > requested rows to all shards only to discard most of the results on merge. > That would consume lots of resources not used in the final result set. > Yeah, like Erick

RE: Poor performance on distributed search

2011-12-19 Thread Michael Ryan
I had a similar requirement in my project, where a user might ask for up to 3000 results. What I did was change SolrIndexSearcher.doc(int, Set) to retrieve the unique key from the field cache instead of retrieving it as a stored field from disk. This resulted in a massive speed improvement for t

Re: Poor performance on distributed search

2011-12-19 Thread Darren Govoni
I see what you are asking. This is an interesting question. It seems inefficient for Solr to apply the requested rows to all shards only to discard most of the results on merge. That would consume lots of resources not used in the final result set. On 12/19/2011 04:32 PM, ku3ia wrote: Uhm, eith

Re: Poor performance on distributed search

2011-12-19 Thread ku3ia
>>Uhm, either I misunderstand your question or you're doing >>a lot of extra work for nothing. >>The whole point of sharding is exactly to collect the top N docs >>from each shard and merge them into a single result. So if >>you want 10 docs, just specify rows=10. Solr will query all >>the

Re: Inserting a field in the json doc before indexing

2011-12-19 Thread Anuj Kumar
Hi Dipti, If you are receiving the JSON within your Java code, you can try any library, such as GSON [1], to manipulate JSON before sending it to Solr. - Anuj 1. http://code.google.com/p/google-gson/ On Mon, Dec 19, 2011 at 11:54 PM, Dipti Srivastava < dipti.srivast...@apollogrp.edu> wrote: > Hi,
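
The reply above suggests a Java library (GSON) for this. As a rough illustration of the same idea, here is a minimal sketch in Python using only the standard `json` module: parse the incoming document, set one extra field, and serialize it again before posting to Solr. The field name `doc_type`, the value `article`, and the helper name `add_doc_type` are hypothetical and do not come from the thread.

```python
import json


def add_doc_type(raw_json, doc_type, field="doc_type"):
    """Parse a JSON document, set one extra field, and re-serialize it.

    The field name "doc_type" is a placeholder; use whatever field the
    Solr schema actually defines for the document type.
    """
    doc = json.loads(raw_json)
    if not isinstance(doc, dict):
        # This sketch only handles a single JSON object, not a list of docs.
        raise ValueError("expected a single JSON object")
    doc[field] = doc_type
    return json.dumps(doc)


# Hypothetical incoming document from the caller.
incoming = '{"id": "42", "title": "example"}'
outgoing = add_doc_type(incoming, "article")
print(outgoing)
```

The resulting string can then be sent to Solr's JSON update handler in place of the original payload; how the request itself is made is left out here.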

Inserting a field in the json doc before indexing

2011-12-19 Thread Dipti Srivastava
Hi, I am using HTTP and json to add my documents to Solr. Now, I have defined a special field in the Solr schema which defines what type of document is being added. This field needs to be inserted into the json doc that I receive from my caller. Is there a way to do this? Thanks! Dipti __

Re: full-data import suddenly stopped working. Total Rows Fetched remains 0

2011-12-19 Thread PeterKerk
Hi Chantal, I reduced my data-config.xml to a bare minimum: I ran reload-config successfully, but still the same behavior occurs. Oh and the query select * from locations WHERE isapproved='true' returns a lot of results when run directly

RE: multi value field search

2011-12-19 Thread Ronen
Many thanks.. I took the suggestion of using a and it did the trick. --Ronen -- View this message in context: http://lucene.472066.n3.nabble.com/multi-value-field-search-tp3594701p3599082.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: full-data import suddenly stopped working. Total Rows Fetched remains 0

2011-12-19 Thread Chantal Ackermann
Hi Peter, the most probable cause is that your database query returns no results. Have you run the query that DIH is using directly on your database? In the output you can see that DIH has fetched 0 rows from the DB. Maybe your query contains a restriction that suddenly had this effect - like a r

RE: Spellchecker issue related to exact match of query in spellcheck index

2011-12-19 Thread Dyer, James
Pravin, When using the "file-based" spell checking option, it will try to give you suggestions for every query term regardless of whether or not they are in your spelling dictionary. Getting the behavior you want would seem to be a worthy enhancement, but I don't think it is currently supported

full-data import suddenly stopped working. Total Rows Fetched remains 0

2011-12-19 Thread PeterKerk
My full-data import stopped working all of a sudden. AFAIK I have not made any changes that would cause this. Everything is deleted from the index, but no files are added anymore. I don't receive any errors either...:S STARTING via Cygwin: cd /cygdrive/c/My\ Dropbox/inetpub/apache-solr-4.0-2010-

Re: Field collapsing results caching

2011-12-19 Thread astubbs
Why is this? And what happened to http://lucene.472066.n3.nabble.com/Re-Field-Collapsing-disable-cache-td481783.html ? I don't see why basic caching of request -> result shouldn't be the same? I know I could put a layer on top, but I'd like to use a built in cache if possible. -- View this mess

Re: Retrieving Documents

2011-12-19 Thread Erick Erickson
"A programmer has a problem. She tried to solve it with regular expressions. Now she has two problems" You could *try* PatternReplaceCharFilterFactory. Note that this is applied to the entire input string *before* tokenization. I'm thinking you could write a clever regex that transformed every
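
To illustrate the point that a char filter runs on the whole input string before tokenization, here is a small Python sketch. It is not Solr code: `char_filter` and `whitespace_tokenize` are hypothetical stand-ins for PatternReplaceCharFilterFactory and a whitespace tokenizer, and the regex is an invented example, not the one the thread had in mind.

```python
import re


def char_filter(text, pattern, replacement):
    """Stand-in for a pattern-replace char filter: rewrite the raw input string."""
    return re.sub(pattern, replacement, text)


def whitespace_tokenize(text):
    """Stand-in for a whitespace tokenizer."""
    return text.split()


def analyze(text, pattern, replacement):
    # The regex sees the entire input first; tokenization happens afterwards.
    return whitespace_tokenize(char_filter(text, pattern, replacement))


# Because the filter runs before tokenization, the pattern can match across
# what would otherwise become a token boundary (the space after "page").
tokens = analyze("see page 12 now", r"page (\d+)", r"page_\1")
print(tokens)
```

Had the replacement been applied per token after splitting, the pattern `page (\d+)` could never match, since no single token contains the space.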

Re: Poor performance on distributed search

2011-12-19 Thread Erick Erickson
Uhm, either I misunderstand your question or you're doing a lot of extra work for nothing. The whole point of sharding is exactly to collect the top N docs from each shard and merge them into a single result. So if you want 10 docs, just specify rows=10. Solr will query all the shards, get the
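
The merge described above can be sketched in a few lines of Python. This is a simplified model, not Solr's implementation: each shard is represented as a plain list of `(doc_id, score)` pairs, and the function names and sample data are made up for illustration.

```python
import heapq


def top_n(scored_docs, n):
    """Return the n highest-scoring (doc_id, score) pairs, best first."""
    return heapq.nlargest(n, scored_docs, key=lambda d: d[1])


def distributed_top_n(shards, n):
    """Ask every shard for its own top n, then merge and keep the overall top n."""
    per_shard = [top_n(shard, n) for shard in shards]
    merged = [doc for part in per_shard for doc in part]
    return top_n(merged, n)


# Hypothetical shard contents: (doc_id, score).
shards = [
    [("a1", 0.9), ("a2", 0.4), ("a3", 0.1)],
    [("b1", 0.8), ("b2", 0.7)],
    [("c1", 0.95), ("c2", 0.2)],
]
result = distributed_top_n(shards, 3)
print(result)
```

Each shard has to return up to n candidates because, in the worst case, all of the overall top n live on a single shard. That is also why a very large rows value gets expensive: the coordinating node receives up to n candidates per shard and discards most of them.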

Re: automatic delta imports?

2011-12-19 Thread Shawn Heisey
On 12/19/2011 7:09 AM, Mark Juszczec wrote: Hello all I'm seeing the following in my web server log file: [2011-12-19 08:57:00.016] [customersIndex] webapp=/solr path=/dataimport params={command=delta-import&commit=true&optimize=true} status=0 QTime=3 [2011-12-19 08:57:00.018] Starting Delta Im

error when specifying shards parameter in multicore setup

2011-12-19 Thread Robert Stewart
I have a SOLR instance running as a proxy (no data of its own); it just uses a multicore setup where each core has a shards parameter in the search handler. So my setup looks like this: solr_proxy/ multicore/ /public - solrconfig.xml has "shards" pointing to some other SOL

Re: Retrieving Documents

2011-12-19 Thread Dan McGinn-Combs
I can see why you are confused. Re-reading it, I'm confused. Here's my dilemma. I am trying to index some one hundred or so books all in EPUB format. The goal is to provide research functions, i.e. people who need to reference specific quotes, pages and books for their writing. I don't know if EPUB

Re: Fuzzy search testing with Solr Admin Analysis

2011-12-19 Thread Ahmet Arslan
> I try to test fuzzy queries with the Solr Admin Analysis > page at > /solr/admin/analysis.jsp, but it seems to split query terms > with the > fuzzy (~) operator to term and distance value, e.g > 'ddog~0.5' gets > converted to 'ddog' and '0.5'. Obviously that's not what is > wanted. Is > it possib

Re: Solr AutoComplete (Suggester) - Numbers not returned

2011-12-19 Thread solr-ra
You may want to take a look at the autocomplete component using Solr with RankingAlgorithm. It allows you to do edge as well as infix search. You can get more information and also try out a demo from here: http://solr-ra.tgels.org/solr-ra-autocomplete.jsp -- View this message in context: http:/

Fuzzy search testing with Solr Admin Analysis

2011-12-19 Thread Ismo Raitanen
Hi, I try to test fuzzy queries with the Solr Admin Analysis page at /solr/admin/analysis.jsp, but it seems to split query terms with the fuzzy (~) operator to term and distance value, e.g 'ddog~0.5' gets converted to 'ddog' and '0.5'. Obviously that's not what is wanted. Is it possible to test th

automatic delta imports?

2011-12-19 Thread Mark Juszczec
Hello all I'm seeing the following in my web server log file: [2011-12-19 08:57:00.016] [customersIndex] webapp=/solr path=/dataimport params={command=delta-import&commit=true&optimize=true} status=0 QTime=3 [2011-12-19 08:57:00.018] Starting Delta Import [2011-12-19 08:57:00.018] Read dataimport

Re: Copy in multivalued field and faceting

2011-12-19 Thread darul
The first case you mentioned is the one I am looking for. I do not want top terms on a whole index but top terms for a specific query result set. Faceting on my field appears to be the only way to get relevant results of top terms for documents that hit the query. Thanks for LukeRequestHandler and T

Re: Copy in multivalued field and faceting

2011-12-19 Thread Tomás Fernández Löbbe
Hi Darul, it actually depends on whether you want the top terms in the documents that hit a query (in which case you'll need something like the faceting approach you are mentioning) or the top terms for the field in general, regardless of a specific query, in which case the easiest way to go is with the

Re: Default Search UI not working

2011-12-19 Thread Martin Koch
Have you looked here http://wiki.apache.org/solr/VelocityResponseWriter ? /Martin On Mon, Dec 19, 2011 at 12:44 PM, remi tassing wrote: > Hello guys, > the default search UI doesn't work for me. > http://localhost:8983/solr/browse gives me an HTTP 404 error. > I'm using Solr-1.4. Any idea how

Default Search UI not working

2011-12-19 Thread remi tassing
Hello guys, the default search UI doesn't work for me. http://localhost:8983/solr/browse gives me an HTTP 404 error. I'm using Solr-1.4. Any idea how to fix this? Remi

Re: Poor performance on distributed search

2011-12-19 Thread ku3ia
Hi, Erick. Thanks for your advice. >>Here's another test. Add &debugQuery=on to your query and post the results. Here is for 2K rows: 0 53153 on *,score 127.0.0.1:8080/solr/shard1,127.0.0.1:8080/solr/shard2,127.0.0.1:8080/solr/shard3,127.0.0.1:8080/solr/shard4 true 0 (mainstreaming) 2000 >>

Decimal Mapping problem

2011-12-19 Thread Niels Stevens
Hey everybody, I'm having an issue importing Decimal numbers from my MySQL DB to Solr. Does anybody have some advice? I will start by trying to explain my problem. According to my findings, I think the lack of an explicit mapping of a Decimal value in the schema.xml is causing some issues I'm ex

Re: Highlighting uses lots of memory and eventually slows down Solr

2011-12-19 Thread Pranav Prakash
No response!! Bumping it up *Pranav Prakash* "temet nosce" Twitter | Blog | Google On Fri, Dec 9, 2011 at 14:11, Pranav Prakash wrote: > Hi Group, > > I would like to have highlighting for s

Re: Copy in multivalued field and faceting

2011-12-19 Thread darul
Thank you all for the advice. You are obviously right that there is no need for any copyField instructions to get what we expect. I will do some tests on using facet or LukeRequestHandler which seem much more useful in my case. -- View this message in context: http://lucene.472066.n3.nabble.com/Cop

Re: disable stemming on query parser.

2011-12-19 Thread Dmitry Kan
The thing is that if you use stemming, you would want to use it in both the index and query phases. So once you have stemmed your data in the index phase, changing the query parser not to stem wouldn't help, right? In our company we did so that before stemming a word we would append some unseen character