Re: Faceting on a date field multiple times

2012-05-04 Thread Ian Holsman
Thanks Marc. On May 4, 2012, at 8:52 PM, Marc Sturlese wrote: > http://lucene.472066.n3.nabble.com/Multiple-Facet-Dates-td495480.html > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Faceting-on-a-date-field-multiple-times-tp3961282p3961865.html > Sent from the Solr -

Faceting on a date field multiple times

2012-05-03 Thread Ian Holsman
Hi. I would like to be able to do a facet on a date field, but with different ranges (in a single query). for example. I would like to show #documents by day for the last week - #documents by week for the last couple of months #documents by year for the last several years. is there a way to d
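One possible way to get several granularities over the same date field in a single request is to send one `facet.query` per bucket, since `facet.query` may be repeated. The sketch below only builds the request parameters. The field name `created` and the bucket counts (7 days, 8 weeks, 5 years) are assumptions for illustration, not something stated in the thread.

```python
from urllib.parse import urlencode


def offset(unit, n):
    """Solr date-math expression for the start of the period n units ago."""
    return f"NOW/{unit}" if n == 0 else f"NOW/{unit}-{n}{unit}S"


def bucket_queries(field, unit, count, step=1):
    """Range queries for `count` completed buckets of `step` units each.

    Bounds are inclusive, so a document stamped exactly on a boundary
    is counted in two neighbouring buckets.
    """
    return [
        f"{field}:[{offset(unit, (i + 1) * step)} TO {offset(unit, i * step)}]"
        for i in range(count)
    ]


def build_params(field):
    """Request parameters: daily, weekly and yearly buckets in one query."""
    queries = (
        bucket_queries(field, "DAY", 7)            # by day, last week
        + bucket_queries(field, "DAY", 8, step=7)  # by week, last ~2 months
        + bucket_queries(field, "YEAR", 5)         # by year, last 5 years
    )
    params = [("q", "*:*"), ("rows", "0"), ("facet", "true")]
    params += [("facet.query", q) for q in queries]
    return params


if __name__ == "__main__":
    # "created" is a hypothetical date field name.
    print(urlencode(build_params("created")))
```

Each bucket comes back as its own entry under the facet query counts, so the client has to group the entries into the day, week and year series itself.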

Re: how does Solr/Lucene index multi-value fields

2011-05-31 Thread Ian Holsman
> a multi-valued field. > > Best > Erick > > On Tue, May 31, 2011 at 12:16 PM, Ian Holsman wrote: >> >> On May 31, 2011, at 12:11 PM, Erick Erickson wrote: >> >>> Can you explain the use-case a bit more here? Especially the post-query >>> proc

Re: how does Solr/Lucene index multi-value fields

2011-05-31 Thread Ian Holsman
y no > difference between a multi-valued field and storing all the data in a > single field > as far as relevance calculations are concerned. > so.. it will suck regardless.. I thought we had per-field relevance in the current trunk. :-( > Best > Erick > > On Tue, May

how does Solr/Lucene index multi-value fields

2011-05-31 Thread Ian Holsman
Hi. I want to store a list of documents (say each being 30-60k of text) into a single SolrDocument. (to speed up post-retrieval querying) In order to do this, I need to know if lucene calculates the TF/IDF score over the entire field or does it treat each value in the list as a unique field?

[ANN] Zoie Solr Plugin - Zoie Solr Plugin enables real-time update functionality for Apache Solr 1.4+

2010-03-07 Thread Ian Holsman
I just saw this on twitter, and thought you guys would be interested.. I haven't tried it, but it looks interesting. http://snaprojects.jira.com/wiki/display/ZOIE/Zoie+Solr+Plugin Thanks for the RT Shalin!

Re: If you could have one feature in Solr...

2010-02-28 Thread Ian Holsman
On 2/24/10 8:42 AM, Grant Ingersoll wrote: What would it be? most of this will be coming in 1.5, but for me it's - sharding.. it still seems a bit clunky secondly.. this one isn't in 1.5. I'd like to be able to find "interesting" terms that appear in my result set that don't appear in th

Re: Improvising solr queries

2010-01-04 Thread Ian Holsman
On 1/5/10 12:46 AM, Shalin Shekhar Mangar wrote: sitename:XYZ OR sitename:"All Sites") AND (localeid:1237400589415) AND > ((assettype:Gallery)) AND (rbcategory:"ABC XYZ" ) AND (startdate:[* TO > 2009-12-07T23:59:00Z] AND enddate:[2009-12-07T00:00:00Z TO > *])&rows=9&start=63&sort=date > desc

Re: Adaptive search?

2009-12-21 Thread Ian Holsman
On 12/18/09 2:46 AM, Siddhant Goel wrote: Let say we have a search engine (a simple front end - web app kind of a thing - responsible for querying Solr and then displaying the results in a human readable form) based on Solr. If a user searches for something, gets quite a few search results, and t

Re: Chrome Web Browser doesn't render properly

2009-07-15 Thread Ian Holsman
Brian Klippel wrote: Nope, chrome treats xml as html. Either view source or use another browser. I always thought the XML output should contain a XSLT file in it by default. that way I could debug with safari (and chrome). -Original Message- From: Jason Rutherglen [mailto:jason.rut

Re: Facets with an IDF concept

2009-06-23 Thread Ian Holsman
Asif Rahman wrote: Hi Grant, I'll give a real life example of the problem that we are trying to solve. We index a large number of current news articles on a continuing basis. We tag these articles with news topics (e.g. Barack Obama, Iran, etc.). We then use these tags to facet our queries.

Auto suggest.. how to do mixed case

2009-06-19 Thread Ian Holsman
hi guys. I've noticed that one of the new features in Solr 1.4 is the Termscomponent which enables the Autosuggest. but what puzzles me is how to actually use it in an application. most autosuggests are case insensitive, so there is no difference if I type in 'San Francisco' or 'san francisco'.

Re: storing complex types in a multiValued field

2009-01-11 Thread Ian Holsman
hole thing about stuffing 2 fields into the same field irks me thats all. I've got them set up as 2 separate MV fields at the moment. On Mon, Jan 12, 2009 at 5:36 AM, Ian Holsman wrote: hi. I don't think this is a FAQ, but it's been bugging me for a while. I want to store ke

storing complex types in a multiValued field

2009-01-11 Thread Ian Holsman
hi. I don't think this is a FAQ, but it's been bugging me for a while. I want to store key/value pairs in a single field. for example multiValued="true" /> where keyval would be a ID# and the value. I'm guessing it is as simple as creating my own field class, but I was wondering if there we

Re: Solr security

2008-11-17 Thread Ian Holsman
Getting warmer! Erik On Nov 17, 2008, at 4:11 PM, Ian Holsman wrote: if thats the case putting apache in front of it would be handy. something like order deny,allow deny from all allow from 192.168.0.1 might be helpful. Sean Timm wrote: I believe the Solr replication scripts require P

Re: Solr security

2008-11-17 Thread Ian Holsman
if thats the case putting apache in front of it would be handy. something like order deny,allow deny from all allow from 192.168.0.1 might be helpful. Sean Timm wrote: I believe the Solr replication scripts require POSTing a commit to read in the new index--so at least limited POST capabilit

Re: Solr security

2008-11-17 Thread Ian Holsman
There was a patch by Sean Timm you should investigate as well. It limited a query so it would take a maximum of X seconds to execute, and would just return the rows it had found in that time. Feak, Todd wrote: I see value in this in the form of protecting the client from itself. For example

Re: Solr security

2008-11-16 Thread Ian Holsman
Erik Hatcher wrote: On Nov 16, 2008, at 5:41 PM, Ian Holsman wrote: First thing I would look at is disabling write access, or writing a servlet that sits on top of the write handler to filter your data. We can turn off all the update handlers, but how does that affect replication? Can a

Re: Solr security

2008-11-16 Thread Ian Holsman
Erik Hatcher wrote: I'm pondering the viability of running Solr as effectively a UI server... what I mean by that is having a public facing browser-based application hitting a Solr backend directly for JSON, XML, etc data. I know folks are doing this (I won't name names, in case this thread c

Re: solrj and CLOSE_WAIT's

2008-11-14 Thread Ian Holsman
Ryan McKinley wrote: not sure if it is something we can do better or part of HttpClient... From: http://www.nabble.com/CLOSE_WAIT-td19959428.html it seems to suggest you may want to call: con.closeIdleConnections(0L); But if you are creating a new MultiThreadedHttpConnectionManager for each r

Re: solrj and CLOSE_WAIT's

2008-11-14 Thread Ian Holsman
Ryan McKinley wrote: not sure if it is something we can do better or part of HttpClient... From: http://www.nabble.com/CLOSE_WAIT-td19959428.html it seems to suggest you may want to call: con.closeIdleConnections(0L); But if you are creating a new MultiThreadedHttpConnectionManager for each r

solrj and CLOSE_WAIT's

2008-11-13 Thread Ian Holsman
Hi guys. I'm running a little upload project that uploads documents into a solr index. there is also a 2nd thread that runs a deleteby query and an optimize every once in a while. in an effort to reduce the probability of things being held onto I've made everything local, but it is still collec

Re: Release date of SOLR 1.3

2008-05-19 Thread Ian Holsman (Lists)
Noble Paul നോബിള്‍ नोब्ळ् wrote: If you are looking for an immediate need waiting for a release I must advise you against waiting for the solr1.3 release. The best strategy would be to take a nightly and start using it. Test it thoroughly and if bugs are found report them back. If everything is

Re: Solr replication by solr (for windows)

2008-04-29 Thread Ian Holsman
The current scripts use rsync to minimize the amount of data actually being copied. I've had a brief look and found only 1 implementation which is GPL and abandoned http://sourceforge.net/projects/jarsync. Personally I still think the size of the transfer is important (as for most use cases

Re: unique values from a field in a result

2008-04-29 Thread Ian Holsman
Hi Thijs. If you are not concerned with a *EXACT* number there is a paper that was published in 1990 that discusses this problem. http://dblab.kaist.ac.kr/Publication/pdf/ACM90_TODS_v15n2.pdf from the paper (If I understand it correctly) For 120,000,000 records you can sample 10,112,529 reco

Re: Lucene-based Distributed Index Leveraging Hadoop

2008-02-06 Thread Ian Holsman
Clay Webster wrote: There seem to be a few other players in this space too. Are you from Rackspace? (http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data) AOL also has a Hadoop/Solr project going on. CNET does not have much brewing there. Although Yo

Re: leading wildcards

2007-11-15 Thread Ian Holsman
the solution that works for me is to store the field in reverse order, and have your application reverse the field in the query. so the field www.example.com would be stored as moc.elpmaxe.www so now I can do a search for *.example.com in my application. Regards Ian (hat tip to erik for the id
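The reversal trick can be sketched as below: the application reverses the value before indexing and turns a leading-wildcard pattern into a trailing-wildcard (prefix) pattern against the reversed field. The function names are hypothetical, and the sketch assumes plain character-by-character reversal with a single leading `*`.

```python
def reverse_value(value: str) -> str:
    """Value to store in the reversed field at index time."""
    return value[::-1]


def leading_wildcard_to_prefix(pattern: str) -> str:
    """Rewrite '*suffix' as a prefix pattern for the reversed field."""
    if not pattern.startswith("*") or "*" in pattern[1:]:
        raise ValueError("expected exactly one wildcard, at the start")
    # Drop the leading '*', reverse the rest, put the wildcard at the end.
    return pattern[1:][::-1] + "*"


if __name__ == "__main__":
    print(reverse_value("www.example.com"))             # moc.elpmaxe.www
    print(leading_wildcard_to_prefix("*.example.com"))  # moc.elpmaxe.*
```

A prefix pattern such as `moc.elpmaxe.*` matches every stored value that starts with the reversed suffix, which is what the original leading wildcard asked for.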

Re: where did my foreign language go?

2007-10-24 Thread Ian Holsman
Thanks.. I'll do that sunrise1984 wrote: > Maybe the following is useful for you.(It comes from > http://wiki.apache.org/solr/SolrTomcat) > > If you are going to query Solr using international characters (>127) using > HTTP-GET, you must configure Tomcat to conform to the URI standard by > accep

where did my foreign language go?

2007-10-24 Thread Ian Holsman
Hi. I'm in the middle of bringing up a new solr server and am using the trunk. (where I was using an earlier nightly release of about 2-3 weeks ago on my old server) now, when I do a search for "日本" (japan) it used to show the kanji in the area, but now it shows gibberish instead "日本"

Re: Seeing if an entry exists in an index for a set of terms

2007-10-03 Thread Ian Holsman
Yonik Seeley wrote: On 10/3/07, Ian Holsman <[EMAIL PROTECTED]> wrote: Hi. I was wondering if there was a easy way to give solr a list of things and finding out which have entries. ie I pass it a list Bill Clinton George Bush Mary Papas (and possibly 20 others) to a solr index

Seeing if an entry exists in an index for a set of terms

2007-10-03 Thread Ian Holsman
Hi. I was wondering if there was a easy way to give solr a list of things and finding out which have entries. ie I pass it a list Bill Clinton George Bush Mary Papas (and possibly 20 others) to a solr index which contains news articles about presidents. I would like a response saying bil

Re: Geographical distance searching

2007-09-26 Thread Ian Holsman
Have you guys seen Local Lucene ? http://www.nsshutdown.com/projects/lucene/whitepaper/locallucene.htm no need for mysql if you don't want to. rgrds Ian Will Johnson wrote: With the new/improved value source functions it should be pretty easy to develop a new best practice. You should be a

Re: Nutch with SOLR

2007-09-25 Thread Ian Holsman
Thanks Brian. I'm sure this will help lots of people. Brian Whitman wrote: But we still use a version of Sami's patch that works on both trunk nutch and trunk solr (solrj.) I sent my changes to sami when we did it, if you need it let me know... I put my files up here: http://variogr.am/la

Re: real time search

2007-09-25 Thread Ian Holsman
we had been doing some work with this, and had gotten to the architecture stage on this at $WORK, but the guy who was leading the charge got put onto other tasks, and left before having a chance to implement it, and our priorities shifted to other things ;( From what I remember, there are seve

Re: Nutch with SOLR

2007-09-25 Thread Ian Holsman
[moving this thread to solr-user, as it really has nothing to do with hadoop] Daniel Clark wrote: There's info on website http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html, but it's not clear. Sami has a patch in there which used an older version of the solr cl

Solr Injection

2007-07-02 Thread Ian Holsman
Hi. I've been playing with Kettle (http://kettle.pentaho.org/ ) as a method to inject data into Solr (and other things at the same time), and it looks really promising. I was wondering if anyone else had some experience using it with Solr and if they set it up to add a document at a time, or

RDF uploader -- has anyone built such a beast?

2007-06-19 Thread Ian Holsman
Hi. For a project i'm working on, I'm getting a RDF formatted feed. I was wondering if someone has built a RDF to solr upload function similar to the CSV and mysql ones sitting in Jira. regards Ian

Re: how to crawl when Solr is search engine?

2007-06-07 Thread Ian Holsman
ient side. otherwise there is some server side tools (cocoon I think does this) to do the transform on the server before sending it out. --Ian On 6/7/07, Ian Holsman <[EMAIL PROTECTED]> wrote: Hi Manoharam. we use nutch to do the crawl, and have used sami's patch of nutch (http://bl

Re: solr+hadoop = next solr

2007-06-07 Thread Ian Holsman
Yonik Seeley wrote: On 6/6/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote: In terms of the FederatedSearch wiki entry (updated last year), has there been any progress made this year on this topic, at least something worthy of being added or updated to the wiki page? Priorities shifted, and I d

Re: how to crawl when Solr is search engine?

2007-06-07 Thread Ian Holsman
Hi Manoharam. we use nutch to do the crawl, and have used sami's patch of nutch (http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html ) to have it integrate with Solr. It works quite well for our needs. If you are concerned with the speed, Solr also has a CSV upload

Re: Requests per second/minute monitor?

2007-05-10 Thread Ian Holsman
a bit more with the idea last night and came up with https://issues.apache.org/jira/browse/SOLR-232 which allows a tomcat container access to somewhere we can put the data we need from the request. feedback is welcome, as I definitely don't have all the answers, and am a newbie when it comes to s

Re: Requests per second/minute monitor?

2007-05-09 Thread Ian Holsman
Walter Underwood wrote: > This is for monitoring -- what happened in the last 30 seconds. > Log file analysis doesn't really do that. > I would respectfully disagree. Log file analysis of each request can give you that, and a whole lot more. you could either grab the stats via a regular cron j
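As a rough illustration of the log-analysis approach, the sketch below counts the requests seen in the last 30 seconds from per-request log lines. The log format (an ISO 8601 timestamp as the first whitespace-separated token) and the function names are assumptions for illustration. A real access log would need its own parser.

```python
from datetime import datetime, timedelta


def parse_timestamp(line: str) -> datetime:
    """Read the timestamp from the first token of a log line (assumed format)."""
    return datetime.fromisoformat(line.split()[0])


def requests_in_window(lines, now, window_seconds=30):
    """Count log lines whose timestamp falls in (now - window, now]."""
    cutoff = now - timedelta(seconds=window_seconds)
    return sum(1 for line in lines if cutoff < parse_timestamp(line) <= now)


def requests_per_second(lines, now, window_seconds=30):
    """Average request rate over the window."""
    return requests_in_window(lines, now, window_seconds) / window_seconds


if __name__ == "__main__":
    sample = [
        "2007-05-09T10:00:01 GET /solr/select?q=foo",
        "2007-05-09T10:00:20 GET /solr/select?q=bar",
        "2007-05-09T10:00:29 GET /solr/select?q=baz",
    ]
    now = datetime(2007, 5, 9, 10, 0, 30)
    print(requests_in_window(sample, now))
    print(requests_per_second(sample, now))
```

Run from a regular job over the tail of the log, the same per-request lines can also feed slower reports, which is the "whole lot more" that keeping the raw data allows.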

Re: SolPHP

2006-07-02 Thread Ian Holsman
plus other (non-Python) sites. -- Wade Leftwich Ithaca, NY On 6/1/2006, Ian Holsman wrote: I think I could get some python bindings off those as well. and if people feel there is a need some C/APR ones as well. On 02/06/2006, at 11:16 AM, Brian Lucas wrote: Erik, I'll get the PHP bindings

Re: newbie Q regarding schema configuration

2006-06-20 Thread Ian Holsman
thanks for the input Chris (and Yonik) i'm not sure lucene is the best answer for what I want to do ;( regards Ian On 20/06/2006, at 5:58 PM, Chris Hostetter wrote: : so.. my first question in schema.xml, can you have a composite key as : the 'uniquekey' field, or do i need to do this on t

newbie Q regarding schema configuration

2006-06-19 Thread Ian Holsman
hi. so I finally managed to find a bit of time to get a SolR instance going, and now have some questions about it ;-) first the application is tagging. ie.. to associate some keywords with a given item, and to show them on a particular object (you can see this in action here http://econom

Re: SolPHP

2006-06-01 Thread Ian Holsman
I think I could get some python bindings off those as well. and if people feel there is a need some C/APR ones as well. On 02/06/2006, at 11:16 AM, Brian Lucas wrote: Erik, I'll get the PHP bindings out to see how they suit the needs of people and use that feedback for the Rails bindings. I

Re: Graduation, and SoC

2006-05-31 Thread Ian Holsman
On 01/06/2006, at 3:01 AM, Chris Hostetter wrote: : It looks like I may finally get to play with Solr in some project of : mine. I checked the Wiki and the Incubator status page for Solr, and : the whole thing looks quite "feature-complete". Are there plans to : graduate from the Incubato

Large test lucene DB to use with a SolR benchmark

2006-03-29 Thread Ian Holsman
know of a publicly available production size database which I can use? otherwise I will generate something off dmoz or randomly. regards Ian. -- Ian Holsman Zilbo.com / (425) 296-6771 USA/ ++61 (03) 9877-0909 Australia A good hockey player plays where the puck is. A great hockey player p