Re: Automatic prefix search in query parser

2021-09-03 Thread Erik Hatcher
A comparable alternative would be to use the edge ngram filter to index prefixes instead. Erik > On Sep 3, 2021, at 10:49 AM, Gauthier Roebroeck > wrote: > > Hello, > > I am using Apache Lucene 8.9.0 to parse queries that are entered by humans. > I am using the > `org.apache.lucen
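A rough illustration of the edge n-gram idea mentioned in this reply: every leading prefix of a term is indexed as its own token, so a plain term lookup on what the user typed behaves like a prefix search. This is a minimal plain-Java sketch, not Lucene's own edge n-gram filter; the class name, method name, and the minGram/maxGram bounds are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: shows which tokens an
// edge n-gram style filter would emit for a single term.
class EdgeNGramSketch {

    // Returns the leading prefixes of term, from minGram up to maxGram
    // characters long (capped at the length of the term).
    static List<String> edgeNGrams(String term, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        int upper = Math.min(maxGram, term.length());
        for (int len = minGram; len <= upper; len++) {
            grams.add(term.substring(0, len));
        }
        return grams;
    }

    public static void main(String[] args) {
        // Indexing these tokens lets the exact term "luc" match "lucene".
        System.out.println(edgeNGrams("lucene", 1, 4)); // prints [l, lu, luc, luce]
    }
}
```

The trade-off is a larger index in exchange for cheaper queries, since no wildcard expansion is needed at search time.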

Re: [VOTE] Lucene logo contest, here we go again

2020-09-01 Thread Erik Hatcher
D (binding) > On Aug 31, 2020, at 8:26 PM, Ryan Ernst wrote: > > Dear Lucene and Solr developers! > > In February a contest was started to design a new logo for Lucene > [jira-issue]. The initial attempt [first-vote] to call a vote resulted in > some confusion on the rules, as well the reques

Re: [VOTE] Lucene logo contest

2020-06-16 Thread Erik Hatcher
C - current logo > On Jun 15, 2020, at 6:08 PM, Ryan Ernst wrote: > > Dear Lucene and Solr developers! > > In February a contest was started to design a new logo for Lucene [1]. That > contest concluded, and I am now (admittedly a little late!) calling a vote. > > The entries are labeled as f

Re: Payload TFIDF Similarity in Lucene 7.1.0

2018-03-13 Thread Erik Hatcher
Payloads are only scored from certain query types. What query are you executing? > On Mar 13, 2018, at 04:58, Grdan Eenc wrote: > > Hej there, > > I want to extend the TFIDF Similarity class such that the term frequency is > neglected and the value in the payload used instead. Therefore I ba

Re: Using POS payloads for chunking

2017-06-14 Thread Erik Hatcher
Markus - how are you encoding payloads as bitsets and use them for scoring? Curious to see how folks are leveraging them. Erik > On Jun 14, 2017, at 4:45 PM, Markus Jelsma wrote: > > Hello, > > We use POS-tagging too, and encode them as payload bitsets for scoring, which > is, as f

Re: Odd Boolean Query behavior in SOLR 3.6

2017-06-13 Thread Erik Hatcher
esults as > expected. > > Thanks for the insight, my understanding here was that fq parameter works > on the result set of q parameter which is *:* here. shouldn't that be the > case here? > > Thanks, > Abhishek > > > > On Tue, Jun 13, 2017 at 6:02 PM,

Re: Odd Boolean Query behavior in SOLR 3.6

2017-06-13 Thread Erik Hatcher
Inner purely negative queries match nothing. A query is about matching, and skipping over things that don’t match. The fix is when using (-something) to do (*:* -something) to match everything and skip the negative clause items. In your example, try fq=((*:* -documentTypeId:3) AND companyId:29
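The point about purely negative clauses can be pictured with sets: a prohibited clause only removes documents from whatever the positive clauses matched, so with no positive clause there is nothing to remove from. The sketch below is a toy model in plain Java, not Lucene or Solr code; the document IDs and the names used are invented for illustration.

```java
import java.util.Set;
import java.util.TreeSet;

// Toy model of a boolean query (hypothetical, not the Lucene API):
// result = docs matched by the positive clause minus docs matched
// by the prohibited clause.
class NegativeClauseSketch {

    static Set<Integer> match(Set<Integer> positive, Set<Integer> prohibited) {
        Set<Integer> result = new TreeSet<>(positive);
        result.removeAll(prohibited);
        return result;
    }

    public static void main(String[] args) {
        Set<Integer> allDocs = Set.of(1, 2, 3, 4); // what *:* would match
        Set<Integer> typeThree = Set.of(2, 4);     // docs with documentTypeId:3

        // (-documentTypeId:3): no positive clause, so nothing matches.
        System.out.println(match(Set.of(), typeThree)); // prints []

        // (*:* -documentTypeId:3): match everything, then skip type 3.
        System.out.println(match(allDocs, typeThree));  // prints [1, 3]
    }
}
```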

Re: question

2017-01-16 Thread Erik Hatcher
Or a no-slop PhraseQuery, where order also matters. Erik > On Jan 16, 2017, at 12:27, Markus Jelsma wrote: > > Yes, they should be the same unless the field is indexed with shingles, in > that case order matters. > Markus > > -Original message- >> From:Julius Kravjar >> Sent: Mo

Re: Combination of BooleanQuery and PhraseQuery

2016-08-15 Thread Erik Hatcher
Try combining into multiple clauses… (with q.op=OR) “some phrase”~ OR (some phrase) That would boost docs with proximity, but still allow matches for docs that don’t contain all terms. Erik > On Aug 15, 2016, at 4:02 AM, Erel Uziel wrote: > > Hi, > Is there any query

Re: multi valued facets

2015-06-04 Thread Erik Hatcher
Set the field to multiValued="true" in your schema. How'd you manage to get multiple values in there without an indexing error? An existing index built with Lucene directly? Erik > On Jun 4, 2015, at 17:27, Fielder, Todd Patrick wrote: > > I am trying to add a facet for which each doc

Re: Custom Relevancy Using Field Payloads

2013-11-29 Thread Erik Hatcher
I think what you want is a PayloadTermQuery in the mix. There's some initial stuff here: https://issues.apache.org/jira/browse/SOLR-1485 Erik On Nov 27, 2013, at 12:55 PM, Furkan KAMACI wrote: > Hi; > > I've asked same question at Solr mail list but could not get any answer. I > have

Re: classic.QueryParser - bug or new behavior?

2013-05-19 Thread Erik Hatcher
Just a thought - this looks like it could be due to the regexp (/pattern/ syntax) support added, but that was added in Lucene 4.0 so it doesn't quite fit that it would be a difference between 4.1 and 4.2.1. Erik On May 19, 2013, at 14:50 , Scott Smith wrote: > I just upgraded from lu

[WEBINAR] - "Lucene/Solr 4 – A Revolution in Enterprise Search Technology"

2013-03-26 Thread Erik Hatcher
ting, Advanced Query Parsers, Distributed search support, Dynamic core management, Performance statistics, and searches for rare values, such as Primary Key Presenter: Erik Hatcher, Lucene/Solr Committer and PMC member Erik Hatcher is the co-author of "Lucene in Action" as well as co-au

Re: ApacheCon meetup

2013-02-19 Thread Erik Hatcher
I've added a Lucene meetup to the Wednesday night meetup proposed schedule. I'm speaking on Wednesday morning. Let's get the word spread to the Portland tech community as well, making it a good way to bring in folks in the area that may not be also attending ApacheCon. Erik On Feb 1

Re: Analyzer in QueryParser behaves differently from IndexWriter

2013-01-13 Thread Erik Hatcher
The analyzer through QueryParser is invoked for each "clause" and thus in your example it's invoked 4 times and thus each invocation only sees one word/term. Erik On Jan 13, 2013, at 2:13, "Igal @ getRailo.org" wrote: > hi, > > I've created an Analyzer that performs a few filtering tasks

Lucene Revolution conference - May 7-10, Boston

2012-02-24 Thread Erik Hatcher
Lucene Revolution will be here May 9-10 in Boston (with training classes offered on May 7-8). Reserve your spot today with Early Bird pricing of $575. Committers and accepted speakers are entitled to free admission. The CFP is open and we’re actively seeking submissions from the Community. Subm

Re: Problem using custom-separator in UpdateCSV ( in solr )

2012-01-08 Thread Erik Hatcher
\t doesn't work in my shell as a tab replacement character. And Solr doesn't expand this sort of thing for you. $ echo "foo\tbar" foo\tbar Try a real tab character instead. Though more realistically you'll be using a file instead, so you won't have to be concerned with a shell for this.

Re: Phonetic search with Lucene 3.2

2011-11-09 Thread Erik Hatcher
On Nov 9, 2011, at 05:11 , Felipe Carvalho wrote: > Can I use Solr as a lib, like Lucene? My company is not willing to install > a Solr server... =/ That's too bad. What's the rationale for that decision? A large number of big big companies are deploying on Solr quite happily. I just taught

Re: Phonetic search with Lucene 3.2

2011-11-09 Thread Erik Hatcher
onetic analyzer implementation? I'm using > MetaphoneReplacementAnalyzer from LIA examples. > > I'm looking at lucene-contrib stuff at > http://lucene.apache.org/java/3_4_0/lucene-contrib/index.html but I can't > seem to find other phonetic analyzers. > > Thanks

Re: Phonetic search with Lucene 3.2

2011-11-08 Thread Erik Hatcher
On Nov 8, 2011, at 05:42 , Felipe Carvalho wrote: >> Yes, quite possible, including boosting on exact matches if you want. Use >> a BooleanQuery to wrap clauses parsed once with phonetic analysis, and once >> without, including fields at indexing time for both too of course. >> > > Would it be

Re: Phonetic search with Lucene 3.2

2011-11-08 Thread Erik Hatcher
re on http://search.maven.org/, but is there a lucene-all > published on some maven repo? or should I get those contrib classes out of > some other dependency? > > Thanks! > > On Tue, Nov 8, 2011 at 10:06 AM, Erik Hatcher wrote: > >> >> On Nov 8, 2011, at 03:58 ,

Re: Phonetic search with Lucene 3.2

2011-11-08 Thread Erik Hatcher
On Nov 8, 2011, at 03:58 , Felipe Carvalho wrote: > One other question: I'm looking at Lucene 3.4 javadocs ( > http://lucene.apache.org/java/3_4_0/api/core/index.html) but I can't find > MetaphoneReplacementAnalyzer anywhere. Does any one know if this class has > been removed from lucene-core. T

Re: Bet you didn't know Lucene can...

2011-10-25 Thread Erik Hatcher
At the group where I worked at UVa once upon a time, a coworker built Juxta, this way cool tool to diff multiple versions of a document visually with heat maps and "difference"-o-meters, and it leverages Lucene analyzers to extract words and positions and such. You can find it here: http://www.

Re: Query Time Boosting Not Working

2011-09-22 Thread Erik Hatcher
Balaji - first, you should e-mail this to the solr-user list, not the java-user list since this is Solr-specific. Next step is to use Solr's debugQuery facility (append debugQuery=true to your request) in order to get the scoring explanations. That'll tell you in gory detail how the scores are

Re: Higher rank for closer matches

2011-09-21 Thread Erik Hatcher
, at 11:08 , Em wrote: > Thanks, Erik. > If PhraseQuery does not keep track of all matching spans, how does it do > its work (in comparison to SpanNearQuery)? > > Regards, > Em > > Am 21.09.2011 19:52, schrieb Erik Hatcher: >> SpanNearQuery does more work than PhraseQue

Re: Higher rank for closer matches

2011-09-21 Thread Erik Hatcher
wrote: > Hi Erik, > > could you explain why PhraseQuery performs better than SpanNearQuery? > > Some time has passed since I read about it, however I think it was > exactly the other way round. > > Thanks! > > Em > > Am 21.09.2011 15:56, schrieb Erik Hatcher: >

Re: Higher rank for closer matches

2011-09-21 Thread Erik Hatcher
PhraseQuery suffices for the stated requirement of boosting when query terms are closer. A common technique is to incorporate a PhraseQuery with a large slop factor of the query terms into the query automatically, which implicitly boosts matching documents when the query terms are closer. A Sp

[Commercial training announcement] Lucene training at Lucene EuroCon, Barcelona - Oct. 17,18, 2011

2011-09-12 Thread Erik Hatcher
http://www.lucidimagination.com/blog/2011/09/12/learn-lucene/ - pasted below too Hi everyone... I'm not usually much on advertising/hyping events where I speak and teach, but I'm really interested in drumming up a solid attendance for our Lucene training that I'll be teaching at Lucene EuroCon i

Re: Help needed on Ant build script for creating Lucene index

2011-05-12 Thread Erik Hatcher
There's an example build file, see It's pretty outdated stuff there though. It has some flexibility for a custom document handler in order to allow full control over how a File gets turned into a Lucene Document

[infomercial] Lucene Refcard at DZone

2011-03-29 Thread Erik Hatcher
I've written an "Understanding Lucene" refcard that has just been published at DZone. See here for details: http://www.lucidimagination.com/blog/2011/03/28/understanding-lucene-by-erik-hatcher-free-dzone-refcard-now-available/ If you're new to Lucene or Solr, this re

Re: Sort results by number of document fields

2011-01-31 Thread Erik Hatcher
On Jan 31, 2011, at 10:51 , Azhar Jassal wrote: > How can I use Lucene to sort search results by the number of fields each > document has? (highest to lowest - documents with more fields in my index > are better results) When you know you need to query on something you have available during index

NoVA/DC - Lucene/Solr Meetup - Wednesday, Nov. 17

2010-11-15 Thread Erik Hatcher
We still have some open spots for the meetup we're hosting this Wednesday night in DC. Come on out, it'll be a great time. Erik

Re: ApacheCon Meetup in Atlanta

2010-10-18 Thread Erik Hatcher
Count me in for any kind of Lucene/Solr hanging out in Atlanta. Erik On Oct 18, 2010, at 14:57 , Grant Ingersoll wrote: > Is there interest in having a Meetup at ApacheCon? Who's going? Would > anyone like to present? We could do something less formal, too, and just > have drinks a

Re: [ANN] Free technical webinar: Mastering the Lucene Index: Wednesday, August 11, 2010 11:00 AM PST / 2:00 PM EST / 20:00 CET

2010-08-13 Thread Erik Hatcher
I have passed this report on to the folks that manage our webinars. Erik On Aug 13, 2010, at 4:51 AM, Stefan Trcek wrote: On Monday 09 August 2010 21:16:30 Mark Miller wrote: Lucid Imagination Presents a free technical webinar: Mastering the Lucene Index Wednesday, August 11, 2010 11

Free Webinar: Findability: Designing the Search Experience

2010-08-12 Thread Erik Hatcher
Here's perhaps the coolest webinar we've done to date, IMO :) I attended Tyler's presentation at Lucene EuroCon* and thoroughly enjoyed it. Search UI/UX is a fascinating topic to me, and really important to do well for the applications most of us are building. I'm pleased to pass along the

Re: understanding lucene

2010-08-09 Thread Erik Hatcher
An even better URL: http://www.manning.com/lucene :) Erik On Aug 8, 2010, at 6:19 AM, Uwe Schindler wrote: Hi Yakob, In this mailing list are all the people who wrote this book, making such a suggestion is not a good idea, especially if you need help in future. You cannot get

[webinar] Rapid Prototyping Search Applications with Solr

2010-06-07 Thread Erik Hatcher
will be an entirely custom built (building it as we speak!) application for the webinar. Erik - Want to get up and running with Apache Solr quickly and easily? Join Erik Hatcher, Apache Solr and Lucene committer and co-founder of Lucid Imagination, for a workshop on getting st

Free Webinar: Implementing Solr open source search in a .NET and DBMS environment; Thurs 27 May 13:00 GMT (9a EDT)

2010-05-23 Thread Erik Hatcher
I'd like to invite you to tune in to a great talk I saw at Apache Lucene Eurocon (www.lucene-eurocon.org) this past week in Prague by Bo Raun, of Nordjyske Medier. The talk was on how he discovered Solr and introduced it successfully in an IT environment whose strategy otherwise totally res

Re: merge results from physically separate hosts

2010-04-26 Thread Erik Hatcher
Solr's distributed search feature is about querying multiple indexes and merging the results. Different indexes, but same schema. Erik On Apr 25, 2010, at 6:02 AM, Shaun Senecal wrote: Is there currently a way to take a query, run it on multiple hosts containing different indexes, th

[WEBINAR] Practical Search with Solr: Beyond just looking it up

2010-04-21 Thread Erik Hatcher
Below is the official announcement for our exciting upcoming webinar. This one is near and dear to my heart, so I'll be eagerly listening too, and participating with my experiences as it fits with the flow of the webinar. I'm a card-carrying library geek, and I've had the pleasure of worki

Re: lucene webinterface

2010-02-19 Thread Erik Hatcher
Again, try LIMO. But what do you mean, no success with Solr? Please elaborate on the issues you encountered and what you tried. Erik On Feb 19, 2010, at 2:41 PM, luciusvorenus wrote: no success with solr Anybody another suggestion ? luciusvorenus wrote: I already have the

Re: lucene webinterface

2010-02-17 Thread Erik Hatcher
Solr can front your Lucene index, and via Solritas[1] it can provide a simple and customizable basic UI. Though to stick with pure Lucene, give LIMO[2] a try. Erik [1] http://www.lucidimagination.com/blog/2009/11/04/solritas-solr-1-4s-hidden-gem/ [2] http://limo.sourceforge.net/ O

Re: During the wild card search, will lucene 2.9.0 to convert the search string to lower case?

2010-02-01 Thread Erik Hatcher
QueryParser has a special capability to lowercase wildcard and prefix queries, simply because they are not passed to an analyzer. Term queries, phrase queries (like your example), etc are passed on to the analyzer. You are using the KeywordAnalyzer for the title field, and thus it is not

Re: Question about relevance

2010-01-08 Thread Erik Hatcher
One technique I've seen commonly used is to index both stemmed and unstemmed fields, and during search query both and boost the unstemmed field matches higher. Erik On Jan 8, 2010, at 4:05 AM, Yannick Caillaux wrote: Hi, I index 2 documents. the first contains the word "Wallis" in

Migrating to Open-Source Search with Lucene and Solr/ Free Webinar 8-Dec 2pm ET/11amPT/1900 GMT

2009-12-02 Thread Erik Hatcher
WEBINAR: Hosted by KMWorld and featuring speakers from The Motley Fool and Lucid Imagination Tuesday, Dec 8: 2pm ET/11amPT/1900 GMT Sign up here: http://www.kmworld.com/webinars/lucid/08dec2009/luc3 Greetings, I'll be presenting along with some of our customers from Motley Fool at a free we

Webinar: Apache Solr 1.4 – Faster, Easier, and More Versatile than Ever

2009-09-30 Thread Erik Hatcher
Excuse the cross-posting and gratuitous marketing :) Erik My company, Lucid Imagination, is sponsoring a free and in-depth technical webinar with Erik Hatcher, one of our co-founders as Lucid Imagination, as well as co-author of Lucene in Action, and Lucene/Solr PMC member and

Free Webinar - Apache Lucene 2.9: Technical Overview of New Features

2009-09-18 Thread Erik Hatcher
Free Webinar: Apache Lucene 2.9: Discover the Powerful New Features --- Join us for a free and in-depth technical webinar with Grant Ingersoll, co-founder of Lucid Imagination and chair of the Apache Lucene PMC. Thursday, Septe

Scandinavia Apache Lucene/Solr September Meetup: 9 September

2009-09-05 Thread Erik Hatcher
nda: • "Solr at the Speed of Light": Erik Hatcher, Lucene/Solr PMC Member and Committer, co-author of Lucene In Action, Lucid Imagination • "Migrating from commercial search engines to Solr",Tobias Larsson Hult and Eskil Andreen, Findwise SE • Presentations followe

Re: Lucene in Action Rev2

2009-08-26 Thread Erik Hatcher
I've pinged Manning to get this corrected. Thanks for the heads-up. Erik On Aug 26, 2009, at 5:58 PM, tsuraan wrote: In the free first chapter of the new Lucene in Action book, it states that it's targetting Lucene 3.0, but on the Manning page for the book, it says the code in the boo

Lucene Meetup - September 3, Mountain View, CA

2009-08-25 Thread Erik Hatcher
Announcing a new Meetup for SFBay Apache Lucene/Solr Meetup! What: SFBay Apache Lucene/Solr June Meetup When: September 3, 2009 6:30 PM Where: Computer History Museum, 1401 N Shoreline Blvd, Mountain View, CA 94043 Presentations and discussions on Lucene/Solr, the Apache Open Source Search

Re: Group by in Lucene ?

2009-08-02 Thread Erik Hatcher
Don't overlook Solr: http://lucene.apache.org/solr Erik On Aug 1, 2009, at 5:43 AM, mschipperheyn wrote: http://code.google.com/p/bobo-browse looks like it may be the ticket. Marc -- View this message in context: http://www.nabble.com/Group-by-in-Lucene---tp13581760p24767693.html

Lucene/Solr Meetup / May 20th, Reston VA, 6-8:30 pm

2009-05-05 Thread Erik Hatcher
Lucene/Solr Meetup / May 20th, Reston VA, 6-8:30 pm http://www.meetup.com/NOVA-Lucene-Solr-Meetup/ Join us for an evening of presentations and discussion on Lucene/Solr, the Apache Open Source Search Engine/Platform, featuring: Erik Hatcher, Lucid Imagination, Apache Lucene/Solr PMC: Solr power

DC/NOVA Lucene&Solr meetup

2009-05-04 Thread Erik Hatcher
My company is co-sponsoring a Lucene/Solr meetup later this month in the Northern VA / DC area (Reston). Details will be coming out soon. We've got one night of talks planned and considering adding another consecutive night. If you're in the area and have a Lucene (any of the Lucene fami

Re: dbsight

2009-04-30 Thread Erik Hatcher
On Apr 30, 2009, at 10:32 PM, Michael Masters wrote: Sweet! I'll look more into solr. I wasn't under the impression solr could index a database like dbsight. It's not point-and-clickable, but Solr's DataImportHandler has sophisticated configuration capabilities for indexing any JDBC acces

Re: ebook resources - including lucene in action

2009-04-20 Thread Erik Hatcher
It is not legal to share purchased e-books in this manner. Please purchase copies of the books you read, otherwise authors have very little incentive to dedicate months (14 months in the case of Lucene in Action, first edition) of their lives to writing this content. Erik On Apr 2

Solr webinar

2009-04-20 Thread Erik Hatcher
(excuse the cross-post) I'm presenting a webinar on Solr. Registration is limited, so sign up soon. Looking forward to "seeing" some of you there! Thanks, Erik "Got data? You can build your own Solr-powered Search Engine!" Erik Hatcher, Lucene/Solr C

Re: Rexex Search

2009-04-13 Thread Erik Hatcher
On Apr 13, 2009, at 5:41 AM, Seid Mohammed wrote: I want to include Regular Expression based searching to my lucene application Anyone who can help? There is a RegexQuery and a SpanRegexQuery available in Lucene's regex contrib:

Re: Sequential match query

2009-04-12 Thread Erik Hatcher
On Apr 12, 2009, at 8:15 AM, Tim Williams wrote: On Sun, Apr 12, 2009 at 5:56 AM, Erik Hatcher wrote: On Apr 11, 2009, at 9:11 PM, Tim Williams wrote: On Sat, Apr 11, 2009 at 12:25 PM, Erick Erickson wrote: That'll teach me to scan a post. The link I sent you is still relevant

Re: Sequential match query

2009-04-12 Thread Erik Hatcher
On Apr 11, 2009, at 9:11 PM, Tim Williams wrote: On Sat, Apr 11, 2009 at 12:25 PM, Erick Erickson wrote: That'll teach me to scan a post. The link I sent you is still relevant, but wildcards are NOT intended to be used to concatenate terms. You want a phrase query or a span query for that. i.

Re: Speed of fuzzy searches

2009-04-03 Thread Erik Hatcher
On Apr 3, 2009, at 10:58 AM, Grant Ingersoll wrote: Now, we have an implementation of JaroWinkler in the spell checker (in fact, we have pluggable distance measures there), perhaps it makes sense to think about how FuzzyQuery could leverage this pluggability? My suggestion is to make it p

Lucene in Action, 2nd edition discounted for a limited time only

2009-03-11 Thread Erik Hatcher
Lucene in Action, 2nd edition, is available in pre-release e-book (and print will ship when it's ready, later this year). Manning just provided me with a limited time discount code to share with the world. Details here: Erik

Re: Lucene Highlighting and Dynamic Summaries

2009-03-07 Thread Erik Hatcher
text again and then pass through the highlighter. The other option is storing the content in the index the downside being index is going to be large. Which would be the recommended approach? Cheers Amin On Sat, Mar 7, 2009 at 10:50 AM, Erik Hatcher >wrote: With the caveat that if you

Re: Lucene Highlighting and Dynamic Summaries

2009-03-07 Thread Erik Hatcher
With the caveat that if you're not storing the text you want highlighted, you'll have to retrieve it somehow and send it into the Highlighter yourself. Erik On Mar 7, 2009, at 5:40 AM, Michael McCandless wrote: You should look at contrib/highlighter, which does exactly this. Mike

Re: Learning Lucene

2009-03-05 Thread Erik Hatcher
On Mar 5, 2009, at 9:24 AM, Tuztuz T wrote: dear all I am really new to lucene Is there anyone who can guid me learning lucene I have lucene in action the old book, but I get hard time to understand the syntaxes in the book and the new lucene release (2.4) Can anyone give me copy of the new lu

Re: Luke site is down?

2009-03-04 Thread Erik Hatcher
On Mar 4, 2009, at 2:08 PM, Ruslan Sivak wrote: Is there a separate mailing list for getopt? Perhaps someone can notify the site owner? I've just sent Andrzej "Luke" Bialecki an e-mail, though I imagine he monitors this list too. Erik

Re: Confidence scores at search time

2009-03-04 Thread Erik Hatcher
ith a correction: <http://www.lucidimagination.com/search/document/52363ad81237162f/confidence_scores_at_search_time > Sorry 'bout that! Erik Mike Ken Williams wrote: On 3/2/09 1:58 PM, "Erik Hatcher" wrote: On Mar 2, 2009, at 2:47 PM, Ken Williams w

Re: Confidence scores at search time

2009-03-02 Thread Erik Hatcher
On Mar 2, 2009, at 2:47 PM, Ken Williams wrote: Finally, I seem unable to get Searcher.explain() to do much useful - my code looks like: Searcher searcher = new IndexSearcher(reader); QueryParser parser = new QueryParser(LuceneIndex.CONTENT, analyzer); Query query = pa

Re: How build Lucene in Action examples

2009-02-28 Thread Erik Hatcher
Please post questions/issues related to Lucene in Action to Manning's Author Online forum at: Thanks, Erik On Feb 27, 2009, at 6:33 PM, tolkienGR wrote: Hi !!! I'm new in Lucene.I started reading Lucene in action (first

Re: Indexer.Java problem

2009-02-21 Thread Erik Hatcher
Also, the first several hits here provide the tricks to update the code to the latest API: :) Erik On Feb 19, 2009, at 10:41 AM, Seid Mohammed wrote: I am using netbeans on windows to test lucene. I

Re: Faceted search with OpenBitSet/SortedVIntList

2009-02-08 Thread Erik Hatcher
On Feb 8, 2009, at 3:32 AM, Raffaella Ventaglio wrote: Hi Chris, The "SortedVIntList" approach is similar to field cache. It's better to use the fieldcache for the facet search, which is the "normal" approach and used in tools like Solr, DBSight, Bobo Browse Engine, etc. Thanks for you

Re: indexing binary files?

2009-01-29 Thread Erik Hatcher
Are these some type of parsable-into-text binary files that you have a parser handy for? Erik On Jan 29, 2009, at 10:43 PM, Paul Feuer wrote: Hi - I've looked on the FAQ, the Java Docs, and searched a little in google, but haven't been able to figure out if Lucene can index binary f

Registration for ApacheCon Europe 2009 is now open!

2009-01-29 Thread Erik Hatcher
Cross-posting this announcement. There are several relevant Lucene/ Solr talks including: Trainings - Lucene Boot Camp (Grant Ingersoll) - Solr Boot Camp (Erik Hatcher) Sessions - Introducing Apache Mahout (Grant) - Lucene Case Studies (Erik) - Advanced Indexing Techniques with

Fwd: [Travel Assistance] Applications for ApacheCon EU 2009 - Now Open

2009-01-23 Thread Erik Hatcher
Begin forwarded message: From: Tony Stevenson Date: January 23, 2009 8:28:19 AM EST To: travel-assista...@apache.org Subject: [Travel Assistance] Applications for ApacheCon EU 2009 - Now Open The Travel Assistance Committee is now accepting applications for those wanting to attend Apa

Re: Lucene in Action book. Problems with first example

2008-12-16 Thread Erik Hatcher
On Dec 16, 2008, at 6:57 AM, Oleg Oltar wrote: Also maybe there are some free manuals/articles that you can recommend for starters? There's a bunch of stuff listed here: Lucene has been changing so rapidly lately that I'm not aware of any

Re: Lucene in Action book. Problems with first example

2008-12-16 Thread Erik Hatcher
On Dec 16, 2008, at 5:53 AM, Oleg Oltar wrote: So is there another manual which I can use to start? (Seems that examples in the book, are carefully chosen for starters, and quite easy to understand) The API differences are all quite minor to adjust to the latest - hopefully the post I poi

Re: Lucene in Action book. Problems with first example

2008-12-16 Thread Erik Hatcher
The first edition of Lucene in Action was written for Lucene 1.4. Lots has changed since then in the API, but the fundamentals are still sound. The code can be easily updated to the newer API following the details I posted here: Do note t

Re: searching in 2 indexes

2008-12-15 Thread Erik Hatcher
On Dec 15, 2008, at 12:44 PM, Chris Bamford wrote: One question (thinking ahead now): how to differentiate between content and reference hits if they are retrieved from the same index? Presumably I will need to try to retrieve a field which only exists in one and if it fails, I know it must

Re: .NET list?

2008-12-12 Thread Erik Hatcher
On Dec 12, 2008, at 9:43 AM, Ian Vink wrote: I am using java-user@lucene.apache.org for help, but sometimes I'd like Lucene.net specific help. Is there a mailing list for Lucene.NET on apache? Yes, see the mail list section here: Erik

Re: Newbie: MatchAllDocsQuery sample?

2008-12-01 Thread Erik Hatcher
On Dec 1, 2008, at 8:30 AM, Ian Vink wrote: Is there a simple example on how to query for "contents:Hello" in all documents using MatchAllDocsQuery ? I want 100% of the docs with "Hello" You're looking

Re: Marked for deletion

2008-12-01 Thread Erik Hatcher
On Dec 1, 2008, at 3:28 AM, Ganesh wrote: I need to index voluminous data and i plan to shard it. The client may not know which shard db to query. Server will take care of complete shard management. I have done almost 50% of development with Lucene. In case of Solr, i think the client sh

Re: Marked for deletion

2008-11-25 Thread Erik Hatcher
On Nov 25, 2008, at 5:00 AM, Ganesh wrote: My index application is a separate process and my search application is part of web ui. When User performs delete, i want to do mark for deletion. I think i have no other option other than to update the document, but index app is a separate proce

Re: [ot] a reverse lucene

2008-11-23 Thread Erik Hatcher
On Nov 22, 2008, at 10:57 PM, Ian Holsman wrote: Hi. apologies for the off-topic question. Not off-topic at all! I was wondering if anyone knew of a open source solution (or a pointer to the algorithms) that do the reverse of lucene. By that I mean store a whole lot of queries, and run the

Re: Filter or Query

2008-11-21 Thread Erik Hatcher
On Nov 20, 2008, at 11:58 PM, Ganesh wrote: I am planning to use Filter for UserID and Date. I will not be able to cache the Filter. I have to create this filter for every request. To my knowledge, Filter will give faster results, only if it is cached. Is it a good idea to use a filter o

Re: Boosting results

2008-11-11 Thread Erik Hatcher
On Nov 11, 2008, at 8:32 AM, Stefan Trcek wrote: On Tuesday 11 November 2008 02:18:39 Erik Hatcher wrote: The integration won't be too painful... the main thing is that Solr requires* some configuration files, literally on the filesystem, in order to fire up and be happy. And you'

Re: Boosting results

2008-11-10 Thread Erik Hatcher
On Nov 10, 2008, at 2:42 PM, Stefan Trcek wrote: On Monday 10 November 2008 13:55:31 Michael McCandless wrote: Finally, you might want to instead look at Solr, which provides facet counting out of the box, rather than roll your own... Doooh - new api, but it's facet counting sounds good. An

Re: robots.txt

2008-10-20 Thread Erik Hatcher
On Oct 20, 2008, at 8:58 AM, Alexander Aristov wrote: Just wonder if Nutch takes into consideration rules from the robots.txt file while crawling a site. Wrong e-mail list, but yeah, Nutch supports robots.txt considerations. Erik

Re: Hiring etiquette

2008-10-19 Thread Erik Hatcher
It's a wiki... create an account and add yourself :) Erik On Oct 19, 2008, at 7:10 PM, Cam Bazz wrote: How can we get on to that list? Best, On Mon, Oct 20, 2008 at 1:58 AM, Hasan Diwan <[EMAIL PROTECTED]> wrote: 2008/10/19 Mark Miller <[EMAIL PROTECTED]>: You might in

Re: Calculation of fieldNorm causes irritating effect of sort order

2008-10-02 Thread Erik Hatcher
On Oct 2, 2008, at 7:39 AM, Jimi Hullegård wrote: Is it possible to disable the lengthNorm calculation for particular fields? Yes, use Field#setOmitNorms(true) when indexing. Erik

Fwd: CFP open for ApacheCon Europe 2009

2008-10-02 Thread Erik Hatcher
Begin forwarded message: From: Noirin Shirley <[EMAIL PROTECTED]> Date: October 2, 2008 4:22:06 AM EDT To: [EMAIL PROTECTED] Subject: CFP open for ApacheCon Europe 2009 Reply-To: [EMAIL PROTECTED] Reply-To: [EMAIL PROTECTED] PMCs: Please send this on to your users@ lists! If you only have th

case studies

2008-10-01 Thread Erik Hatcher
Dear Lucene and Solr users - I'm presenting Lucene/Solr Case Studies at ApacheCon in a month: I would like to feature implementations by YOU. The thing is, my slides are due this Friday, so time is short to collect this info. If you have

Re: Listing fields in an index

2008-08-13 Thread Erik Hatcher
On Aug 13, 2008, at 5:02 AM, John Patterson wrote: How do I list all the fields in an index? Some documents do not contain all fields. Have a look at IndexReader#getFieldNames(). That'll give you back field names regardless of which documents have them. Erik

Re: Indexing sections of TEI XML files

2008-08-13 Thread Erik Hatcher
Have you looked at XTF? It does what you're after and much, much more. Erik On Aug 13, 2008, at 4:03 AM, [EMAIL PROTECTED] wrote: Dear users, Question on approaches to indexing TEI XML or similar section/ subsectioned files. I'm indexi

Re: SpanRegexQuery

2008-08-01 Thread Erik Hatcher
On Jul 31, 2008, at 10:06 PM, Christopher M Collins wrote: I'm trying to use SpanRegexQuery as one of the clauses in my SpanQuery. When I give it a regex like: "L[a-z]+ing" and do a rewrite on the final query I get terms like "Labinger" and "Lackonsingh" along with the expected terms "Labe

Re: Returned mail: see transcript for details

2008-07-15 Thread Erik Hatcher
I've finally successfully removed the offending address from the list. I had tried earlier, but somehow it failed to take, but this time I think it has worked. Let me know off the list if you continue to get this bounce (something I've never seen personally, for the record). E

Re: how to get total hit count for each Searchable?

2008-07-11 Thread Erik Hatcher
On Jul 11, 2008, at 1:13 PM, xin liu wrote: I have individual index files for Audio, Image and PDF files. We build common meta fields for these different data types. When I search for a string, I want the search to return mixed search results from these 3 different index based on relevancy.

Re: Sorting case-insensitively

2008-07-01 Thread Erik Hatcher
On Jun 30, 2008, at 11:08 PM, Paul J. Lucas wrote: On Jun 30, 2008, at 7:00 PM, Erik Hatcher wrote: On Jun 30, 2008, at 8:55 PM, Paul J. Lucas wrote: If I have a SortField with a type of STRING, is there any way to sort in a case-insensitive manner? Only if you unify the case (lower case

Re: Sorting case-insensitively

2008-06-30 Thread Erik Hatcher
On Jun 30, 2008, at 8:55 PM, Paul J. Lucas wrote: If I have a SortField with a type of STRING, is there any way to sort in a case-insensitive manner? Only if you unify the case (lower case everything) on the client side that you send to Solr, but in general no. You can use a text field ty
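The "unify the case" suggestion in this reply amounts to sorting on a lowercased copy of the value while keeping the original for display. The following is a plain-Java sketch of that idea only; it does not use Lucene's SortField, and the class and method names are assumptions for the example. In an index, the lowercased key would typically live in its own untokenized field.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not the Lucene API: sorts display values by a
// lowercased sort key, as one would by indexing a separate sort field.
class LowercaseSortKeySketch {

    // The value that would be stored in the dedicated sort field.
    static String sortKey(String value) {
        return value.toLowerCase(Locale.ROOT);
    }

    // Returns a copy of values ordered by their lowercased keys.
    static List<String> sortBySortKey(List<String> values) {
        List<String> sorted = new ArrayList<>(values);
        sorted.sort(Comparator.comparing(LowercaseSortKeySketch::sortKey));
        return sorted;
    }

    public static void main(String[] args) {
        // A plain string sort would put "Banana" first, because
        // uppercase letters order before lowercase ones.
        System.out.println(sortBySortKey(List.of("cherry", "Banana", "apple")));
        // prints [apple, Banana, cherry]
    }
}
```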

Re: how to statistics categories amount

2008-06-28 Thread Erik Hatcher
On Jun 28, 2008, at 3:57 AM, lutan wrote: if I search a keyword likes 'computer' in a shopping website. the result may contains. total: (1000) products . categories: pc(500) products . notebook (300) products . server(200) products . so how do get each categories 'amount.

Re: random accessing term value

2008-03-26 Thread Erik Hatcher
terms are ordered by lex, seems it is possible for me to randomly get the nth term in the dictionary without having to seek to it. Thoughts? Thanks -John On Tue, Mar 25, 2008 at 11:16 AM, Erik Hatcher <[EMAIL PROTECTED]> wrote: On Mar 25, 2008, at 1:32 PM, John Wang wrote: Is the

Re: random accessing term value

2008-03-25 Thread Erik Hatcher
On Mar 25, 2008, at 1:32 PM, John Wang wrote: Is there a way to random accessing term value in a field? e.g. in my field, content, the terms are: lucene, is, cool Is there a way to access content[2] -> cool? Via term vectors, or reanalysis of the field are two that come to mind.
