Re: Get fields from a Query object

2010-08-02 Thread Anuj Shah
My code has been given a query string, which we parse into the Query object
and would like to get a list of fields from.

I'm assuming there exists a method to do so, as it seems like a useful
function. If not should I be parsing the string for fields myself.

Anuj





On Sun, Aug 1, 2010 at 2:21 AM, Erick Erickson wrote:

> Could you explain more about what you're trying to do? You're writing the
> query
> after all, so you probably already know what went into it.
>
> Which shows that I don't understand what you want to do at all.
>
> Best
> Erick
>
> On Sat, Jul 31, 2010 at 9:41 AM, Anuj Shah  wrote:
>
> > Hi,
> >
> > Is there a way to get all the fields involved in a query?
> >
> > Thanks
> >
> > Anuj
> >
>


creating tag cloud (with faceted search?) for search result (filter)

2010-08-02 Thread ArminS

Hi guys,

I did some extensive research over the last days, also searched the threads
in this forum (big compliment to the users helping here!) about creating a
tag cloud of the search result(s). But I still couln't find something
satisfying me yet...

Background:

I have lots of user text comments (unstructured) stored in the database,
collected over a website as feedback.
My goal is, first to make this amount of data searchable/filterable with
good performance and then I want to create a tag cloud of the results after
searching for a single term, so that I can use the tag cloud then for
iterative search refinement by clicking on one term (for filtering again) in
the cloud! No discussion about the tag cloud decision please :) Focus is on
single terms/tags mainly, no phrases.

Lets imagine doing a search/filter with the word "performance":

- while showing the results, a tag cloud should be generated with all the
terms connected to those text comments, in which "performance" appears.
- when I click on a word in this tagcloud, it should do the same again for
this selected word etc.
=> have the "corpus" filtered in a visualized way as a tag cloud

Actually it goes in this direction ("Drill Clouds"):
http://lab.cisti-icist.nrc-cnrc.gc.ca/cistilabswiki/index.php/Ungava

Now my questions on that:

Do I need Solr (because of the faceted search feature) or is it also
possible with lucene only?
If Solr - which steps are neccessary to do a faceted search/navigation as a
tag cloud?
I know it has something to do with "term vectors", right?
If it's not possible to realise it 100%, no worries, just let me know, how
far I can go.

Please don't be too informatic in the first place, as I'm not a developer :)

Glad for any help of you guys!

Markus


Related threads to this:

http://bit.ly/94y97g
http://www.gossamer-threads.com/lists/lucene/java-user/88048
http://markmail.org/message/bjn2dhy5i4rh6cgg#query:+page:1+mid:czvpv2wt5e7gtqlu+state:results
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/creating-tag-cloud-with-faceted-search-for-search-result-filter-tp1015955p1015955.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



Re: Get fields from a Query object

2010-08-02 Thread Erick Erickson
Did you look at Query.extractTerms? I think that'll work for you.
Note that the query must be rewritten, and that the set of terms will
have duplicate fields. i.e. if you search field1:Erick +field1:James
I expect you'll have two terms in the set that are on field1.

Best
Erick

On Mon, Aug 2, 2010 at 8:16 AM, Anuj Shah  wrote:

> My code has been given a query string, which we parse into the Query object
> and would like to get a list of fields from.
>
> I'm assuming there exists a method to do so, as it seems like a useful
> function. If not should I be parsing the string for fields myself.
>
> Anuj
>
>
>
>
>
> On Sun, Aug 1, 2010 at 2:21 AM, Erick Erickson  >wrote:
>
> > Could you explain more about what you're trying to do? You're writing the
> > query
> > after all, so you probably already know what went into it.
> >
> > Which shows that I don't understand what you want to do at all.
> >
> > Best
> > Erick
> >
> > On Sat, Jul 31, 2010 at 9:41 AM, Anuj Shah 
> wrote:
> >
> > > Hi,
> > >
> > > Is there a way to get all the fields involved in a query?
> > >
> > > Thanks
> > >
> > > Anuj
> > >
> >
>


Register now for Surge 2010

2010-08-02 Thread Jason Dixon
Registration for Surge Scalability Conference 2010 is open for all
attendees!  We have an awesome lineup of leaders from across the various
communities that support highly scalable architectures, as well as the
companies that implement them.  Here's a small sampling from our list of
speakers:

John Allspaw, Etsy
Theo Schlossnagle, OmniTI
Rasmus Lerdorf, creator of PHP
Tom Cook, Facebook
Benjamin Black, fast_ip
Artur Bergman, Wikia
Christopher Brown, Opscode
Bryan Cantrill, Joyent
Baron Schwartz, Percona
Paul Querna, Cloudkick

Surge 2010 focuses on real case studies from production environments;
the lessons learned from failure and how to re-engineer your way to a
successful, highly scalable Internet architecture.  The conference takes
place at the Tremont Grand Historic Venue on Sept 30 and Oct 1, 2010 in
Baltimore, MD.  Register now to enjoy the Early Bird discount and
guarantee your seat to this year's event!

http://omniti.com/surge/2010/register

Thanks,

-- 
Jason Dixon
OmniTI Computer Consulting, Inc.
jdi...@omniti.com
443.325.1357 x.241

-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



heads up -- index corruption on Solr/Lucene trunk/3.x branch

2010-08-02 Thread Michael McCandless
This issue:

https://issues.apache.org/jira/browse/LUCENE-2574

which was committed 3 days ago (Friday Jul 30) can cause index corruption.

I just committed a fix for the corruption, but if you've been using
Solr/Lucene trunk or 3x branch updated after the first commit on
Friday, and you built any indexes with it, you should rebuild them!

Sorry,

Mike

-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



Modify how a field value is stored in Lucene

2010-08-02 Thread Fernando Wasylyszyn
Hi all. This is my question. Currently, I'm working in a project where I have 
Lucene documents with one field that use payloads. For this field, I use 
org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilter, so the value 
for that field is something like: "fieldValue\1.0" where '\' is the payload 
delimiter and "1.0" is the field value payload. The thing is that the values of 
this field are stored in the index, so when I retrieve the field value from the 
index, I get "fieldValue\1.0" and manually have to remove the payload from the 
field value in order to show it to the user, put it in some XML or whatever. 
There is any way to modify how a field value is stored in the index as it can 
be 
modified how the value is indexed vía TokenFilters? In this particular case, 
the 
needed modification is remove the payload from the index stored field value, 
but 
there are a lot of cases where something like this could be useful. Thanks in 
advance. Cheers.