Re: Help needed Regarding classification of Text Data using Lucene..

2013-01-09 Thread Tommaso Teofili
Hi, you can have a look at the (early stage) Lucene classification module on trunk [1], see also a brief introduction given at last ApacheCon EU [2]. Hope this helps, Tommaso [1] : http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/classification/ [2] : http://www.slideshare.net/teofili/tex

Re: Help needed Regarding classification of Text Data using Lucene..

2013-01-09 Thread Shashi Kant
http://www.slideshare.net/teofili/text-categorization-with-lucene-and-solr On Wed, Jan 9, 2013 at 5:46 AM, VIGNESH S wrote: > Hi, > > can anyone suggest me how can i use lucene for text classification. > > -- > Thanks and Regards > Vignesh Srinivasan > > -

Re: Help needed: search is returning no results

2012-12-18 Thread Ramon Casha
I verified that the index was correct using the app Luke, tested some queries using it then replicated the results via code. It seems I need to refine the token parsing but at least I have something now. Ramon Casha On 18 December 2012 15:50, Ramon Casha wrote: > Hmm ok I got something. > > > R

Re: Help needed: search is returning no results

2012-12-18 Thread Ramon Casha
Hmm ok I got something. Ramon Casha On 18 December 2012 15:44, Ramon Casha wrote: > I converted them to TextField but the result is the same. > > doc.add(new TextField("text", text.toString(), Store.YES)); > > The search always returns an empty array. > > Ramon Casha > > > On 18 December 201

Re: Help needed: search is returning no results

2012-12-18 Thread Ramon Casha
I converted them to TextField but the result is the same. doc.add(new TextField("text", text.toString(), Store.YES)); The search always returns an empty array. Ramon Casha On 18 December 2012 15:35, Jack Krupansky wrote: > Maybe you wanted "text" fields that are analyzed and tokenized, as o

Re: Help needed: search is returning no results

2012-12-18 Thread Jack Krupansky
Maybe you wanted "text" fields that are analyzed and tokenized, as opposed to string fields which are not analyzed and stored and queried exactly as-is. See: http://lucene.apache.org/core/4_0_0/core/org/apache/lucene/document/TextField.html But, show us some of your indexed data and queries th

Re: Help needed: search is returning no results

2012-12-18 Thread Ian Lea
I think you need TextField rather than StringField. See also http://wiki.apache.org/lucene-java/LuceneFAQ#Why_am_I_getting_no_hits_.2BAC8_incorrect_hits.3F -- Ian. On Tue, Dec 18, 2012 at 2:14 PM, Ramon Casha wrote: > I have just downloaded and set up Lucene 4.0.0 to implement a search > faci

Re: Help needed on Ant build script for creating Lucene index

2011-05-12 Thread Erik Hatcher
There's an example build file, see It's pretty outdated stuff there though. It has some flexibility for a custom document handler in order to allow full control over how a File gets turned into a Lucene Document

Re: help needed improving lucene concurret search performance

2009-10-24 Thread Wilson Wu
-- Forwarded message -- From: Wilson Wu Date: 2009/10/24 Subject: Re: help needed improving lucene concurret search performance To: java-user@lucene.apache.org Hi,      Thanks a lot for your reply. There are 4 processors in my system.      I am not sure that 100 threads is going

Re: help needed improving lucene concurret search performance

2009-10-24 Thread Wilson Wu
Hi, Thanks a lot for your reply. There are 4 processors in my system. I am not sure that 100 threads is going to be 10 times slower than 10 threads .Because all the threads don't run serial but parallel. I think when there are 100 customers accessing my system,100 http connections will

Re: help needed improving lucene concurret search performance

2009-10-23 Thread Yonik Seeley
How many processors do you have on this system? If you are CPU bound, 100 threads is going to be 10 times slower (at a minimum) than 10 threads (unless you have more than 10 CPUs). -Yonik http://www.lucidimagination.com On Fri, Oct 23, 2009 at 2:18 AM, Wilson Wu wrote: > Dear Friend, >     I hav

RE: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Uwe Schindler
Original Message- > From: Jake Mannix [mailto:jake.man...@gmail.com] > Sent: Thursday, October 08, 2009 7:24 AM > To: java-user@lucene.apache.org > Subject: Re: Help needed figuring out reason for maxClauseCount is set to > 1024 error > > When such precision is needed, this

Re: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Jake Mannix
taphi.de > > > > -Original Message- > > From: Jake Mannix [mailto:jake.man...@gmail.com] > > Sent: Thursday, October 08, 2009 2:35 AM > > To: java-user@lucene.apache.org > > Subject: Re: Help needed figuring out reason for maxClauseCount is set

RE: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Uwe Schindler
: u...@thetaphi.de > -Original Message- > From: Jake Mannix [mailto:jake.man...@gmail.com] > Sent: Thursday, October 08, 2009 2:35 AM > To: java-user@lucene.apache.org > Subject: Re: Help needed figuring out reason for maxClauseCount is set to > 1024 error > > On

Re: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Jake Mannix
On Wed, Oct 7, 2009 at 4:42 PM, mitu2009 wrote: > > Hi, > > I've two sets of search indexes. TestIndex (used in our test environment) > and ProdIndex(used in PRODUCTION environment). Lucene search query: > +date:[20090410184806 TO 20091007184806] works fine for test index but > gives > this error

Re: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Adriano Crestani
Hi, Can you provide to us the exception stack trace? Thanks, Adriano Crestani On Wed, Oct 7, 2009 at 7:42 PM, mitu2009 wrote: > > Hi, > > I've two sets of search indexes. TestIndex (used in our test environment) > and ProdIndex(used in PRODUCTION environment). Lucene search query: > +date:[200

Re: Help needed bubbling up relevant records with most recent date

2009-10-02 Thread Karl Wettin
Use a span near query to add boost for the phrases. If you only want to add boost for exact phrases (0 slop) you might want to consider using shingles. In order to add greater score for a date closer in time you can choose between a range of solutions depending on your needs. Using a functi

Re: Help needed ordering search results

2009-10-01 Thread Karl Wettin
Not quite sure what you ask for, but I think you want to use a span near query (for adding boost to phrases) in a disjunction max query (to define weights of the different fields). karl 1 okt 2009 kl. 02.40 skrev mitu2009: Hi, I've 3 records in Lucene index. Record 1 contains healt

Re: Help Needed...

2009-05-28 Thread Karl Wettin
28 maj 2009 kl. 12.22 skrev Gaurav Kumar: Hi everyone, I am doing a project using Lucene where i need to index HTML files. I am using Tika to parse HTML files. But i need to index files according to their tags which means that every text present in different HTML tag (like ) should be s

Re: Help Needed...

2009-05-28 Thread Anshum
Indexing/Storing are at developers discretion. You may choose to store or not store a field as per your requirement. -- Anshum Gupta Naukri Labs! http://ai-cafe.blogspot.com The facts expressed here belong to everybody, the opinions to me. The distinction is yours to draw On Thu, Ma

Re: Help Needed...

2009-05-28 Thread Alexander Aristov
you will need to develop parser and indexer. but remember that in current implementation content is not stored in lucene index, indexed - yes nut not stored. Best Regards Alexander Aristov 2009/5/28 Gaurav Kumar > Hi everyone, > > I am doing a project using Lucene where i need to index HTML

Re: Help Needed...

2009-05-28 Thread Paul Libbrecht
Kumar, you'll have to make your own documents with after parsing yourself the HTML (e.g. with Nekohtml to dom). As for the weights of tokens, supplementarily to IDF, you can do that per field, i.e. when you add a field into the document. paul Le 28-mai-09 à 12:22, Gaurav Kumar a écrit :

Re: Help needed

2007-11-23 Thread Hasan Diwan
On 23/11/2007, Liaqat Ali <[EMAIL PROTECTED]> wrote: > *Can not find symbol: method Text (java.lang.String, java.lang.String) > location: class org.apache.lucene.document.Field > document.add(Field.Text("fieldname", text)); > * If you're using Lucene 2.x.x, the static methods of the Field class

Re: Help needed with BooleanQuery formation

2006-01-25 Thread Chris Hostetter
: I want a query of the form: : : x AND ( a OR b OR c OR d) what your code is currenlty doing is adding 5 term queries to a single boolean query. The structure you want is not a single boolean query, it's a boolean query containing two mandatory clauses: the first being a term query, and the sec

Re: Help needed with BooleanQuery formation

2006-01-25 Thread Michael D. Curtin
Michael Pickard wrote: Can anyone help me with the formation of a BooleanQuery ? I want a query of the form: x AND ( a OR b OR c OR d) You're going to need 2 BooleanQuery objects, one for the OR'd expression in parentheses, and another for the AND and expression. Something like this: