Help needed with AnalyzingInfixLookupFactory [Solr 6.1]

2017-05-19 Thread Sajid Nadeem
Hi Fellow members !! Hope you are doing well !! We are using Solr version 6.1.0 and struggling to build suggestions with AnalyzingInfixLookupFactory. Below is the config we are using in the solrconfig.xml file. We are getting a lock error (listed below) and this fails our index job. text_spe

Re: Lucene Help Needed for Doctoral Student

2014-07-28 Thread Yalamarthi, Vineel
Hey Guys, Can you tell me how to find TF-IDF using lucene ? Regards Vineel On 7/28/14, 3:11 PM, "Prakash Dubey" wrote: >In Lucene 2.4.0 there is *org.apache.lucene.index.TermFreqVector* method >*getTermFrequencies()* method. You can use Apache Maths for other >mathematical operations. Also see

RE: Lucene Help Needed for Doctoral Student

2014-07-28 Thread Uwe Schindler
chlapk...@yahoo.com.INVALID] > Sent: Monday, July 28, 2014 2:49 AM > To: java-user@lucene.apache.org > Subject: Lucene Help Needed for Doctoral Student > > Hi, I am working on my doctoral dissertation in CS and am trying to use > Lucene to do custom similarity measures, namely MMM (mixe

Re: Lucene Help Needed for Doctoral Student

2014-07-28 Thread Prakash Dubey
In Lucene 2.4.0 there is *org.apache.lucene.index.TermFreqVector* method *getTermFrequencies()* method. You can use Apache Maths for other mathematical operations. Also see following Blog for more information. Hope this

Lucene Help Needed for Doctoral Student

2014-07-27 Thread Erin Colvin
Hi, I am working on my doctoral dissertation in CS and am trying to use Lucene to do custom similarity measures, namely MMM (mixed, min and Max), Paice and p-norm and then compare those results to the traditional Boolean similarity and TF/IDF similarity. So far I have tried creating custom simi

Re: EarlyTerminatingSortingCollector help needed..

2014-06-25 Thread Adrien Grand
On Mon, Jun 23, 2014 at 3:56 PM, Ravikumar Govindarajan wrote: > Yes, we can get the top-50 docs finally. I am not denying that. > > I will probably re-phrase my question. Apologize if I am not clear > > How do we ensure global sort-order during search across all segments of the > index, when usin

Re: EarlyTerminatingSortingCollector help needed..

2014-06-23 Thread Ravikumar Govindarajan
> > This means that even though you have eg. 15 segments, if you requested > 50 documents, you will get the top 50 documents out of your > TopHitsCollector. Yes, we can get the top-50 docs finally. I am not denying that. I will probably re-phrase my question. Apologize if I am not clear How do

Re: EarlyTerminatingSortingCollector help needed..

2014-06-23 Thread Adrien Grand
On Sun, Jun 22, 2014 at 6:44 PM, Ravikumar Govindarajan wrote: > For a normal sorting-query, on a top-level searcher, I execute > > TopDocs docs = searcher.search(query, 50, sortField) > > Then I can issue reader.document() for final list of exactly 50 docs, which > gives me a global order across

Re: EarlyTerminatingSortingCollector help needed..

2014-06-22 Thread Ravikumar Govindarajan
Thanks for your reply & clarifications What do you mean by "When I use a SortField instead"? Unless you are > using early termination, Collector.collect is supposed to be called > for every matching document For a normal sorting-query, on a top-level searcher, I execute TopDocs docs = searcher

Re: EarlyTerminatingSortingCollector help needed..

2014-06-21 Thread Adrien Grand
Hi Ravikumar, On Fri, Jun 20, 2014 at 12:14 PM, Ravikumar Govindarajan wrote: > If my "numDocsToCollect" = 50 and no.of. segments = 15, then > collector.collect() will be called 750 times. That is the worst-case indeed. However if some of your segments have less than 50 matches, `collect` will o

EarlyTerminatingSortingCollector help needed..

2014-06-20 Thread Ravikumar Govindarajan
I was planning to use ETSC in-conjunction with SortingMergePolicy and got stuck. In ESTC, we have @Override public void collect(int doc) throws IOException { in.collect(doc); if (++numCollected >= numDocsToCollect) { throw new CollectionTerminatedException(); } } I und

Re: Help needed Regarding classification of Text Data using Lucene..

2013-01-09 Thread Tommaso Teofili
Hi, you can have a look at the (early stage) Lucene classification module on trunk [1], see also a brief introduction given at last ApacheCon EU [2]. Hope this helps, Tommaso [1] : http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/classification/ [2] : http://www.slideshare.net/teofili/tex

Re: Help needed Regarding classification of Text Data using Lucene..

2013-01-09 Thread Shashi Kant
http://www.slideshare.net/teofili/text-categorization-with-lucene-and-solr On Wed, Jan 9, 2013 at 5:46 AM, VIGNESH S wrote: > Hi, > > can anyone suggest me how can i use lucene for text classification. > > -- > Thanks and Regards > Vignesh Srinivasan > > -

Help needed Regarding classification of Text Data using Lucene..

2013-01-09 Thread VIGNESH S
Hi, can anyone suggest me how can i use lucene for text classification. -- Thanks and Regards Vignesh Srinivasan - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@

Re: Help needed: search is returning no results

2012-12-18 Thread Ramon Casha
;>> as-is. >>> >>> See: >>> http://lucene.apache.org/core/4_0_0/core/org/apache/lucene/document/TextField.html >>> >>> But, show us some of your indexed data and queries that fail. >>> >>> -- Jack Krupansky >>> >>&g

Re: Help needed: search is returning no results

2012-12-18 Thread Ramon Casha
gt; http://lucene.apache.org/core/4_0_0/core/org/apache/lucene/document/TextField.html >> >> But, show us some of your indexed data and queries that fail. >> >> -- Jack Krupansky >> >> -Original Message- From: Ramon Casha >> Sent: Tuesday, December

Re: Help needed: search is returning no results

2012-12-18 Thread Ramon Casha
; > -- Jack Krupansky > > -Original Message- From: Ramon Casha > Sent: Tuesday, December 18, 2012 9:14 AM > To: java-user@lucene.apache.org > Subject: Help needed: search is returning no results > > > I have just downloaded and set up Lucene 4.0.0 to implement a searc

Re: Help needed: search is returning no results

2012-12-18 Thread Jack Krupansky
nd queries that fail. -- Jack Krupansky -Original Message- From: Ramon Casha Sent: Tuesday, December 18, 2012 9:14 AM To: java-user@lucene.apache.org Subject: Help needed: search is returning no results I have just downloaded and set up Lucene 4.0.0 to implement a search facility for a w

Re: Help needed: search is returning no results

2012-12-18 Thread Ian Lea
I think you need TextField rather than StringField. See also http://wiki.apache.org/lucene-java/LuceneFAQ#Why_am_I_getting_no_hits_.2BAC8_incorrect_hits.3F -- Ian. On Tue, Dec 18, 2012 at 2:14 PM, Ramon Casha wrote: > I have just downloaded and set up Lucene 4.0.0 to implement a search > faci

German compound decomposition (native speakers: help needed).

2011-06-14 Thread Dawid Weiss
First of all I should probably congratulate my fellow Germans -- Dirk Nowitzki's outstanding performance during this year's NBA finals will become part of the history of basketball. As a Pole, I admit I'm really freaking jealous. Now... back to the subject. A number of people have expressed an in

Re: Help needed on Ant build script for creating Lucene index

2011-05-12 Thread Erik Hatcher
There's an example build file, see It's pretty outdated stuff there though. It has some flexibility for a custom document handler in order to allow full control over how a File gets turned into a Lucene Document

Help needed on Ant build script for creating Lucene index

2011-05-11 Thread Saurabh Gokhale
Hi, Can someone pls direct me to an example where I can get help on creating ant build script for creating lucene index?. It is part of Lucene contrib but I did not get much idea from the documentation on Lucene site. Thanks Saurabh

Re: help needed improving lucene concurret search performance

2009-10-24 Thread Wilson Wu
-- Forwarded message -- From: Wilson Wu Date: 2009/10/24 Subject: Re: help needed improving lucene concurret search performance To: java-user@lucene.apache.org Hi,      Thanks a lot for your reply. There are 4 processors in my system.      I am not sure that 100 threads is going

Re: help needed improving lucene concurret search performance

2009-10-24 Thread Wilson Wu
Hi, Thanks a lot for your reply. There are 4 processors in my system. I am not sure that 100 threads is going to be 10 times slower than 10 threads .Because all the threads don't run serial but parallel. I think when there are 100 customers accessing my system,100 http connections will

Re: help needed improving lucene concurret search performance

2009-10-23 Thread Yonik Seeley
How many processors do you have on this system? If you are CPU bound, 100 threads is going to be 10 times slower (at a minimum) than 10 threads (unless you have more than 10 CPUs). -Yonik http://www.lucidimagination.com On Fri, Oct 23, 2009 at 2:18 AM, Wilson Wu wrote: > Dear Friend, >     I hav

help needed improving lucene concurret search performance

2009-10-22 Thread Wilson Wu
Dear Friend, I have encountered some performance problems recently in lucene search 2.9. I use a single IndexSearcher in the whole system, It seems perfect when there is less than 10 threads doing search concurrenty. Bu if there is more than 100 threads doing concurrent search,the average resp

RE: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Uwe Schindler
Original Message- > From: Jake Mannix [mailto:jake.man...@gmail.com] > Sent: Thursday, October 08, 2009 7:24 AM > To: java-user@lucene.apache.org > Subject: Re: Help needed figuring out reason for maxClauseCount is set to > 1024 error > > When such precision is needed, this

Re: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Jake Mannix
taphi.de > > > > -Original Message- > > From: Jake Mannix [mailto:jake.man...@gmail.com] > > Sent: Thursday, October 08, 2009 2:35 AM > > To: java-user@lucene.apache.org > > Subject: Re: Help needed figuring out reason for maxClauseCount is set

RE: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Uwe Schindler
: u...@thetaphi.de > -Original Message- > From: Jake Mannix [mailto:jake.man...@gmail.com] > Sent: Thursday, October 08, 2009 2:35 AM > To: java-user@lucene.apache.org > Subject: Re: Help needed figuring out reason for maxClauseCount is set to > 1024 error > > On

Re: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Jake Mannix
On Wed, Oct 7, 2009 at 4:42 PM, mitu2009 wrote: > > Hi, > > I've two sets of search indexes. TestIndex (used in our test environment) > and ProdIndex(used in PRODUCTION environment). Lucene search query: > +date:[20090410184806 TO 20091007184806] works fine for test index but > gives > this error

Re: Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread Adriano Crestani
ecords(around 1300) than > those in test one (around 950). > > Thanks for reading. > > -- > View this message in context: > http://www.nabble.com/Help-needed-figuring-out-reason-for-maxClauseCount-is-set-to-1024-error-tp257

Help needed figuring out reason for maxClauseCount is set to 1024 error

2009-10-07 Thread mitu2009
hey only differ wrt to number of records/data.PROD index has got higher number of records(around 1300) than those in test one (around 950). Thanks for reading. -- View this message in context: http://www.nabble.com/Help-needed-figuring-out-reason-for-maxClauseCount-is-set-to-1024-error-tp2579634

Re: Help needed bubbling up relevant records with most recent date

2009-10-02 Thread Karl Wettin
descending value of relevancy. How do i achieve this with Lucene? Please help. -- View this message in context: http://www.nabble.com/Help-needed-bubbling-up-relevant-records-with-most-recent-date-tp25709909p25709909.html Sent from the Lucene - Java Users mail

Re: Help needed ordering search results

2009-10-01 Thread Karl Wettin
rd #3---because it contains both the words of the input together(ie.as a phrase) b.Record #1 c.Record #2 How do i achieve this in lucene? Thanks. -- View this message in context: http://www.nabble.com/Help-needed-ordering-search-results-tp25691552p25691552.html Sent from the Lucene - Java

Help needed bubbling up relevant records with most recent date

2009-10-01 Thread mitu2009
order by date.And then, I want to show remaining records sorted by descending value of relevancy. How do i achieve this with Lucene? Please help. -- View this message in context: http://www.nabble.com/Help-needed-bubbling-up-relevant-records-with-most-recent-date-tp25709909p25709909.html Sent

Help needed ordering search results

2009-09-30 Thread mitu2009
cords in the following order in search results... a.Record #3---because it contains both the words of the input together(ie.as a phrase) b.Record #1 c.Record #2 How do i achieve this in lucene? Thanks. -- View this message in context: http://www.nabble.com/Help-needed-ordering-search-re

Re: Help Needed...

2009-05-28 Thread Karl Wettin
28 maj 2009 kl. 12.22 skrev Gaurav Kumar: Hi everyone, I am doing a project using Lucene where i need to index HTML files. I am using Tika to parse HTML files. But i need to index files according to their tags which means that every text present in different HTML tag (like ) should be s

Re: Help Needed...

2009-05-28 Thread Anshum
Indexing/Storing are at developers discretion. You may choose to store or not store a field as per your requirement. -- Anshum Gupta Naukri Labs! http://ai-cafe.blogspot.com The facts expressed here belong to everybody, the opinions to me. The distinction is yours to draw On Thu, Ma

Re: Help Needed...

2009-05-28 Thread Alexander Aristov
you will need to develop parser and indexer. but remember that in current implementation content is not stored in lucene index, indexed - yes nut not stored. Best Regards Alexander Aristov 2009/5/28 Gaurav Kumar > Hi everyone, > > I am doing a project using Lucene where i need to index HTML

Re: Help Needed...

2009-05-28 Thread Paul Libbrecht
Kumar, you'll have to make your own documents with after parsing yourself the HTML (e.g. with Nekohtml to dom). As for the weights of tokens, supplementarily to IDF, you can do that per field, i.e. when you add a field into the document. paul Le 28-mai-09 à 12:22, Gaurav Kumar a écrit :

Help Needed...

2009-05-28 Thread Gaurav Kumar
Hi everyone, I am doing a project using Lucene where i need to index HTML files. I am using Tika to parse HTML files. But i need to index files according to their tags which means that every text present in different HTML tag (like ) should be stored in different fields. Can i do that. If yes how

Re: Some Help needed in search.

2008-01-29 Thread Doron Cohen
e want to display the > document with WRITER as "Sachin Tendulkar", if it exact match. If we > search > for the word "Sachin", what is working now should remain. > > How can we do this? > > Regards, > Suresh N. > -- > View this messag

Some Help needed in search.

2008-01-28 Thread Sure
rd "Sachin", what is working now should remain. How can we do this? Regards, Suresh N. -- View this message in context: http://www.nabble.com/Some-Help-needed-in-search.-tp15153692p15153692.html Sent from the Lucene - Java Users mail

Re: Help needed

2007-11-23 Thread Hasan Diwan
On 23/11/2007, Liaqat Ali <[EMAIL PROTECTED]> wrote: > *Can not find symbol: method Text (java.lang.String, java.lang.String) > location: class org.apache.lucene.document.Field > document.add(Field.Text("fieldname", text)); > * If you're using Lucene 2.x.x, the static methods of the Field class

Help needed

2007-11-23 Thread Liaqat Ali
I m facing problem while using a small indexing example; IndexWriter writer = new IndexWriter(indexDir, analyzer, createFlag); Document document = new Document(); document.add(Field.Text("fieldname", text)); writer.addDocument(document)

Re: Indexing help needed

2007-05-25 Thread Andrzej Bialecki
jim shirreffs wrote: Thanks for the advice, I just don't see where in the Lucene code I should plug OOParcer into Lucene. I've walked the code in LIUS and Nutch (moving on to Solr) trying to find common objects. If I can find common objects in Lucene and Nutch I'll know where to plug in. Yo

Re: Indexing help needed

2007-05-25 Thread jim shirreffs
code up a Reader the just spites out "Here I am" a few hundred times and see what happens. LOL. thank you for the reply and advice. jim s - Original Message - From: "Andrzej Bialecki" <[EMAIL PROTECTED]> To: Sent: Friday, May 25, 2007 1:10 PM Subject: R

Re: Indexing help needed

2007-05-25 Thread Andrzej Bialecki
jim shirreffs wrote: Thanks to all that try to help me out Jim S P.S. If I get it working I will be happy to email post the code. If you looked at the code in Nutch, you can take most of the parse-oo plugin verbatim, because all this plugin does is it extracts the text content and metadata

Indexing help needed

2007-05-25 Thread jim shirreffs
I've been working on this for a while, I am trying to get the demo code that comes with Lucene to index OpenOffice documentss. I've looked at LIUS code and at Nutch code. But can't find an easy way. So I am digging into the code. I wrote a KcmiDocument class that returns a Document. In it I

Re: Help needed with BooleanQuery formation

2006-01-25 Thread Chris Hostetter
: I want a query of the form: : : x AND ( a OR b OR c OR d) what your code is currenlty doing is adding 5 term queries to a single boolean query. The structure you want is not a single boolean query, it's a boolean query containing two mandatory clauses: the first being a term query, and the sec

Re: Help needed with BooleanQuery formation

2006-01-25 Thread Michael D. Curtin
Michael Pickard wrote: Can anyone help me with the formation of a BooleanQuery ? I want a query of the form: x AND ( a OR b OR c OR d) You're going to need 2 BooleanQuery objects, one for the OR'd expression in parentheses, and another for the AND and expression. Something like this:

Help needed with BooleanQuery formation

2006-01-25 Thread Michael Pickard
Can anyone help me with the formation of a BooleanQuery ? I want a query of the form: x AND ( a OR b OR c OR d) The nearest I've managed to get is query.add(new TermQuery(new Term(2, "x")),true,false); Term term = null; for (int i=1; i