回复: How to map lucene scores to range from 0~100?

2014-11-11 Thread Harry Yu
to search POI. And I want final score to show they text similarity. It like inverse of levenshtein distance of query and hit docs.‍ Do you have any ideas. Sincerely thank you~ Yu -- 原始邮件 -- 发件人: "Rajendra Rao";; 发送时间: 2014年11月11日(星期二) 晚上10:55

How to map lucene scores to range from 0~100?

2014-11-11 Thread Harry Yu
Hi everyone, I met a new trouble. In my system, we should score the doc range from 0 to 100. There are some easy ways to map lucene scores to this scope. Thanks for your help~ Yu

How to improve the performance in Lucene when query is long?

2014-11-10 Thread Harry Yu
Hi everyone, I have been using Lucene to build a POI searching & geocoding system. After test, I found that when query is long(above 10 terms). And the speed of searching is too slow near to 1s. I think the bottleneck is that I used OR to generate my BooleanQuery. It would get plenty of candi

How to rank by name in the same score when using DisjunctionMaxQuery?

2014-09-22 Thread Harry Yu
Hi everyone, I searched an index with 3 its' fields({"name", "addr", "fullname"}), and used DisjunctionMaxQuery to rank the results by the max score of 3 fields. When the hits have same score, lucene would rank by doc Id(low doc Id would be front). But I don't want rank by doc Id in that ca

solr facet query with Tagging and Excluding Filters

2014-09-18 Thread Andy Yu
Hi guys, I want to do a facet with facet query,and let it has the [Tagging and Excluding Filters] ( https://cwiki.apache.org/confluence/display/solr/Faceting)style which facet.field has,so how to do it , pls guide me! Thanks, Andy

回复: Speed up searching in multiple-thread?

2014-09-15 Thread Harry Yu
rvice, include accessing complete data form db, calling REST service etc. Regards, Harry Yu -- 原始邮件 -- 发件人: "Toke Eskildsen";; 发送时间: 2014年9月15日(星期一) 下午4:47 收件人: "java-user@lucene.apache.org"; 主题: Re: Speed up searching in multiple-thread?

回复: Speed up searching in multiple-thread?

2014-09-15 Thread Harry Yu
. Best Regards, Harry Yu -- 原始邮件 -- 发件人: "Michael McCandless";; 发送时间: 2014年9月15日(星期一) 下午3:48 收件人: "Lucene Users"; 主题: Re: Speed up searching in multiple-thread? If you run 30 search threads on a core i5 it's expected there will be big s

Speed up searching in multiple-thread?

2014-09-15 Thread Harry Yu
Dear members at Lucene project, I'm developing poi search application using lucene 4.8 . Recently, I met a trouble that the performance of IndexSearcher.search is bad in multiple-thread environment. According the test results, I found that if thread number is 1, the response time of searching

SpanNearQuery behaviour?

2013-11-04 Thread Yu Zhou
Hi, We use SpanNearQueries intensively for proximity searching. However, we are confused by two different ways to use them. Could anybody explain in details what we can expect for nested and flatten SpanNearQueries? We used to build nested SpanNearQueries. However, we found that using nested S

Unifying IDF for unbalanced shards?

2013-08-28 Thread Yu Zhou
Hello, We have a large collection of documents that consists of multiple balanced shards. Now each shard is quickly approaching its limit. Therefore, we would like to explore the possibility of adding unbalanced shards into the mix. However, that means the IDF and Relevance would take a hit. 

custom solr sort problem

2013-01-05 Thread Andy Yu
Hi, Maybe this is an old thread or maybe it's different with previous one. I want to custom solr sort and pass solr param from client to solr server, so I implemented SearchComponent which named MySortComponent in my code, and also implemented FieldComparatorSource and FieldComparator. when I u

Re: sort by field and score

2012-12-02 Thread Andy Yu
eak your code down into a simple standalone program > and post that if it still doesn't work. > > > -- > Ian. > > On Thu, Nov 29, 2012 at 4:20 AM, Andy Yu wrote: > > I revise the code to > > > > SortField sortField[] = {new Sor

Re: sort by field and score

2012-11-28 Thread Andy Yu
NaN I think you'll need > to use a TopFieldCollector. See for example > http://www.gossamer-threads.com/lists/lucene/java-user/86309 > > > -- > Ian. > > > On Tue, Nov 27, 2012 at 3:51 AM, Andy Yu wrote: > > Hi All, > > > > > > Now I want to sor

Re: Where to download lucene-analyzers and lucene-highlighter?

2009-09-26 Thread Peng Yu
ar > lucene-2.4.0/contrib/analyzers/*.jar > > - Original Message - > From: "Peng Yu" > To: java-user@lucene.apache.org > Sent: Saturday, 26 September, 2009 12:11:02 GMT +00:00 GMT Britain, Ireland, > Portugal > Subject: Where to download lucene-analyzers an

Re: Where to download lucene-analyzers and lucene-highlighter?

2009-09-26 Thread Peng Yu
;> -----Original Message- >> From: Peng Yu [mailto:pengyu...@gmail.com] >> Sent: Saturday, September 26, 2009 2:03 PM >> To: java-user@lucene.apache.org >> Subject: Re: Where to download lucene-analyzers and lucene-highlighter? >> >> Hi, >> >> I am w

Where to download lucene-analyzers and lucene-highlighter?

2009-09-26 Thread Peng Yu
Hi, I don't see where I can download lucene-analyzers.jar and lucene-highlighter.jar? Can somebody show me? Regards, Peng - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-us

Re: Where to download lucene-analyzers and lucene-highlighter?

2009-09-26 Thread Peng Yu
- > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > >> -Original Message- >> From: Peng Yu [mailto:pengyu...@gmail.com] >> Sent: Saturday, September 26, 2009 2:16 PM >> To: java-user@lucene.apache

Re: AW: feedback: Indexing speed improvement lucene 2.2->2.3.1

2008-03-25 Thread yu
/java/2_3_1/api/contrib-analyzers/org/apache/lucene/analysis/ngram/package-summary.html Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: yu <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Wednesday, March 26, 2008 12:04:33 AM Subj

Re: AW: feedback: Indexing speed improvement lucene 2.2->2.3.1

2008-03-25 Thread yu
Hi Otis, I checked that contrib before and could not find NgramStemFilter. Am I missing other contrib? Thanks for the link! Jay Otis Gospodnetic wrote: Hi Jay, Sorry, lapsus calami, that would be Lucene *contrib*. Have a look: http://lucene.apache.org/java/2_3_1/api/contrib-analyzers/index.h

Re: update field boost

2008-02-11 Thread yu
thanks, Hoss! I read the doc for the api indexreader.setNorm() after I posted the question earlier. To use that setNorm() to modify the field boost, it seems to me that one has to know how the boost is folded to the norm (in the default impl, it's boost* lengthNorm) and has to know the old norm

Reuse single document and fields

2008-01-31 Thread yu
Hi, I am trying to use the latest 2.3 API on Field to improve the indexing performance by reusing Documents and Fields. After reading lucene-java wiki and the java doc on Field, I have a couple of questions about the comment in Field.setValue(), namely, "Note that you should only use this me

Re: thread safe shared IndexSearcher

2007-09-25 Thread Jay Yu
y, it does work. I am also thinking about how to handle the Analyzer, and I think the solution (the need to init some indexaccessor params) might involve all these issues. - Mark Jay Yu wrote: Mark, Looking at your implementation of the DefaultIndexAccessor regarding the writer, I think t

Re: thread safe shared IndexSearcher

2007-09-25 Thread Jay Yu
code. I will take your comments into account and update the code. I have some other pieces to polish as well. Previously, I extended and built upon the original code, but I can't give it away, so this is my attempt at something lessor, but cleaner. Jay Yu wrote: Thanks for the tip. One

Re: thread safe shared IndexSearcher

2007-09-24 Thread Jay Yu
reset analyzer/Dir as in my own version. Jay Mark Miller wrote: One final noteif you are using the IndexAccessor and you are only accessing the index from one JVM, you can use the NoLockFactory and save some sync cost there. Jay Yu wrote: Mark, Great effort getting the original

Re: thread safe shared IndexSearcher

2007-09-24 Thread Jay Yu
Mark, Great effort getting the original lucene index accessor package in this shape. I am sure this will benefit a lot of people using Lucene in a multithread env. I have a quick question to ask you: Do you have to use the core Lucene 2.3-dev in order to use the accessor? I will take a look a

Re: thread safe shared IndexSearcher

2007-09-24 Thread Jay Yu
total time to parse a query and run a search. I'll try and get around to posting the code tonight. - Mark Jay Yu wrote: Mark Miller wrote: Good luck Jay. Keep in mind, pretty much all LuceneIndexAccessor does is sync Readers with Writers and allow multiple threads to share the same in

Re: thread safe shared IndexSearcher

2007-09-20 Thread Jay Yu
will appreciate it! Thanks for your help! - Mark Jay Yu wrote: Mark, Thanks for sharing your valuable exp. and thoughts. Frankly our system already has most of the functionalities LuceneIndexAcessor offers. The only thing I am looking for is to sync the searchers' close. That's

Re: thread safe shared IndexSearcher

2007-09-20 Thread Jay Yu
less your bulk loading). I've had such a system with 5+ million docs on a standard machine and searches where still well below a second after the first Searcher is cached (and even the first search is darn quick). And that includes a lot of extra crap I am doing. - Mark Jay Yu wrote

Re: thread safe shared IndexSearcher

2007-09-19 Thread Jay Yu
from the originally submitted code. 4. I will look into getting together some code that I can share. The multisearcher changes that are need are a couple of one liners really, so at a minimum I will give you the changes needed. - Mark On 9/19/07, Jay Yu <[EMAIL PROTECTED]> wrote: Mark,

Re: thread safe shared IndexSearcher

2007-09-19 Thread Jay Yu
raight from the originally submitted code. 4. I will look into getting together some code that I can share. The multisearcher changes that are need are a couple of one liners really, so at a minimum I will give you the changes needed. - Mark On 9/19/07, Jay Yu <[EMAIL PROTEC

Re: thread safe shared IndexSearcher

2007-09-19 Thread Jay Yu
k its overlooked far too much. - Mark On 9/19/07, Jay Yu <[EMAIL PROTECTED]> wrote: In a multithread app like web app, a shared IndexSearcher could throw a AlreadyClosedException when another thread is trying to update the underlying IndexReader by closing the shared searcher after t

thread safe shared IndexSearcher

2007-09-19 Thread Jay Yu
In a multithread app like web app, a shared IndexSearcher could throw a AlreadyClosedException when another thread is trying to update the underlying IndexReader by closing the shared searcher after the index is updated. Searching over the past discussions on this mailing list, I found several

Re: Search for null

2007-07-25 Thread Jay Yu
bits final BitSet filterBitSet = queryFilter.bits(reader); filterBitSet.flip(0,filterBitSet.size()); Now you have a filter that contains document matching the opposite of that specified by the query, and can use in subsequent queries Dan On Tue, 2007-07-24 at 09:40 -0700, Jay Yu wrote: daniel ro

Re: Search for null

2007-07-24 Thread Jay Yu
d can cheaply be stored, generated once and used often. Dan On Mon, 2007-07-23 at 13:57 -0700, Jay Yu wrote: If you want performance, a better way might be to assign some special string/value (if it's easy to create) to the missing field of docs and index the field without tokenizing it. Then you

Re: Search for null

2007-07-23 Thread Jay Yu
If you want performance, a better way might be to assign some special string/value (if it's easy to create) to the missing field of docs and index the field without tokenizing it. Then you may search for that special value to find the docs. Jay Les Fletcher wrote: Does this particular range

Re: MoreLikeThis

2007-07-18 Thread yu
You can put lucene-queries-2.2.0.jar on your class path or your Eclipse project build path. That's all you need. Jay Akanksha Baid wrote: I am using Lucene 2.1.0 and want to use MoreLikeThis for querying documents. I understand that the jar file for the same is in contrib. I have the contrib

Re: RangeFilter

2007-07-10 Thread Jay Yu
Thanks for clarifying this, Chris! I agree with you that javadocs usual should doc all they do but often times they skip few important things they do do. Chris Hostetter wrote: : Does anyone know if the RangeFilter is a cached filter? I could not : tell from the api. Generally speaking cla

RangeFilter

2007-07-10 Thread Jay Yu
Hi All, Does anyone know if the RangeFilter is a cached filter? I could not tell from the api. Thanks! Jay - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: How to make a case insensitive search using a FuzzyQuery?

2007-07-06 Thread Jiye Yu
You may store the original text in the doc w/o index it and index the lower case version without storing it. This may save you some space/time. Eloi Rocha Neto wrote: Hi Daniel, I dont lowercase the field at index time, because I have to show the results in the same way as it was found. F

Re: boosting indexing terms

2007-06-26 Thread yu
Thanks, Karl, for your link/suggestion. I did not know Payloads and will check it out. Jay karl wettin wrote: 27 jun 2007 kl. 04.55 skrev yu: I like Lucene's ability to boost each query term during query time.. However, for indexing, the smallest unit you can boost is a field. I wond

boosting indexing terms

2007-06-26 Thread yu
I like Lucene's ability to boost each query term during query time.. However, for indexing, the smallest unit you can boost is a field. I wonder if anyone here has good idea or suggestion on how one might boost terms without putting them in separate fields? Thanks. Jay ---

Re: Analyzer sharing

2007-06-22 Thread Jiye Yu
I see. I guess those Filters (e.g. PorterStemFilter) that make up the analyzer are not thread safe or cannot be shared. Thanks for your quick response! Jay Yonik Seeley wrote: On 6/22/07, Jiye Yu <[EMAIL PROTECTED]> wrote: I guess an Analyzer (built in ones such as StandardAn

Analyzer sharing

2007-06-22 Thread Jiye Yu
Hi, I guess an Analyzer (built in ones such as StandardAnalyzer, POrterStemAnalyer and etc) is not thread safe. But I wonder if it's ok to share the same analyzer object within a thread. For example, if I want to create a PerFieldAnalyzer for 5 fields, can I use the same Analyzer object for a

Re: efficient way to filter out unwanted results

2007-06-15 Thread Jiye Yu
Thanks Antony for the idea. The only thing that may prevent it from working well is that the index is updated frequently so the docid to ext id or cache needs to be updated freq, which may affect the performance. Thanks again for your help. Antony Bowesman wrote: yu wrote: Thanks Sawan for

Re: efficient way to filter out unwanted results

2007-06-15 Thread yu
. Hope it will be useful. Sawan (Chambal.com inc. NJ USA) On 6/15/07, yu <[EMAIL PROTECTED]> wrote: Hi everyone, I am trying to remove several docs from search results each time I do query. The docs can be identified by an exteranl ids whcih are saved/indexed. I could use a Qu

efficient way to filter out unwanted results

2007-06-14 Thread yu
Hi everyone, I am trying to remove several docs from search results each time I do query. The docs can be identified by an exteranl ids whcih are saved/indexed. I could use a Query or QueryFilter to achieve this but not sure if it's the most efficient way to do that. Anyone has any experienc

Re: Can I do "Google Suggest" Like Search? - - - from - - -vikas

2006-05-24 Thread Zhenjian YU
yes, PrefixQuery will help. On 5/24/06, mark harwood <[EMAIL PROTECTED]> wrote: >>What will happen if I send PrefixQuery A search returns a list of docs - you want a list of words which is why I suggested using the IndexReader "terms" APIs which PrefixQuery uses internally. If you are not in

Re: Can I do "Google Suggest" Like Search? - - - from - - -vikas

2006-05-24 Thread Zhenjian YU
tarting with "a". Either way is a solution. Best Regards, Zhenjian On 5/24/06, Vikas Khengare <[EMAIL PROTECTED]> wrote: Hi Zhenjian YU I have implemented that with AJAX Search But when I type "*a*" in textbox. It gives me all results which contains letter &

Re: Can I do "Google Suggest" Like Search? - - - from - - -vikas

2006-05-24 Thread Zhenjian YU
hi, *vikas, *the ajax way should work for you. while in the lucene part, you should do something to handle the prefix query. Best Regards, Zhenjian On 5/24/06, Vikas Khengare <[EMAIL PROTECTED]> wrote: Hi Friend I want to perform search like "Google search" which will give

Re: does anybody have the experience to do some pooling upon lucene?

2006-05-22 Thread Zhenjian YU
OK, got it. Thanks. On 5/23/06, Erik Hatcher <[EMAIL PROTECTED]> wrote: On May 21, 2006, at 10:56 PM, Zhenjian YU wrote: > I didn't dig the source code of lucence deep enough, but I noticed > that the > IndexSearcher uses an IndexReader, while the cost of initializing &g

Re: does anybody have the experience to do some pooling upon lucene?

2006-05-21 Thread Zhenjian YU
system needs to use (in the basic sense). Pooling multiple instances pointing to the same index won't benefit your performance. Things get trickier when you are updating the index and want to see the updates. Erik On May 19, 2006, at 5:13 AM, Zhenjian YU wrote: > I'm

does anybody have the experience to do some pooling upon lucene?

2006-05-19 Thread Zhenjian YU
I'm using apache commons pooling library to pool the IndexSearcher, so that my system can provide high performance. I wonder if it is reasonable to pooling objects of lucene? If yes, is there any other objects I can also pool? Thank you!