to search POI. And I want final score to show they text
similarity. It like inverse of levenshtein distance of query and hit docs.
Do you have any ideas. Sincerely thank you~
Yu
-- 原始邮件 --
发件人: "Rajendra Rao";;
发送时间: 2014年11月11日(星期二) 晚上10:55
Hi everyone,
I met a new trouble. In my system, we should score the doc range from 0 to 100.
There are some easy ways to map lucene scores to this scope. Thanks for your
help~
Yu
Hi everyone,
I have been using Lucene to build a POI searching & geocoding system. After
test, I found that when query is long(above 10 terms). And the speed of
searching is too slow near to 1s. I think the bottleneck is that I used OR to
generate my BooleanQuery. It would get plenty of candi
Hi everyone,
I searched an index with 3 its' fields({"name", "addr", "fullname"}), and used
DisjunctionMaxQuery to rank the results by the max score of 3 fields. When the
hits have same score, lucene would rank by doc Id(low doc Id would be front).
But I don't want rank by doc Id in that ca
Hi guys,
I want to do a facet with facet query,and let it has the [Tagging and
Excluding Filters] (
https://cwiki.apache.org/confluence/display/solr/Faceting)style which
facet.field has,so how to do it , pls guide me!
Thanks,
Andy
rvice, include accessing
complete data form db, calling REST service etc.
Regards,
Harry Yu
-- 原始邮件 --
发件人: "Toke Eskildsen";;
发送时间: 2014年9月15日(星期一) 下午4:47
收件人: "java-user@lucene.apache.org";
主题: Re: Speed up searching in multiple-thread?
.
Best Regards,
Harry Yu
-- 原始邮件 --
发件人: "Michael McCandless";;
发送时间: 2014年9月15日(星期一) 下午3:48
收件人: "Lucene Users";
主题: Re: Speed up searching in multiple-thread?
If you run 30 search threads on a core i5 it's expected there will be
big s
Dear members at Lucene project,
I'm developing poi search application using lucene 4.8 . Recently, I met a
trouble that the performance of IndexSearcher.search is bad in multiple-thread
environment. According the test results, I found that if thread number is 1,
the response time of searching
Hi,
We use SpanNearQueries intensively for proximity searching. However, we are
confused by two different ways to use them. Could anybody explain in details
what we can expect for nested and flatten SpanNearQueries?
We used to build nested SpanNearQueries. However, we found that using nested
S
Hello,
We have a large collection of documents that consists of multiple balanced
shards. Now each shard is quickly approaching its limit. Therefore, we would
like to explore the possibility of adding unbalanced shards into the mix.
However, that means the IDF and Relevance would take a hit.
Hi,
Maybe this is an old thread or maybe it's different with previous one.
I want to custom solr sort and pass solr param from client to solr server,
so I implemented SearchComponent which named MySortComponent in my code,
and also implemented FieldComparatorSource and FieldComparator. when I u
eak your code down into a simple standalone program
> and post that if it still doesn't work.
>
>
> --
> Ian.
>
> On Thu, Nov 29, 2012 at 4:20 AM, Andy Yu wrote:
> > I revise the code to
> >
> > SortField sortField[] = {new Sor
NaN I think you'll need
> to use a TopFieldCollector. See for example
> http://www.gossamer-threads.com/lists/lucene/java-user/86309
>
>
> --
> Ian.
>
>
> On Tue, Nov 27, 2012 at 3:51 AM, Andy Yu wrote:
> > Hi All,
> >
> >
> > Now I want to sor
ar
> lucene-2.4.0/contrib/analyzers/*.jar
>
> - Original Message -
> From: "Peng Yu"
> To: java-user@lucene.apache.org
> Sent: Saturday, 26 September, 2009 12:11:02 GMT +00:00 GMT Britain, Ireland,
> Portugal
> Subject: Where to download lucene-analyzers an
;> -----Original Message-
>> From: Peng Yu [mailto:pengyu...@gmail.com]
>> Sent: Saturday, September 26, 2009 2:03 PM
>> To: java-user@lucene.apache.org
>> Subject: Re: Where to download lucene-analyzers and lucene-highlighter?
>>
>> Hi,
>>
>> I am w
Hi,
I don't see where I can download lucene-analyzers.jar and
lucene-highlighter.jar? Can somebody show me?
Regards,
Peng
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-us
-
> Uwe Schindler
> H.-H.-Meier-Allee 63, D-28213 Bremen
> http://www.thetaphi.de
> eMail: u...@thetaphi.de
>
>
>> -Original Message-
>> From: Peng Yu [mailto:pengyu...@gmail.com]
>> Sent: Saturday, September 26, 2009 2:16 PM
>> To: java-user@lucene.apache
/java/2_3_1/api/contrib-analyzers/org/apache/lucene/analysis/ngram/package-summary.html
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
From: yu <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Wednesday, March 26, 2008 12:04:33 AM
Subj
Hi Otis,
I checked that contrib before and could not find NgramStemFilter. Am I
missing other contrib?
Thanks for the link!
Jay
Otis Gospodnetic wrote:
Hi Jay,
Sorry, lapsus calami, that would be Lucene *contrib*.
Have a look:
http://lucene.apache.org/java/2_3_1/api/contrib-analyzers/index.h
thanks, Hoss!
I read the doc for the api indexreader.setNorm() after I posted the
question earlier. To use that setNorm() to modify the field boost, it
seems to me that one has to know how the boost is folded to the norm (in
the default impl, it's boost* lengthNorm) and has to know the old norm
Hi,
I am trying to use the latest 2.3 API on Field to improve the indexing
performance by reusing Documents and Fields.
After reading lucene-java wiki and the java doc on Field, I have a
couple of questions about the comment in Field.setValue(), namely,
"Note that you should only use this me
y, it does work. I am also thinking about how to
handle the Analyzer, and I think the solution (the need to init some
indexaccessor params) might involve all these issues.
- Mark
Jay Yu wrote:
Mark,
Looking at your implementation of the DefaultIndexAccessor regarding
the writer, I think t
code.
I will take your comments into account and update the code.
I have some other pieces to polish as well. Previously, I extended and
built upon the original code, but I can't give it away, so this is my
attempt at something lessor, but cleaner.
Jay Yu wrote:
Thanks for the tip.
One
reset
analyzer/Dir as in my own version.
Jay
Mark Miller wrote:
One final noteif you are using the IndexAccessor and you are only
accessing the index from one JVM, you can use the NoLockFactory and save
some sync cost there.
Jay Yu wrote:
Mark,
Great effort getting the original
Mark,
Great effort getting the original lucene index accessor package in this
shape. I am sure this will benefit a lot of people using Lucene in a
multithread env.
I have a quick question to ask you:
Do you have to use the core Lucene 2.3-dev in order to use the accessor?
I will take a look a
total time to
parse a query and run a search. I'll try and get around to posting the
code tonight.
- Mark
Jay Yu wrote:
Mark Miller wrote:
Good luck Jay. Keep in mind, pretty much all LuceneIndexAccessor does
is sync Readers with Writers and allow multiple threads to share the
same in
will appreciate it!
Thanks for your help!
- Mark
Jay Yu wrote:
Mark,
Thanks for sharing your valuable exp. and thoughts.
Frankly our system already has most of the functionalities
LuceneIndexAcessor offers. The only thing I am looking for is to sync
the searchers' close. That's
less your bulk loading). I've had such a system with 5+ million docs
on a standard machine and searches where still well below a second after
the first Searcher is cached (and even the first search is darn quick).
And that includes a lot of extra crap I am doing.
- Mark
Jay Yu wrote
from the
originally submitted code.
4. I will look into getting together some code that I can share. The
multisearcher changes that are need are a couple of one liners really, so at
a minimum I will give you the changes needed.
- Mark
On 9/19/07, Jay Yu <[EMAIL PROTECTED]> wrote:
Mark,
raight from the
originally submitted code.
4. I will look into getting together some code that I can share. The
multisearcher changes that are need are a couple of one liners really, so at
a minimum I will give you the changes needed.
- Mark
On 9/19/07, Jay Yu <[EMAIL PROTEC
k its overlooked far too much.
- Mark
On 9/19/07, Jay Yu <[EMAIL PROTECTED]> wrote:
In a multithread app like web app, a shared IndexSearcher could throw a
AlreadyClosedException when another thread is trying to update the
underlying IndexReader by closing the shared searcher after t
In a multithread app like web app, a shared IndexSearcher could throw a
AlreadyClosedException when another thread is trying to update the
underlying IndexReader by closing the shared searcher after the index is
updated. Searching over the past discussions on this mailing list, I
found several
bits
final BitSet filterBitSet = queryFilter.bits(reader);
filterBitSet.flip(0,filterBitSet.size());
Now you have a filter that contains document matching the opposite of
that specified by the query, and can use in subsequent queries
Dan
On Tue, 2007-07-24 at 09:40 -0700, Jay Yu wrote:
daniel ro
d
can cheaply be stored, generated once and used often.
Dan
On Mon, 2007-07-23 at 13:57 -0700, Jay Yu wrote:
If you want performance, a better way might be to assign some special
string/value (if it's easy to create) to the missing field of docs and
index the field without tokenizing it. Then you
If you want performance, a better way might be to assign some special
string/value (if it's easy to create) to the missing field of docs and
index the field without tokenizing it. Then you may search for that
special value to find the docs.
Jay
Les Fletcher wrote:
Does this particular range
You can put lucene-queries-2.2.0.jar on your class path or your Eclipse
project build path. That's all you need.
Jay
Akanksha Baid wrote:
I am using Lucene 2.1.0 and want to use MoreLikeThis for querying
documents. I understand that the jar file for the same is in contrib.
I have the contrib
Thanks for clarifying this, Chris!
I agree with you that javadocs usual should doc all they do but often
times they skip few important things they do do.
Chris Hostetter wrote:
: Does anyone know if the RangeFilter is a cached filter? I could not
: tell from the api.
Generally speaking cla
Hi All,
Does anyone know if the RangeFilter is a cached filter? I could not
tell from the api.
Thanks!
Jay
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
You may store the original text in the doc w/o index it and index the
lower case version without storing it. This may save you some space/time.
Eloi Rocha Neto wrote:
Hi Daniel,
I dont lowercase the field at index time, because I have to show the
results in the same way as it was found.
F
Thanks, Karl, for your link/suggestion. I did not know Payloads and will
check it out.
Jay
karl wettin wrote:
27 jun 2007 kl. 04.55 skrev yu:
I like Lucene's ability to boost each query term during query time..
However, for indexing, the smallest unit you can boost is a field.
I wond
I like Lucene's ability to boost each query term during query time..
However, for indexing, the smallest unit you can boost is a field.
I wonder if anyone here has good idea or suggestion on how one might
boost terms without putting them in separate fields?
Thanks.
Jay
---
I see.
I guess those Filters (e.g. PorterStemFilter) that make up the analyzer
are not thread safe or cannot be shared.
Thanks for your quick response!
Jay
Yonik Seeley wrote:
On 6/22/07, Jiye Yu <[EMAIL PROTECTED]> wrote:
I guess an Analyzer (built in ones such as StandardAn
Hi,
I guess an Analyzer (built in ones such as StandardAnalyzer,
POrterStemAnalyer and etc) is not thread safe. But I wonder if it's ok
to share the same analyzer object within a thread. For example, if I
want to create a PerFieldAnalyzer for 5 fields, can I use the same
Analyzer object for a
Thanks Antony for the idea.
The only thing that may prevent it from working well is that the index
is updated frequently so the docid to ext id or cache needs to be
updated freq, which may affect the performance.
Thanks again for your help.
Antony Bowesman wrote:
yu wrote:
Thanks Sawan for
.
Hope it will be useful.
Sawan
(Chambal.com inc. NJ USA)
On 6/15/07, yu <[EMAIL PROTECTED]> wrote:
Hi everyone,
I am trying to remove several docs from search results each time I do
query. The docs can be identified by an exteranl ids whcih are
saved/indexed. I could use a Qu
Hi everyone,
I am trying to remove several docs from search results each time I do
query. The docs can be identified by an exteranl ids whcih are
saved/indexed. I could use a Query or QueryFilter to achieve this but
not sure if it's the most efficient way to do that.
Anyone has any experienc
yes, PrefixQuery will help.
On 5/24/06, mark harwood <[EMAIL PROTECTED]> wrote:
>>What will happen if I send PrefixQuery
A search returns a list of docs - you want a list of
words which is why I suggested using the IndexReader
"terms" APIs which PrefixQuery uses internally.
If you are not in
tarting with "a".
Either way is a solution.
Best Regards,
Zhenjian
On 5/24/06, Vikas Khengare <[EMAIL PROTECTED]> wrote:
Hi Zhenjian YU
I have implemented that with AJAX Search But when I type "*a*" in
textbox. It gives me all results which contains letter &
hi, *vikas,
*the ajax way should work for you.
while in the lucene part, you should do something to handle the prefix
query.
Best Regards,
Zhenjian
On 5/24/06, Vikas Khengare <[EMAIL PROTECTED]> wrote:
Hi Friend
I want to perform search like "Google search" which will give
OK, got it. Thanks.
On 5/23/06, Erik Hatcher <[EMAIL PROTECTED]> wrote:
On May 21, 2006, at 10:56 PM, Zhenjian YU wrote:
> I didn't dig the source code of lucence deep enough, but I noticed
> that the
> IndexSearcher uses an IndexReader, while the cost of initializing
&g
system needs to use (in the basic
sense). Pooling multiple instances pointing to the same index won't
benefit your performance. Things get trickier when you are updating
the index and want to see the updates.
Erik
On May 19, 2006, at 5:13 AM, Zhenjian YU wrote:
> I'm
I'm using apache commons pooling library to pool the IndexSearcher, so that
my system
can provide high performance.
I wonder if it is reasonable to pooling objects of lucene?
If yes, is there any other objects I can also pool?
Thank you!
52 matches
Mail list logo