FW: Challenges with Chinese Query Matching and Wildcard Search in Lucene (StandardAnalyzer / CJKAnalyzer)

2025-07-08 Thread Singh, Divya
From: Singh, Divya Sent: 04 July 2025 14:40 To: d...@lucene.apache.org Cc: Birajdar, Sharad (DI SW PLM LCS APPS ALM R&D7) Subject: FW: Challenges with Chinese Query Matching and Wildcard Search in Lucene (StandardAnalyzer / CJKAnalyzer) From: Thakare, Monika (ext) (DI SW PLM LCS APPS A

FW: PointValues ordering

2018-02-26 Thread Allison, Timothy B.
Prob better question for user list. From: Dominik Safaric [mailto:dominiksafa...@gmail.com] Sent: Monday, February 26, 2018 1:20 PM To: d...@lucene.apache.org Subject: PointValues ordering Given a multi-valued and non-indexed point value field, how does Lucene internally store this kind of field

FW: [Jts-topo-suite-user] Persistent STR tree

2014-01-17 Thread Smiley, David W.
FYI for those with spatial interests… From: "Smiley, David W." <mailto:dsmi...@mitre.org> Date: Friday, January 17, 2014 at 11:53 AM To: Demeter Sztanko <mailto:szta...@gmail.com> Cc: "jts-topo-suite-u...@lists.sourceforge.net" mailto:jts-topo

Re: FW: How to find the original position of the match in the pdf document

2011-10-19 Thread Ian Lea
Sivasubramanian wrote: > Hi > > Can someone answer my question please > > Regards, > Vidya > > > From: Vidya Kanigiluppai Sivasubramanian > Sent: Wednesday, October 19, 2011 6:06 PM > To: ''java-user@lucene.apache.org' > Subject: FW: How to find the or

FW: How to find the original position of the match in the pdf document

2011-10-19 Thread Vidya Kanigiluppai Sivasubramanian
Hi Can someone answer my question please Regards, Vidya From: Vidya Kanigiluppai Sivasubramanian Sent: Wednesday, October 19, 2011 6:06 PM To: ''java-user@lucene.apache.org' Subject: FW: How to find the original position of the match in the pdf document Hi Can someone ans

FW: Thans for your help with my Lucene problem

2011-07-28 Thread Uwe Schindler
Forwarded personal mail: Hi Uwe, Many thanks for your excellent help with my problem in Lucene (below), I really appreciate such a prompt and professional support that you guys provide. I'm sending this email directly to you because I don't know how to reply to your reply on the Lucene Mailin

Re: FW: Indexer Threads Getting Into BLOCKED State While Optimization Taking Place On Large Indexes Of Size > 2GB

2011-07-20 Thread Michael McCandless
Hmm can you double-check your Lucene version? SerialMergeScheduler wasn't added until 2.3, so you are at least at that version. It looks like you are using SerialMergeScheduler, which, by design, can only do one merge at a time (this is why you see the threads BLOCKED). You can try switching to

Re: FW: Re: lucene3.0.2: getting incorrect no. of occurrence in file

2010-12-08 Thread Erick Erickson
And I kind of skipped over the query part. Between the two of us, we might make one decent code reviewer :D Erick On Wed, Dec 8, 2010 at 8:46 AM, Simon Willnauer < simon.willna...@googlemail.com> wrote: > On Wed, Dec 8, 2010 at 2:43 PM, Erick Erickson > wrote: > > I don't think this code is doi

Re: FW: Re: lucene3.0.2: getting incorrect no. of occurrence in file

2010-12-08 Thread Simon Willnauer
On Wed, Dec 8, 2010 at 2:43 PM, Erick Erickson wrote: > I don't think this code is doing anything predictable. From the javadocs for > TermDocs.skipTo(): > Skips entries to the first beyond the current whose document number is > greater than or equal to *target*. > > Returns true iff there is such

Re: FW: Re: lucene3.0.2: getting incorrect no. of occurrence in file

2010-12-08 Thread Simon Willnauer
Hey there, so let me quickly explain what is going on here. You search for: sql server (without quotes), which QueryParser will turn into a BooleanQuery like sql OR server, since the StandardAnalyzer you are using splits on whitespace. That query will return all documents containing either sql or s
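The OR behaviour described in this reply can be illustrated with a toy inverted index. This is only a plain-Java sketch, not Lucene code: `OrQuerySketch`, `matchAny`, and the sample index are hypothetical names invented for the illustration, and the whitespace split merely stands in for what an analyzer would do.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical toy model of "sql server" being treated as sql OR server.
final class OrQuerySketch {

    // Returns every document id that contains at least one of the query terms.
    static Set<Integer> matchAny(Map<String, Set<Integer>> index, String query) {
        Set<Integer> hits = new TreeSet<>();
        for (String term : query.toLowerCase(Locale.ROOT).trim().split("\\s+")) {
            hits.addAll(index.getOrDefault(term, Collections.emptySet()));
        }
        return hits;
    }

    public static void main(String[] args) {
        // term -> ids of the documents containing that term (made-up data)
        Map<String, Set<Integer>> index = new HashMap<>();
        index.put("sql", new HashSet<>(Arrays.asList(1, 2)));
        index.put("server", new HashSet<>(Arrays.asList(2, 3)));

        // Documents 1 and 3 each contain only one of the two words, yet both match.
        System.out.println(matchAny(index, "sql server"));
    }
}
```

In this model a document matches as soon as it contains either word, which is why the hit count can be larger than the number of documents containing the exact phrase.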

Re: FW: Re: lucene3.0.2: getting incorrect no. of occurrence in file

2010-12-08 Thread Erick Erickson
I don't think this code is doing anything predictable. From the javadocs for TermDocs.skipTo(): Skips entries to the first beyond the current whose document number is greater than or equal to *target*. Returns true iff there is such an entry. You're not testing the return value from skipTo. The do
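To make the point about testing the return value concrete, here is a minimal plain-Java sketch of the contract quoted from the javadocs. `ToyTermDocs` is a hypothetical stand-in written for this illustration; it is not Lucene's `TermDocs`, and it only models the "first entry beyond the current one with document number >= target" rule.

```java
// Hypothetical stand-in for a postings iterator; not Lucene's TermDocs class.
final class ToyTermDocs {
    private final int[] docs; // sorted document numbers that contain the term
    private int pos = -1;     // index of the current entry, -1 before the first

    ToyTermDocs(int... sortedDocs) {
        this.docs = sortedDocs.clone();
    }

    // Moves to the first entry beyond the current one whose document number
    // is >= target. Returns true iff there is such an entry.
    boolean skipTo(int target) {
        do {
            pos++;
        } while (pos < docs.length && docs[pos] < target);
        return pos < docs.length;
    }

    // Only meaningful after skipTo has returned true.
    int doc() {
        return docs[pos];
    }

    public static void main(String[] args) {
        ToyTermDocs td = new ToyTermDocs(2, 5, 9);
        if (td.skipTo(4)) {
            System.out.println("positioned on doc " + td.doc());
        }
        if (!td.skipTo(10)) {
            System.out.println("no entry at or beyond 10, so doc() must not be read");
        }
    }
}
```

Note that a `true` result only says the iterator landed on some document at or after the target; the caller still has to compare `doc()` with the target if an exact match is required.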

FW: Re: lucene3.0.2: getting incorrect no. of occurrence in file

2010-12-08 Thread Ranjit Kumar
Hi, Thanks for your reply!!! Below is the code I am using for search String line="sql server"; IndexReader reader = IndexReader.open(FSDirectory.open(new File(indexpath)), true); // contains index file p

Re: FW: Use of hyphens in StandardAnalyzer

2010-10-24 Thread Ahmet Arslan
Browne. --- On Mon, 10/25/10, Martin O'Shea wrote: > From: Martin O'Shea > Subject: FW: Use of hyphens in StandardAnalyzer > To: java-user@lucene.apache.org > Date: Monday, October 25, 2010, 12:28 AM > A good suggestion. But I'm using > Lucene 3.0.2 and the construct

FW: Use of hyphens in StandardAnalyzer

2010-10-24 Thread Martin O'Shea
A good suggestion. But I'm using Lucene 3.0.2 and the constructor for a StandardAnalyzer has Version_30 as its highest value. Do you know when 3.1 is due? -Original Message- From: Steven A Rowe [mailto:sar...@syr.edu] Sent: 24 Oct 2010 21 31 To: java-user@lucene.apache.org Subject: RE:

FW: Save 45% and Join the Search Revolution!

2010-09-27 Thread Fuad Efendi
Click here to view in your browser.

FW: Binary indexing / query efficiency

2009-04-16 Thread Eger, Patrick
Resending, I think this got dropped by the list for some reason - Hi, was recently looking to incorporate Lucene for a simple "parametric"/"faceted" type search. The documents are very small, roughly 15 fields of short length (5-15 characters, generally strings and padded integers). When

FW: Case Sensitivity

2008-08-28 Thread Dino Korah
Looks like my question went unnoticed among the more important Jira discussion. :( On the same topic, what would be the effect of the following code? Document doc = new Document(); Field f = new Field("body", bodyText, Field.Store.NO, Field.Index.TOKENIZED); f.setOmitNorms(true); Would that be eq

Re: FW: Re: Occasional Hang in IndexWriter.close()

2008-04-22 Thread Stu Hood
; Sent: Tuesday, April 22, 2008 7:32am To: java-user@lucene.apache.org, [EMAIL PROTECTED] Subject: Re: FW: Re: Occasional Hang in IndexWriter.close() - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail:

Re: FW: Re: Occasional Hang in IndexWriter.close()

2008-04-22 Thread Michael McCandless
The hang also only happens if you are using SerialMergeScheduler. Stu, one question: was there an interesting reason why you switched back to SerialMergeScheduler? Did you hit an issue with ConcurrentMergeScheduler? Mike Stu Hood <[EMAIL PROTECTED]> wrote: > Hey gang, > > The finally block was

Re: FW: Re: Occasional Hang in IndexWriter.close()

2008-04-22 Thread Michael McCandless
OK this output was very helpful, thanks! I think I see what's happening here. Basically a merge can sneak in when Lucene doesn't expect it to (on copying a single external segment over), and as a result it never gets scheduled. This happens only with addIndexesNoOptimize, when the index you addi

FW: Re: Occasional Hang in IndexWriter.close()

2008-04-21 Thread Stu Hood
Hey gang, The finally block was not hiding any exceptions: I'm still seeing the hangs in IndexWriter.close(). I'm appending the relevant output from 'setInfoStream(System.out)'. It looks like after I order the close, the IndexWriter begins a flush, and then hangs. Any ideas? Thanks, Stu >>>

FW: Eliminating duplicate documents when indexing

2007-10-03 Thread Rod Giles
Duplicate Documents In An Index The updateDocument method of IndexWriter indicates that a delete by term occurs before the updated document is added (i.e. the document is replaced in the index, but not duplicated). Has anyone been able to get this process to work? The term that I am using ha

RE: FW: Lucene indexing vs RDBMS insertion.

2007-07-05 Thread Chew Yee Chuang
I'm wrong. Any suggestion to lighten the indexing is always welcome. Thank you -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Wednesday, June 20, 2007 8:56 PM To: java-user@lucene.apache.org Subject: Re: FW: Lucene indexing vs RDBMS insertion. That's

Re: FW: Lucene indexing vs RDBMS insertion.

2007-06-20 Thread Erick Erickson
there is no way to schedule the indexing. What I can do is to find a solution to minimize delay so system can get "almost" real time data to display. Thanks. --- eChuang, Chew -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Tuesday, June 19, 2007 10:03

RE: FW: Lucene indexing vs RDBMS insertion.

2007-06-20 Thread Chew Yee Chuang
y. Thanks. --- eChuang, Chew -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Tuesday, June 19, 2007 10:03 PM To: java-user@lucene.apache.org Subject: Re: FW: Lucene indexing vs RDBMS insertion. You still haven't described how often you need to index and why. That

Re: FW: Lucene indexing vs RDBMS insertion.

2007-06-19 Thread Erick Erickson
ferent with searching in a optimized index compare to a un-optimize ? -- eChuang, Chew -Original Message- From: Chris Lu [mailto:[EMAIL PROTECTED] Sent: Tuesday, June 19, 2007 2:19 AM To: java-user@lucene.apache.org Subject: Re: FW: Lucene indexing vs RDBMS insertion. Definitely very

Re: FW: Lucene indexing vs RDBMS insertion.

2007-06-19 Thread Chris Lu
un-optimize ? -- eChuang, Chew -Original Message- From: Chris Lu [mailto:[EMAIL PROTECTED] Sent: Tuesday, June 19, 2007 2:19 AM To: java-user@lucene.apache.org Subject: Re: FW: Lucene indexing vs RDBMS insertion. Definitely very aggressive. Currently my experience is that, together with datab

RE: FW: Lucene indexing vs RDBMS insertion.

2007-06-18 Thread Chew Yee Chuang
? -- eChuang, Chew -Original Message- From: Chris Lu [mailto:[EMAIL PROTECTED] Sent: Tuesday, June 19, 2007 2:19 AM To: java-user@lucene.apache.org Subject: Re: FW: Lucene indexing vs RDBMS insertion. Definitely very aggressive. Currently my experience is that, together with

Re: FW: Lucene indexing vs RDBMS insertion.

2007-06-18 Thread Chris Lu
gestion and will come back to you once I tested the solution. Thanks, eChuang, Chew -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Friday, June 15, 2007 11:11 PM To: java-user@lucene.apache.org Subject: Re: FW: Lucene indexing vs RDBMS insertion. From my perspec

Re: FW: Lucene indexing vs RDBMS insertion.

2007-06-18 Thread Erick Erickson
ne will have enough time to index it. Anyway, thanks for your suggestion and will come back to you once I tested the solution. Thanks, eChuang, Chew -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Friday, June 15, 2007 11:11 PM To: java-user@lucene.apache.org Subje

RE: FW: Lucene indexing vs RDBMS insertion.

2007-06-18 Thread Chew Yee Chuang
rom: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Friday, June 15, 2007 11:11 PM To: java-user@lucene.apache.org Subject: Re: FW: Lucene indexing vs RDBMS insertion. From my perspective, this is an irrelevant question. The real question is "is Lucene indexing fast enough for my application?&

Re: FW: Lucene indexing vs RDBMS insertion.

2007-06-15 Thread Erick Erickson
From my perspective, this is an irrelevant question. The real question is "is Lucene indexing fast enough for my application?". Which nobody can answer for you, you have to experiment. If you're building an index that's only updated every 6 months, Lucene is certainly "fast enough". If you're re

Re: FW: Lucene indexing vs RDBMS insertion.

2007-06-15 Thread Chris Lu
It's better to first understand the computational difference between Lucene indexing and database insertion. Lucene indexing needs to stem all words, sort them, and save them to disk. And since Lucene uses an incremental merge model, saved documents may need to be merged and saved again. There

FW: Lucene indexing vs RDBMS insertion.

2007-06-15 Thread Chew Yee Chuang
Hi, I’m a new user to Lucene, and heard that it is a powerful tool for full text search and I’m planning to use it in my project for data storage. Before the implementation, I would like to know whether there is any performance issue with the Lucene indexing process. I have no doubt on the retrievin

Fw: About Lucene-patch-446

2007-05-29 Thread Doron Cohen
- taking this discussion back to the user list - "Huajing Li" wrote on 29/05/2007: > Hi Doron, > > Days ago I published a post in the Lucene user mailing list asking > about merging database data with Lucene que

FW: Question concerning Analyzers

2007-02-07 Thread Damien McCarthy
"Can an analyzer index something but ignore it during a search" It sounds like you might be using a different analyzer to parse your search query than you used when you indexed the data. Perhaps you could print out the toString() of the query you pass to the IndexSearcher, as this would show you i

Fw: Urgent : Specific search problem with whitespace analyzer

2006-11-21 Thread Krishnendra Nandi
Hi, I am doing "field:text" kind of search using my own analyzer which behaves like whitespaceanalyzer. Following are the code snippets for my own whitespaceanalyzer and whitespacetokenizer. // WhiteSpaceAnalyzerMaestro.java package com.hewitt.itk.maestro.support.service.simplesearch; import

Re: Fw: Urgent : Specific search problem with whitespace analyzer

2006-11-21 Thread Chris Hostetter
: I have modified the tokenizer class by making it return characters in : lower case. there is really no reason to do this ... have your analyzer use the WhitespaceTokenizer, wrapped in a LowerCaseFilter ... that will eliminate some of your custom code, and perhaps some of your problems as well.
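The idea of composing two simple stages instead of writing a custom lower-casing tokenizer can be sketched in plain Java. This is not the Lucene `WhitespaceTokenizer` or `LowerCaseFilter` API; `WhitespaceLowercaseSketch` and its methods are hypothetical names that only mimic the two-stage shape of that pipeline.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical two-stage analysis chain: split on whitespace, then lowercase.
final class WhitespaceLowercaseSketch {

    // Stage 1: split on runs of whitespace, dropping empty tokens.
    static List<String> tokenizeOnWhitespace(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.trim().split("\\s+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    // Stage 2: lowercase each token without touching token boundaries.
    static List<String> lowercase(List<String> tokens) {
        List<String> out = new ArrayList<>(tokens.size());
        for (String token : tokens) {
            out.add(token.toLowerCase(Locale.ROOT));
        }
        return out;
    }

    // The "analyzer": stage 2 wrapped around stage 1.
    static List<String> analyze(String text) {
        return lowercase(tokenizeOnWhitespace(text));
    }

    public static void main(String[] args) {
        System.out.println(analyze("  Foo-Bar   BAZ qux "));
    }
}
```

Keeping the stages separate means the tokenizing rule and the case rule can each be changed or reused without modifying the other, which is the benefit the reply points at.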

Fw: tomcat 5 can't find static method FSDirectory.getDirectory( String s, boolean b)?

2006-07-05 Thread maffittd
--- David Maffitt, Senior Programmer-Analyst Washington University School of Medicine [EMAIL PROTECTED] Electronic Radiology Laboratory voice 314.362.6965 Mallinckrodt Institute of Radiology fax314.362.6971 510 S. Kingshighway, St. Louis, MO

FW: how Boolean query work internally in lucene

2006-07-01 Thread Amit
Hi All, I just want to know how Lucene processes a Boolean query internally. As I understand it: if I search for "java apache". Note: let's assume I want documents that contain both words and I constructed a boolean query for that (i.e. +java +apache). Please let me clear if i

FW: Searching API: QueryParser vs Programatic queries

2006-05-22 Thread Irving, Dave
Erick Erickson wrote: ... > It seems to me that you can always do something like: > BooleanQuery bq; > QueryParser qp1 = new QueryParser("field1", "", analyzer); > Query q1 = qp1.parse("search term or clause"); bq.add(q1,,,); > QueryParser qp2 = new QueryParser("field2", "", analyzer); > Query q

Fw: Sorting by Score

2006-02-06 Thread Daniel . Clark
~ Daniel Clark, Senior Consultant Sybase Federal Professional Services 6550 Rock Spring Drive, Suite 800 Bethesda, MD 20817 Office - (301) 896-1103 Office Fax - (301) 896-1604 Mobile - (703) 403-0340 ~ - Forward

FW: Issue with sounds-like queries

2005-09-28 Thread Peter Gelderbloem
You should present all the alternatives to the user as well as the contexts of each hit in terms of country, state and full name etc. and let them decide which one they intended. Peter -Original Message- From: Jayakumar.V [mailto:[EMAIL PROTECTED] Sent: 28 September 2005 12:03 To: Pe

FW: Boolean Query - TooManyClauses Exception

2005-05-19 Thread Jayakumar.V
Hi, Further to the below, if I remove the RangeQuery, it works beautifully. :-( Any pointers to where the issue is ? jayakumar.v _ From: Jayakumar.V [mailto:[EMAIL PROTECTED] Sent: Thursday, May 19, 2005 16:40 PM To: 'java-user@lucene.apache.org' Subject: Boolean Query - TooMa

RE: FW: CVS Lucene 2.0

2005-04-26 Thread Peter Veentjer - Anchor Men
Subject: Re: FW: CVS Lucene 2.0 Term.field is interned, so equals() isn't needed. -Yonik On 4/26/05, Peter Veentjer - Anchor Men <[EMAIL PROTECTED]> wrote: [...] > Term other = (Term) o; > return field.equals(other.field) &&

RE: FW: CVS Lucene 2.0

2005-04-26 Thread Peter Veentjer - Anchor Men
ley [mailto:[EMAIL PROTECTED] Sent: Tuesday, 26 April 2005 16:04 To: java-user@lucene.apache.org CC: Lucene Users List Subject: Re: FW: CVS Lucene 2.0 Term.field is interned, so equals() isn't needed. -Yonik On 4/26/05, Peter Veentjer - Anchor Men <[EMAIL PROTECTED]> w

Re: FW: CVS Lucene 2.0

2005-04-26 Thread Yonik Seeley
Term.field is interned, so equals() isn't needed. -Yonik On 4/26/05, Peter Veentjer - Anchor Men <[EMAIL PROTECTED]> wrote: [...] > Term other = (Term) o; > return field.equals(other.field) && > text.equals(other.text); > } > Third: if the field values of re
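Why interning makes `equals()` unnecessary for the field can be shown with a small plain-Java sketch. `InternedFieldSketch` is a hypothetical class written for this illustration, not Lucene's `Term`; it assumes, as the reply states for `Term.field`, that the field string is always interned when the object is constructed.

```java
// Hypothetical Term-like value class whose field name is always interned.
final class InternedFieldSketch {
    private final String field; // interned in the constructor, so == is safe
    private final String text;  // not interned, so equals() is still required

    InternedFieldSketch(String field, String text) {
        this.field = field.intern();
        this.text = text;
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) {
            return true;
        }
        if (!(o instanceof InternedFieldSketch)) {
            return false;
        }
        InternedFieldSketch other = (InternedFieldSketch) o;
        // Reference comparison works only because both fields were interned.
        return field == other.field && text.equals(other.text);
    }

    @Override
    public int hashCode() {
        return 31 * field.hashCode() + text.hashCode();
    }

    public static void main(String[] args) {
        // new String(...) forces distinct String objects with equal contents.
        InternedFieldSketch a = new InternedFieldSketch(new String("body"), "lucene");
        InternedFieldSketch b = new InternedFieldSketch("body", new String("lucene"));
        System.out.println(a.equals(b));
    }
}
```

The reference comparison is only valid because every construction path interns the field; if any path stored a non-interned string, `==` could report two equal field names as different.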

FW: CVS Lucene 2.0

2005-04-26 Thread Peter Veentjer - Anchor Men
-Original Message- From: Peter Veentjer - Anchor Men Sent: Tuesday, 26 April 2005 15:44 To: 'Daniel Naber' Subject: RE: CVS Lucene 2.0 -Original Message- From: Daniel Naber [mailto:[EMAIL PROTECTED] Sent: Tuesday, 26 April 2005 15:36 To: Peter Veentjer

FW: Indexing aborts in mid-process

2005-04-19 Thread Jayakumar.V
Hi, I need some clarification on the indexing process. A process is initiated for indexing 1000 documents. If for some reason, the process fails mid-way during the indexing activity, say while indexing the 501st document, what is the status of the index files? Does it commit after each docu