can Lucene index different amount of attributes to one document?

2007-01-09 Thread David
Hi all: I know that we can use doc.add(attr, value) to add attributes to the document, can we add different amount of attributes to one document? for example, can this pseudo code runnable? public static Document Document(MyObject o) { Document doc = new Document(); for(i=0; i

How to support float type term in Lucene?

2007-01-09 Thread David
Hi all: How can I index float type term in Lucene so that we can search in a range? I learned that it is possible to convert float to sortable string, but I don't know how to do it. -- David

can we add duplicated field in Lucene?

2007-01-09 Thread David
if I want add two authors to the index , how can I do? except concat the authors' string, because maybe I should support Range search for datetime. Appreciate your help! -- David

How to retrieve the document by document ID?

2007-01-12 Thread David
Hi all: How do Lucene give each document an ID when the document is added and How do we retrieve a document by document ID? appreciate your help! -- David

Re: How to retrieve the document by document ID?

2007-01-14 Thread David
thanks, How do Lucene give each document an ID when the document is added? Is the document ID unchanged until the document is deleted? 2007/1/12, Otis Gospodnetic <[EMAIL PROTECTED]>: David, please look at the Javadoc for IndexReader. I believe the API is reader.document(int), where

How to open IndexWriter to append document?

2007-01-14 Thread David
se.py", line 30, in store writer = PyLucene.IndexWriter(store, self.analyzer, False ) PyLucene.JavaError: java.io.FileNotFoundException: D:\work\contentstore\contentstore\backends\lucene\index\segments: The system cannot find the file specified. How to fix it? Appreciate your help! -- David

Re: How to open IndexWriter to append document?

2007-01-15 Thread David
You should instantiate the writer with create=true : IndexWriter(store, StandardAnalyzer, True ) Because with create=False, an existing index is expected. David <[EMAIL PROTECTED]> wrote on 14/01/2007 23:56:23: > Hi all: > I want first erase the original index and then c

Do frequent addition and deletion affect the search speed?

2007-01-15 Thread David
, 4. user can search on the attributes, range search should be supported. Is Lucene sortable for this case, do the addition and deletion affect the search speed? when the index file become large, is there something unstable? Appreciate your help! -- David

Re: How to support float type term in Lucene?

2007-01-16 Thread David
]>: On 1/9/07, David <[EMAIL PROTECTED]> wrote: > Hi all: > How can I index float type term in Lucene so that we can search in a > range? I learned that it is possible to convert float to sortable string, > but I don't know how to do it. It's easiest/fastest in

what's the newest version of "Lucene in Action"

2007-01-17 Thread David
Hi all: the pdf format "Lucene in Action" I'm reading now is talk about Lucene 1.4, Is the book updated with Lucene 2.0? I don't have any information about it. Appreciate your help -- David

confuse of required and prohibited in BooleanQuery

2007-01-17 Thread David
setting the required and prohibited flags both to false 2.There are four logical combinations of these flags, but the case where both are true is an illogical and invalid combination but I don't know why, Can anybody explain it to me? Appreciate your help! -- David

search in all fields

2007-01-18 Thread David
search on all fields without specify any fields ? Appreciate your help! -- David

how to make RangeQuery action as > < != operators?

2007-01-18 Thread David
Hi all: I need to make range query actions as > < and != operators, The RangeQuery class just support RangeQuery(begin, end, inclusive), but How to support > < and != ? Appreciate your help! -- David

doc.getFields argument error

2007-01-18 Thread David
akes exactly one argument (0 given) how to fix this error? Appreciate your help! -- David

Re: doc.getFields argument error

2007-01-18 Thread David
author.html -Hoss - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] -- David

Re: Suspected problem in the QueryParser

2007-05-16 Thread david
.* AND NOT B +MatchAllDocsQuery -body:B 2, 4 2, 4, 5 Can someone explain why doc 5 is matched for both "*:* NOT B" and "*.* AND NOT B"? Are my Expected Matches incorrect? Thanks, david On 11/3/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: : When I enter the query

Payload instance and byte buffer instance re-use

2011-12-14 Thread David Causse
afraid of some cases where payload instance or data could be buffered and then overwritten by myself while building the next token. Thanks for your help. -- David Causse Spotter http://www.spotter.com/ - To unsubscribe, e-

extractterms Output

2012-01-12 Thread David Olson
get what I want (or hack the explanation class). Why does extractterms do that, even though the search worked on the compound terms? -David- -- View this message in context: http://lucene.472066.n3.nabble.com/extractterms-Output-tp3654833p3654833.html Sent from the Lucene - Java

Re: extractterms Output

2012-01-13 Thread David Olson
As a follow on, I am using StandardAnalyzer for the doc and query parse. What I get from Luke is a hit, but positions indicate the terms next to each other. Is that how I can get the multi-word fequencies? -David- -- View this message in context: http://lucene.472066.n3.nabble.com/extractterms

Query building performance

2012-01-16 Thread David Olson
ery marked as SHOULD. To complicate things, (like my previous post on explain), some keywords are phrases. Parsing does conveniently convert multi-word keywords to phrases, which I assume I'd have to do if I'm building up a BooleanQuery list. Thanks! -David- -- View this message in con

any tips for upgrading Lucene 3.0.3 -> 3.5.0?

2012-01-19 Thread David Carlton
input.) Are there intermediate versions that we should try, e.g. 3.0.3 -> 3.4.0 -> 3.5.0 instead of going straight from 3.0.3 to 3.5.0? If it matters: the one unusual Lucene usage that we have is that we've written our own Tokenizer. Thanks for any pointers that people have. -- David Carlton carl...@sumologic.com

Re: any tips for upgrading Lucene 3.0.3 -> 3.5.0?

2012-01-19 Thread David Carlton
ucene 4.0 (that can only read indexs >= 3.0). > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: David Carlton [mailto:carl...@sumologic.com] >

Re: [Bulk] RE: any tips for upgrading Lucene 3.0.3 -> 3.5.0?

2012-01-20 Thread David Carlton
ache.org > > > - > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > > > - > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > > -- David Carlton carl...@sumologic.com

Query term counting, again...

2012-01-25 Thread David Olson
Hi all, After much code and forum searching, I've hit a frustrating point that should be more obvious. I've trolled through a ton of postings and messaging on keyword counting and it seems like all the examples cover single word terms. I've got several code bits I've written that can get me what I

RE: Query term counting, again...

2012-01-26 Thread David Olson
something about that doesn't sit right. With a hundred vendors that could have 30 keywords each, ouch. Thanks again! -David- -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Thursday, January 26, 2012 8:44 AM To: java-user@lucene.apache.org Sub

Is it possible to create an index with lucene core version 3.3+ by using Version.2_3 that I can then open an index with the original lucene core 2.3 version?

2012-02-20 Thread David Linde
in advance! David

PyLucene Error Message

2012-03-29 Thread David Mosca
keep launching the script again until the script stops crashing (it sometimes takes up to 5 or 6 times). Could you please have a look into this? Thank you. David - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org

RE: PyLucene Error Message

2012-03-29 Thread David Mosca
I have re-attached the log. Thanks, David -Original Message- From: Greg Bowyer [mailto:gbow...@fastmail.co.uk] Sent: 29 March 2012 16:55 To: java-user@lucene.apache.org Subject: Re: PyLucene Error Message I dont see any attached log, can you attach the log please. -- Greg On 29/03

RE: PyLucene Error Message

2012-03-29 Thread David Mosca
but I still get the same error message. Thanks, David From: Greg Bowyer [mailto:gbow...@fastmail.co.uk] Sent: 29 March 2012 17:30 To: David Mosca Cc: java-user@lucene.apache.org Subject: Re: PyLucene Error Message Wow a VM / hotspot crash, are you using threads ? If you are make sure you hav

RE: PyLucene Error Message

2012-03-29 Thread David Mosca
Unfortunately not. Interestingly enough I had already tried a sleep(1) when populating the queue, with no results. From: Greg Bowyer [mailto:gbow...@fastmail.co.uk] Sent: 29 March 2012 18:09 To: David Mosca Cc: java-user@lucene.apache.org Subject: Re: PyLucene Error Message Its a bit crap, but

RE: PyLucene Error Message

2012-03-30 Thread David Mosca
On 29/03/12 10:27, David Mosca wrote: > Unfortunately not. Interestingly enough I had already tried a sleep(1) when > populating the queue, with no results. > > > From: Greg Bowyer [mailto:gbow...@fastmail.co.uk] > Sent: 29 March 2012 18:09 > To: David Mosca > Cc: jav

Re: DisjunctionMaxQuery and scoring

2012-04-19 Thread David Murgatroyd
On Apr 19, 2012, at 6:36 PM, Benson Margulies wrote: > I see why I'm so confused, but I think I need to construct a simpler test > case. > > My top-level BooleanQuery, which has disableCoord=false, has 22 > clauses. All but three are ordinary SHOULD TermQueries. the remainder > are a spanNe

Re: DisjunctionMaxQuery and scoring

2012-04-19 Thread David Murgatroyd
[apologies for the earlier errant send] I think BooleanQuery bq = new BooleanQuery(false); doesn't quite accomplish the desired "name IN (dick, rich)" scoring behavior. This is because (name:dick | name:rich) with coord=false would score the 'document' "Dick Rich" higher than "Rich" because the f

Re: Better Way of calculating Cosine Similarity between documents

2012-05-18 Thread nemeskey . david
It would also be a good idea to compute the distance without IDF weighting to see if you experience a similar effect. Regards, David Nemeskey Quoting Kasun Perera : Hi all I’m indexing collection of documents using Lucene specifying TermVerctor at the indexing time. Then I retrieve terms

Re: Ex.

2012-11-09 Thread David Pilato
Hi Fabio, Did you consider using Hibernate Search? It's based on Lucene. It may help you. David. Le 9 novembre 2012 à 14:15, Fabio Ebner a écrit : > Hi.. can someone help me? i need to integrate LUCENE + SPRING + HIBERNATE > and don't find any articlhe about this.. > >

Re: what is the offsets and payload in DocsAndPositionsEnum for ??

2012-11-27 Thread David Causse
unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands

Re: Configuration for edge ngram typeahead

2013-01-04 Thread David Pilato
Did you define mappings for your docs and fields to use that analyzer? See: http://www.elasticsearch.org/guide/reference/api/admin-indices-put-mapping.html -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 4 janv. 2013 à 22:30, "C. Benson Manica" a écrit : I

Re: Configuration for edge ngram typeahead

2013-01-04 Thread David Pilato
Oh. My bad! Sorry. I misread your JSON. BTW I see that you solve yourself your problem on StackOverFlow. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le 4 janv. 2013 à 23:21, "C. Benson Manica" a écrit : Do I have to do it that way, i.e. POST a separate settin

Re: Lucene vs RDBMS indexing at scale

2013-02-05 Thread David Pilato
So you probably should ask your question to the Elasticsearch mailing list. I think that some ES users already scales to x billion docs. Even if ES is Lucene based, it adds features to scale out (sharding, routing...). HTH -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs Le

handling nonexistent fields in an index

2013-07-03 Thread David Carlton
field, *) will have good performance if field only takes a small number of values. -- David Carlton carl...@sumologic.com

Re: handling nonexistent fields in an index

2013-07-03 Thread David Carlton
ng field, boolean negate)) > > -- Jack Krupansky > > -Original Message- From: David Carlton > Sent: Wednesday, July 03, 2013 4:27 PM > To: java-user@lucene.apache.org > Subject: handling nonexistent fields in an index > > > I have a bunch of Lucene indices l

Searching within a Search Result

2013-08-06 Thread David Miranda
scoreDocs; After the results of the first survey, I want to do a search in the abstract field only in results obtained from the first survey. How i do this? Thanks in advance. David

Re: Searching within a Search Result

2013-08-06 Thread David Miranda
e query generated by QueryParser or direct > construction of a TermQuery. See also CachingWrapperFilter if the > index isn't constantly changing. > > > -- > Ian. > > > On Tue, Aug 6, 2013 at 3:19 PM, David Miranda > wrote: > > Hi, > > > > I have a

Re: Searching within a Search Result

2013-08-06 Thread David Miranda
I experimented with the previous code, but no results are returned from the index. Someone can give me an example? I have been for some time trying to implement this functionality. Thanks. 2013/8/6 David Miranda > Do this with the BooleanQuery: > >> Query q1 = new QueryParser(Versi

Re: Searching within a Search Result

2013-08-19 Thread David Miranda
smallest possible > self-contained example that shows the problem. > > > -- > Ian. > > > On Tue, Aug 6, 2013 at 9:55 PM, David Miranda > wrote: > > I experimented with the previous code, but no results are returned from > the > > index. Someone can give me an example? I h

Lucene Text Similarity

2013-09-03 Thread David Miranda
r (Version.LUCENE_43, "abstract", analyzer). Parse (mytext); Search field abstract the "mytext", not me no results are returned. What can I do to implement this feature? Thanks in advance, David - To

Re: Lucene Text Similarity

2013-09-04 Thread David Miranda
st, > >Tim > > > From: Ivan Krišto [ivan.kri...@gmail.com] > Sent: Wednesday, September 04, 2013 3:17 AM > To: java-user@lucene.apache.org > Subject: Re: Lucene Text Similarity > > On 09/03/2013 07:33 PM, David Miranda wrote: > > Is there any wa

Lucene Concurrent Search

2013-09-04 Thread David Miranda
Hi, I'm developing a web application, that contains a REST service in the Tomcat, that receives several requests per second. The REST requests do research in a Lucene index, to do this i use the IndexSearch. My questions are: - There are concurrency problems in multiple research? - What the best

Re: Lucene Concurrent Search

2013-09-05 Thread David Miranda
rcherManager sm = new SearcherManager(dir, new SearcherFactory()); > > to get default behaviour. The javadocs for SearcherFactory explain > that you can write your own implementation if you want custom > behaviour such as warming. > > > -- > Ian. > > > On Thu, Sep 5, 2013 a

Re: Lucene Concurrent Search

2013-09-05 Thread David Miranda
- Search from 1 Million open source projects. > > > > On Thu, Sep 5, 2013 at 6:46 AM, David Miranda >wrote: > > > Hi, > > > > I'm developing a web application, that contains a REST service in the > > Tomcat, that receives several requests per second. &

Re: Lucene Concurrent Search

2013-09-05 Thread David Miranda
> > > I use a singleton class but there are other ways in tomcat. Can't > > remember what - maybe application scope. > > > > > > -- > > Ian. > > > > > > On Thu, Sep 5, 2013 at 4:46 PM, David Miranda > > > wrote: > > >

Re: Lucene Concurrent Search

2013-09-06 Thread David Miranda
; } > } > > Usually does the trick. You need to put some parameters ("indexDir") into > your web.xml and make sure that it knows that SearchListener is a > ServletListener for your Web app. > > But, to re-iterate what someone else said: if you really just want RE

java.io.IOException: background merge hit exception: Caused by: java.io.FileNotFoundException: _1vw.fdt

2010-07-25 Thread David Sitsky
uler$MergeThread.run(ConcurrentMergeScheduler.java:291) -- Cheers, David - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: java.io.IOException: background merge hit exception: Caused by: java.io.FileNotFoundException: _1vw.fdt

2010-07-25 Thread David Sitsky
Correction - this was with Lucene 2.9.3. On 26 July 2010 14:21, David Sitsky wrote: > Hi, > > A customer has been indexing a very large collection of documents that > has been running over many days using 2.9.0.  During the optimisation > stage, the following error occurred, an

Re: java.io.IOException: background merge hit exception: Caused by: java.io.FileNotFoundException: _1vw.fdt

2010-07-25 Thread David Sitsky
continued to add more documents to the index after the restart without issue, before doing the final optimise operation, which is where the above error comes from, like the trace I showed before. On 26 July 2010 14:39, David Sitsky wrote: > Correction - this was with Lucene 2.9.3. > >

Re: java.io.IOException: background merge hit exception: Caused by: java.io.FileNotFoundException: _1vw.fdt

2010-07-26 Thread David Sitsky
Running CheckIndex -fix fixed both indexes which is a relief. In both cases, it said 1 broken segment containing 1 document detected. So any ideas on what might have caused this in the first place? On 26 July 2010 16:58, David Sitsky wrote: > As another data point, this happened on anot

Re: java.io.IOException: background merge hit exception: Caused by: java.io.FileNotFoundException: _1vw.fdt

2010-07-26 Thread David Sitsky
dexing work to disk. Its interesting in both instances, CheckIndex said there was 1 broken segment containing 1 document. Cheers, David On 26 July 2010 22:24, Michael McCandless wrote: > It's great that CheckIndex resolved it, but I'd like to get to the > root cause if possib

Re: java.io.IOException: background merge hit exception: Caused by: java.io.FileNotFoundException: _1vw.fdt

2010-07-27 Thread David Sitsky
Hi Mike, Definitely no OOME (or other critical errors), and if there were, we would have terminated the program straight away. Cheers, David On 28 July 2010 01:44, Michael McCandless wrote: > Were there any exceptions during indexing, before the > TerminateProcess() call?  EG OOME? &g

Re: java.io.IOException: background merge hit exception: Caused by: java.io.FileNotFoundException: _1vw.fdt

2010-07-27 Thread David Sitsky
e if fsync() equivalent is called, that the pending I/O operations can't be cancelled, but I do wonder? On 28 July 2010 15:04, David Sitsky wrote: > Hi Mike, > > Definitely no OOME (or other critical errors), and if there were, we > would have terminated the program straight away. &g

Cannot view open issues in Hudson

2010-10-14 Thread David Clarke
Hey Guys Whenever I try to view open issues in hudson it doesn't display any information. Does anyone know why this is the case or how I could fix it? Thanks in advance -Dave Clarke

Re: determining the type of a term - retrieving a payload

2010-10-15 Thread David Causse
ot something like this : // goto to the doc with skipTo(int internalId) or next() // Iterate over positions for(int i = 0; i < currentTermPos.freq(); i++) { int p = currentTermPos.nextPosition(); payloadBuffer = currentTermPos.getPayload(payloadBuffer, 0); ... } --

Search returning documents matching a NOT range

2010-11-04 Thread David Fertig
I have an active lucene implementation that has been in place for a couple years and was recently upgraded to the 3.02 branch. We are now occasionally seeing documents returned from searches that should not be returned. I have reduced the code and indexes to the smallest set possible where I can st

RE: RangeQuery with multiple ranges ?

2010-11-05 Thread David Fertig
You can use a query parser to parse your text query into the appropriate query objects. -Original Message- From: Alain Camus [mailto:a...@ngi.be] Sent: Friday, November 5, 2010 11:26 AM To: java-user@lucene.apache.org Subject: RangeQuery with multiple ranges ? Hello list, I'm new to l

RE: Search returning documents matching a NOT range

2010-11-05 Thread David Fertig
OT range Do you get the bogus hit on the small index if search that index alone? Are you positive it only holds the one doc? Loading the one doc into a new RAM based index in the test would prove it. You are more likely to get help if post a self-contained example - people can see everything rel

RE: Search returning documents matching a NOT range

2010-11-07 Thread David Fertig
index (as expected) and those terms are not in the second index. David -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Sunday, November 7, 2010 11:12 AM To: java-user@lucene.apache.org Subject: Re: Search returning documents matching a NOT range What kind o

RE: Antw.: Search returning documents matching a NOT range

2010-11-08 Thread David Fertig
ut may try using Ian's more uniformly generated index. Should I add a JIRA ticket for this or will someone else take care of this? Hopefully I can add my email so I can track this issue. David -Original Message- From: Ian Lea [mailto:ian@gmail.com] Sent: Monday, November 8

RE: Search returning documents matching a NOT range

2010-11-17 Thread David Fertig
I noticed there is still no JIRA ticket for this, do we have any type on consensus on how this issue will/will not be resolved? If MultiSearcher and and MultiReader do not give the same results, I would think one would be considered "broken" and/or possibly "unfixable". Is MultiSearcher goin

RE: best practice: 1.4 billions documents

2010-11-21 Thread David Fertig
Actually I've been bitten by an still-unresolved issue with the parallel searchers and recommend a MultiReader instead. We have a couple billion docs in our archives as well. Breaking them up by day worked well for us, but you'll need to do something. -Original Message- From: Luca Ronda

RE: best practice: 1.4 billions documents

2010-11-22 Thread David Fertig
ni [mailto:luca.rondan...@gmail.com] Sent: Monday, November 22, 2010 1:47 AM To: java-user@lucene.apache.org Subject: Re: best practice: 1.4 billions documents Hi David, thanks for your answer. it really helped a lot! so, you have an index with more than 2 billions segments. this is pretty

RE: best practice: 1.4 billions documents

2010-11-22 Thread David Fertig
hings like warming searchers, indexing additional documents, or filling FieldCache in parallel. Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: David Fertig [mailto:dfer...@cymfony.com] >

a proof that every word is indexing properly

2010-12-01 Thread David Linde
a new search engine, is there anyway to logically prove, that lucene indexes every word properly? One idea we considered is attempting to rebuild the source from the index, but it seems like doing that would take a huge effort. Any ideas or thoughts, would be very much appreciated. Thanks in advance David

RE: a proof that every word is indexing properly

2010-12-01 Thread David Fertig
Stop words are never indexed; you may need to empty your stop list. Luke (open-source w/code available) can browse and re-create documents in indexes using their terms already. Compare that to the original to see if you are satisfied. -Original Message- From: David Linde

ParallelReader

2011-02-21 Thread David Saile
there maybe any other approaches around, that I overlooked? What do I need to be aware of, when using two parallel indices? Thanks for any help! David - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.or

RE: Lucene paid support

2011-03-03 Thread Jarrin, David
issues we might encounter within Lucene itself. I will look into LucidImagination and see what they offer. Thanks again for you kind response, -David. -Original Message- From: Paul Libbrecht [mailto:p...@hoplahup.net] Sent: Thursday, March 03, 2011 4:26 PM To: java-user@lucene.apache.org

RE: Lucene paid support

2011-03-03 Thread Jarrin, David
Very helpful! Thank you Ahmet! -David. -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: Thursday, March 03, 2011 5:24 PM To: java-user@lucene.apache.org Subject: RE: Lucene paid support > Thanks for the quick reply. Sorry I was vague in my message. We

Re: Is ConcurrentMergeScheduler useful for multiple running IndexWriter's?

2011-03-11 Thread David Causse
short lived thread, (mostly due to a not so smart IW usage, the new NRT Reader helps in this way). A good idea would be MergeScheduler implementation that accept an application controlled thread pool, some sort of ExecutorServiceMergeScheduler. Regards. -- David

Re: FieldSelector with Lucene 2.3.2

2011-03-17 Thread David Causse
lector that do the whole job on a doc by doc basis and not collecting and saving all docs in a Collection. -- David Causse Spotter http://www.spotter.com/ - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Wanted: a directory of quick-and-(not too)dirty analyzers for multi-language RDF.

2011-03-22 Thread David Causse
use multiple analyzers at index time you'll have to use multiple analyzers at query time (tricky part of the process). Regards. -- David Causse Spotter http://www.spotter.com/ - To unsubscribe, e-mail: java-user-unsubscr...

Re: using 2 different Analyzer for indexing ?

2011-03-25 Thread David Causse
d your two indexed fields in the same Document object. Regards. -- David Causse Spotter http://www.spotter.com/ - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Update Document based on Query instead of Term

2011-04-13 Thread David Causse
ateDocument(Query query, Document doc) Hi, as updateDocument(Term t, Document d) is just a delete + add, you can use : IndexWriter.delete(Query query); IndexWriter.add(Document d); Regards. -- David Causse Spotter http://www.spotter.com/

Adding Lucene Search Capability to an Android app

2011-08-15 Thread David Linde
I am wanting to add lucene searching capability to my android app. I have trying to find out if anyone has done this yet? IF its possible? and what problems, or draw backs other people have had adding lucene to their droid app. I know the Droid has several different search options like SQLite FTS a

Re: SSD Experience (on developer machine)

2011-08-24 Thread David Nemeskey
tic' data (data of cells that have not been written to for a long time) to the 5GB in question and use the 'fresher' cells as free space; this could be done in a round-robin fashion. Do SSDs (or some one them) implement a similar functionality? Or alternative

Scoring in Lucene

2011-10-05 Thread David Ryan
Hi, The defaulting scoring in Lucene uses tf x idf^2 instead of tf x idf . Does any have have insight that why not using tf x idf? Here is the note on score calculation. https://lucene.apache.org/java/3_4_0/api/core/org/apache/lucene/search/Similarity.html

idf calculation in Lucene ?

2011-10-20 Thread David Ryan
According to https://lucene.apache.org/java/3_4_0/api/core/org/apache/lucene/search/Similarity.html idf(t) = 1 + log ( numDocs/(docFreq+1)) For example, in the following example, ln(26

Re: idf calculation in Lucene ?

2011-10-31 Thread David Ryan
Thanks! Is there any way to extend the Similarity class to overwrite the behavior (e.g., using the max idf instead of the sum of each term idfs)? On Thu, Oct 27, 2011 at 5:41 AM, Robert Muir wrote: > On Thu, Oct 20, 2011 at 3:11 PM, David Ryan wrote: > > > > > However, in

Re: idf calculation in Lucene ?

2011-11-02 Thread David Ryan
ion, > org.apache.lucene.search.Searcher) > > On Mon, Oct 31, 2011 at 5:24 PM, David Ryan wrote: > > Thanks! Is there any way to extend the Similarity class to overwrite the > > behavior (e.g., using the max idf instead of the sum of each term idfs)? > > > > > > On Thu, Oct 27, 2011 at 5:

Search in a specific ScoreDoc result

2013-09-17 Thread David Miranda
Hi, I want to do a kind of 'facet search', that initial research in a field of all documents in the Lucene index, and second search in other field of the documents returned to the first research. Currently I'm do the first research : Query q = new QueryParser(Version.LUCENE_43, "label", analyzer

BooleanScorer - Maximum Prohibited Scorers?

2014-04-16 Thread David Stimpert
Hello, I have found useful functionality in BooleanQuery which allows me to specify a minimum number of matching optional terms (i.e. setMinimumNumberShouldMatch). I do not, however, see similar functionality available for setting the maximum number of MUST_NOTs (i.e. setMaximumNumberMustNotMatch)

RE: best choice for ramBufferSizeMB

2014-05-16 Thread Baldwin, David
Is this true as well for 2.9.2? -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Wednesday, May 14, 2014 8:54 AM To: Lucene Users Subject: Re: best choice for ramBufferSizeMB Generally larger is better, as long as JVM's heap is big enough to allow IW t

RE: Lucene Upgrade from 2.9.x to 4.7.x

2014-05-29 Thread Baldwin, David
I am looking for the same. Need to upgrade from 2.9.2 . -Original Message- From: Buddhavarapu, Suresh [mailto:suresh.buddhavar...@emc.com] Sent: Thursday, May 29, 2014 7:57 AM To: java-user@lucene.apache.org Subject: Lucene Upgrade from 2.9.x to 4.7.x Hello, I'm looking for some docum

RE: Lucene Upgrade from 2.9.x to 4.7.x

2014-05-29 Thread Baldwin, David
I am curious what type of time-frame it took you to do the upgrade, including acceptable unit and QA tesing? -Original Message- From: Xiaolong Zheng [mailto:xiaolong.zh...@mathworks.com] Sent: Thursday, May 29, 2014 9:14 AM To: java-user@lucene.apache.org Subject: Re: Lucene Upgrade from

Re: How to handle words that stem to stop words

2014-07-07 Thread David Murgatroyd
rom the web.) The work integrating OpenNLP <https://issues.apache.org/jira/browse/LUCENE-2899> might also prove helpful. Best, David Murgatroyd ww.linkedin.com/in/dmurga/ <http://www.linkedin.com/in/dmurga/> [image: Inline image 1] On Mon, Jul 7, 2014 at 5:53 PM, Sujit Pal wrote: &g

2.9.2 Memory issue 8.0GB or more / OOM with Term / Highlighting

2014-07-30 Thread Baldwin, David
I am looking to track down an issue in 2.9.2 where during highlighting, certain data may cause rapid memory usage and OOM exception in java: --- java.lang.OutOfMemoryError: Java heap space at org.apache.lucene.analysis.Token.growTermBuffer(Token.java:470) at

How to properly correlate relevance in a search across multiple collections

2014-09-05 Thread Baldwin, David
anyone have any good experience with this and could they share some tid-bits or info I may not have run across yet? -David - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java

RE: How to properly correlate relevance in a search across multiple collections

2014-09-08 Thread Baldwin, David
Would it be possible, or does anyone have any experience, in using the raw score from each separate collection to order and then after a merge come up with relevancy? -Original Message- From: atawfik [mailto:contact.txl...@gmail.com] Sent: Sunday, September 07, 2014 9:50 AM To: java-use

RE: How to properly correlate relevance in a search across multiple collections

2014-09-08 Thread Baldwin, David
as performant as a simple non-df similarity approach. It might also be nice for apps to offer users pure-tf scoring if it provides faster search results, and then the user could click on a "refine results" button to re-do the search with the more expensive cross-corpus df-based

RE: How to properly correlate relevance in a search across multiple collections

2014-09-08 Thread Baldwin, David
ested. Best, Erick On Mon, Sep 8, 2014 at 8:59 AM, Baldwin, David wrote: > Would it be possible, or does anyone have any experience, in using the raw > score from each separate collection to order and then after a merge come up > with relevancy? > > -Original Message- &

RE: How to properly correlate relevance in a search across multiple collections

2014-09-09 Thread Baldwin, David
g. Anyone? -Original Message- From: atawfik [mailto:contact.txl...@gmail.com] Sent: Tuesday, September 09, 2014 12:42 AM To: java-user@lucene.apache.org Subject: RE: How to properly correlate relevance in a search across multiple collections Hi David, It seems that MultiSearcher is deprecated i

Issues with lucene 4.10.0 on android

2014-09-25 Thread David Linde
“DirectoryReader.” try { sis.read(dir, fileName); } catch (NoSuchFileException | FileNotFoundException var10) { sis = null; } Is this something you would consider changing to make it compatible with android? Thanks in Advance, David

Re: Search "_all" field with a term

2014-10-11 Thread David Pilato
You should ask this on the elasticsearch mailing list. BTW, look at elasticsearch copy_to feature. Better than _all field. My 2 cents. -- David ;-) Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs > Le 11 oct. 2014 à 11:31, "haiwei.xie-soulinfo" a > écrit : > > H

  1   2   3   >