Re: [VOTE] Lucene logo contest, third time's a charm

2020-09-03 Thread jim ferenczi
A1 (binding) Le jeu. 3 sept. 2020 à 07:09, Noble Paul a écrit : > A1, A2, D binding > > On Thu, Sep 3, 2020 at 7:22 AM Jason Gerlowski > wrote: > > > > A1, A2, D (binding) > > > > On Wed, Sep 2, 2020 at 10:47 AM Michael McCandless > > wrote: > > > > > > A2, A1, C5, D (binding) > > > > > > Than

Re: [VOTE] Lucene logo contest

2020-06-17 Thread jim ferenczi
I vote option A (PMC vote) Le mer. 17 juin 2020 à 14:24, Felix Kirchner < felix.kirch...@uni-wuerzburg.de> a écrit : > A > > non-PMC > > Am 16.06.2020 um 00:08 schrieb Ryan Ernst: > > Dear Lucene and Solr developers! > > > > In February a contest was started to design a new logo for Lucene [1]. >

[ANNOUNCE] Apache Lucene 8.0.0 released

2019-03-14 Thread jim ferenczi
14 March 2019, Apache Lucene™ 8.0.0 available The Lucene PMC is pleased to announce the release of Apache Lucene 8.0.0. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full

[ANNOUNCE] Apache Lucene 7.7.0 released

2019-02-11 Thread jim ferenczi
11 February 2019, Apache Lucene™ 7.7.0 available The Lucene PMC is pleased to announce the release of Apache Lucene 7.7.0. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires f

[ANNOUNCE] Apache Lucene 7.5.0 released

2018-09-24 Thread jim ferenczi
24 September 2018, Apache Lucene™ 7.5.0 available The Lucene PMC is pleased to announce the release of Apache Lucene 7.5.0. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires

[ANNOUNCE] Apache Lucene 7.2.1 released

2018-01-15 Thread jim ferenczi
15 January 2018, Apache Lucene™ 7.2.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 7.2.1. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires fu

[ANNOUNCE] Apache Lucene 6.5.1 released

2017-04-27 Thread jim ferenczi
27 April 2017, Apache Lucene™ 6.5.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 6.5.1 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-

[ANNOUNCE] Apache Solr 6.5.1 released

2017-04-27 Thread jim ferenczi
27 April 2017, Apache Lucene™ 6.5.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 6.5.1 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-

[ANNOUNCE] Apache Lucene 6.5.0 released

2017-03-27 Thread jim ferenczi
27 March 2017 - Apache Lucene™ 6.5.0 available The Lucene PMC is pleased to announce the release of Apache Lucene 6.5.0. Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full

[ANNOUNCE] Apache Lucene 6.4.0 released

2017-01-23 Thread jim ferenczi
possible that the mirror you are using may not have replicated the release yet. If that is the case, please try another mirror. This also applies to Maven access. Thanks, Jim Ferenczi

Re: Leftover write.lock file

2016-05-22 Thread Jianbao (Jim) Tao
Never mind, folks. Just found an answer: http://www.gossamer-threads.com/lists/lucene/java-user/110708 On Sun, May 22, 2016 at 10:20 AM, Jianbao (Jim) Tao wrote: > Hi, > > I am new to lucene. I was following the example, IndexFiles, from the > lucene demo package. However, one

Leftover write.lock file

2016-05-22 Thread Jianbao (Jim) Tao
Hi, I am new to lucene. I was following the example, IndexFiles, from the lucene demo package. However, one thing I find surpring is that the write.lock file is left over, even when the IndexWriter#close() method is called. Below are details. Environment: OS X Yosemite java version "1.8.0_72" Jav

RE: QueryParser for DisjunctionMaxQuery, et al.

2013-07-23 Thread Beale, Jim (US-KOP)
Oh, sorry! Jim Beale hibu.com 2201 Renaissance Boulevard, King of Prussia, PA, 19406 Office: 610-879-3864 Mobile: 610-220-3067 -Original Message- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Tuesday, July 23, 2013 1:42 PM To: java-user@lucene.apache.org Subject: Re

RE: QueryParser for DisjunctionMaxQuery, et al.

2013-07-23 Thread Beale, Jim (US-KOP)
our log files. I have to admit that I didn't understand what you meant by query enrichment. Where can I find more information about that? Thanks! Jim Beale hibu.com 2201 Renaissance Boulevard, King of Prussia, PA, 19406 Office: 610-879-3864 Mobile: 610-220-3067 -Original Me

QueryParser for DisjunctionMaxQuery, et al.

2013-07-23 Thread Beale, Jim (US-KOP)
r the full cases. Thanks for any info/guidance. Cheers, Jim Beale Lead Developer Hibu.com The information contained in this email message, including any attachments, is intended solely for use by the individual or entity named above and may be confidential. If the reader of this message

Indexing into SolrCloud

2013-07-18 Thread Beale, Jim (US-KOP)
;t find anything useful on the usual places. I suppose I could setup wireshark to see what is happening but I'm hoping that someone has a better suggestion. Thanks in advance for any help! Best regards, Jim Beale hibu.com 2201 Renaissance Boulevard, King of Prussia, PA, 19406 Office: 610

Block tree terms dict & index

2013-04-30 Thread Beale, Jim (US-KOP)
7;m still not clear about how best to employ the codec. Has anyone had any experience with it? Thanks for any and all insights. Best regards, Jim Beale The information contained in this email message, including any attachments, is intended solely for use by the individual or entity named abo

Re: Grouping Clauses to Preserve Order of Boolean Precedence

2011-08-05 Thread Jim Swainston
Brilliant, that looks perfect. We're currently using an older version of Lucene in which this was an experimental class. Looks like we should upgrade. Thanks Jim On 5 August 2011 02:10, Trejkaz wrote: > On Fri, Aug 5, 2011 at 1:57 AM, Jim Swainston > wrote: > > So if

Re: Grouping Clauses to Preserve Order of Boolean Precedence

2011-08-04 Thread Jim Swainston
es I want my program to work out that this should be grouped as the following (as AND has higher precedence than OR): (Marketing AND Smith) OR Davies. I'm effectively looking for an algorithm that will properly group any number of terms.. Thanks Jim On 4 August 2011 16:47, Chris

Re: Grouping Clauses to Preserve Order of Boolean Precedence

2011-08-04 Thread Jim Swainston
But the query parser doesn't seem to do that for me with the input Marketing AND Smith OR Davies. The query parser gives me 3 clauses. 1 must clause for the term Marketing, 1 must clause for the term smith and 1 should clause for the term Davies. e.g. +Marketing +Smith SHOULD Davies. What I would l

Re: Grouping Clauses to Preserve Order of Boolean Precedence

2011-08-03 Thread Jim Swainston
Thanks Ian. How would you achieve the logic of the below query using BooleanQuery and BooleanClause.occur? How would you achieve the grouping effect? (Marketing AND Smith) OR Davies Thanks a lot. Jim On 3 August 2011 14:54, Ian Lea wrote: > I don't think there is an easy way. Brac

Grouping Clauses to Preserve Order of Boolean Precedence

2011-08-03 Thread Jim Swainston
I work out where to place the brackets if the query was something like: Marketing AND Smith OR Davies OR Management OR Business AND Science.? Can any suggest an effective way to group clauses so that the order of Boolean precedence is preserved? Thanks very much. Jim

StandardAnalyzer exclude numbers

2008-09-22 Thread jim
Hello Is it possible to exclude numbers using StandardAnalyzer just like SimpleAnalyzer? - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Re: Highlight huge documents

2008-08-11 Thread jim
it works!! Thanks I believe Highlighter.setMaxDocBytesToAnalyze(int byteCount) should be used for this. On Mon, Aug 11, 2008 at 11:40 AM, <[EMAIL PROTECTED]> wrote: > Hello > > I am using Highlighter to highlight query terms in documents getting from a > database founded from lucene search.

Highlight huge documents

2008-08-11 Thread jim
Hello I am using Highlighter to highlight query terms in documents getting from a database founded from lucene search. My problem is that when i display the full document, highlighter works fine for most of documents but if the document is huge the highlighter returns only a part of documen

Highlighting terms with different style

2008-07-10 Thread jim
Hi Is it possible to Hightlight more than one terms with highlighter but with different style for each term ?? 1st term with SimpleHTMLFormatter("", ""); 2rd term with SimpleHTMLFormatter("", ""); .. n-th term with SimpleHTMLFormatter("", ""); or for foloween code SimpleHTMLFormatter

Re: Highlight an Greek

2008-06-30 Thread jim
Hi Sorry about doubble messaging, it was from mistake I am working on an application that searches documents with lucene. The documents are taken from a database and the basic field is in HTML. The index and searching works fine with Greek characters. I am using the lucene to take the ID a

Highlight and greek characters

2008-06-26 Thread jim
Hello i am having the following code to highlight a text public String highlight(String text, String query ) throws IOException { TermQuery query = new TermQuery(new Term("f", query)); QueryScorer scorer = new QueryScorer(query); SimpleHTMLFormatter formatter = new SimpleHTMLForm

Highlight an Greek

2008-06-26 Thread jim
Hello i am having the following code to highlight a text public String highlight(String text, String query ) throws IOException { TermQuery query = new TermQuery(new Term("f", query)); QueryScorer scorer = new QueryScorer(query); SimpleHTMLFormatter formatter = new SimpleHTMLForm

Re: regex expressions within phrase queries

2008-02-25 Thread Jim Bogan
Thanks for the advice Chris. What I am working on now is extracting the matching phrases. The current code for MultiPhraseQuery and SpanQueries just return all matching terms, not matching phrases. I implemented some code matching up the TermPositions, but this is pretty slow. Is there any way

Re: regex expressions within phrase queries

2008-02-18 Thread Jim Bogan
By custom phrase query class I was trying to ask if it would be possible, or even a good idea, to create a modified PhraseQuery class that is more efficient that span queries (as I only want to use it for phrases). This class might have multiple possible terms generated from a regex at a certain po

regex expressions within phrase queries

2008-02-14 Thread Jim Bogan
I would like to be able to handle the following: "/\d\d\d{4} \\d\\d/ office" Where / indicates a regex expression phrase. One option is extending MultiFieldQueryParser and catching the phrase within getFieldQuery evaluating whether /, the regex identifier, is present and then returning a SpanNe

Re: Indexing MSword Documents

2007-06-09 Thread jim shirreffs
thanks the apprach you and Donna Gresh suggested worked out fine. I now have a much better understanding of the Document class. here is the create Document code in case another newie is interested. as more mine types are added I will expand the in if thanks again jim s public class

Re: Indexing MSword Documents

2007-06-08 Thread jim shirreffs
taking the time to reply jim s - Original Message - From: "Mathieu Lecarme" <[EMAIL PROTECTED]> To: Sent: Friday, June 08, 2007 12:48 PM Subject: Re: Indexing MSword Documents Why don't use Document? http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightl

Re: Indexing MSword Documents

2007-06-08 Thread jim shirreffs
many thanks I will try that, thanks again! jim s - Original Message - From: "Donna L Gresh" <[EMAIL PROTECTED]> To: Sent: Friday, June 08, 2007 12:52 PM Subject: Re: Indexing MSword Documents I do this exact thing. "text" (the second input to the Field co

Indexing MSword Documents

2007-06-08 Thread jim shirreffs
Hi, I am trying to index msword documents. I've got things working but I do not think I am doing things properly. To index msword docs I use an extractor to extract the text. Then I write the text to a .txt file and index that using an HTMLDocument object. Seems to me that since I have the te

Re: IndexWriter.Optimize() is too slow and IOException! How Can I do?

2007-06-08 Thread jim shirreffs
I am trying to index msword documents. I’ve got things working but I do not think I am doing things properly. To index msword docs I use an extractor to extract the text. Then I write the text to a .txt file and index that using an HTLMDocument object. Seems to me that since I have the text

Re: Indexing PDF document

2007-06-06 Thread jim shirreffs
ok thnaks found FontBox/jar on the net, but now I see the included jars with pdfbox. I expected them to be in /lib not /external my bad. thanks again jim s - Original Message - From: "Ben Litchfield" <[EMAIL PROTECTED]> To: Sent: Wednesday, June 06, 2007 6:1

Re: Indexing PDF document

2007-06-06 Thread jim shirreffs
e and got java.lang.NoClassDefFoundError: org/fontbox/afm/AFMParser googling trying to find the jar that contains AFMPaser thanks for the help jim s - Original Message - From: "Chris Hostetter" <[EMAIL PROTECTED]> To: Sent: Wednesday, June 06, 2007 5:33 PM Subject: Re: Indexing PDF document :

Indexing PDF document

2007-06-06 Thread jim shirreffs
r easy method to extract text from a pdf file I would centrainly like to know. I can live without openoffice (for a while) but not being able to index pdf would be a Lucene show stopper. thanks jim s - To unsub

Re: Indexing help needed

2007-05-25 Thread jim shirreffs
code up a Reader the just spites out "Here I am" a few hundred times and see what happens. LOL. thank you for the reply and advice. jim s - Original Message - From: "Andrzej Bialecki" <[EMAIL PROTECTED]> To: Sent: Friday, May 25, 2007 1:10 PM Subject: R

Indexing help needed

2007-05-25 Thread jim shirreffs
here. It is kind of important that I learn how to add different files types like OO or AutoCad, so we can make a build (with Lucene) or buy call. Thanks to all that try to help me out Jim S P.S. If I get it working I will be happy to email post the code. -

Re: CAD files, Images

2007-05-23 Thread jim shirreffs
thank you for the reply, I knew the answer but was compelled to ask anyway. CAD files like AutoCad/ProE/CaTia do contain some useful text and it is possible to get at that and index it. But mostly it's vectors and there is not much a text engine can do with a vectors. thanks again.

CAD files, Images

2007-05-23 Thread jim shirreffs
Is it possibe to index CAD formats such as AutoCad or CGM? I know some commecail products (excalaber) claim to be able to do that? If so what about TIFF? thanks jim s - To unsubscribe, e-mail: [EMAIL PROTECTED] For

Indexing Open Office documents

2007-05-17 Thread jim shirreffs
Anyone know how to add OpenOffice document to a Lucene index? Is there a parser for OpenOffice? thanks in advance jim s. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Implementing lagre secure Lucene search system questions.

2007-05-03 Thread jim shirreffs
aster index or index locally and then merge a tiny index into the master index? Thanks to any and all that take the time to advice me. jim s. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Merging Indeces

2007-04-21 Thread jim shirreffs
OK I get it, thanks for the info, just told my boss the opposite, guess I will have to send another email. thanks again jim s - Original Message - From: "Erick Erickson" <[EMAIL PROTECTED]> To: Sent: Saturday, April 21, 2007 8:32 AM Subject: Re: Merging Indeces

Re:Merging Indeces

2007-04-21 Thread jim shirreffs
"Lucene has no concept of "document identity" in that you can index the same document 15 times in a row and Lucene will have 15 entries. " Is this true? When ever I run the demo indexing logic document already indexed are skipped. What am I missing.

Can indexing logic on one host update an index on another host?

2007-04-20 Thread jim shirreffs
added but update the master index on the main application host. I read blurbs about merge and think maybe I might be able to do something with that, but I am unsure how that would work or if there is another way. Any ideas, suggestions would be very much appreciated

Re: Newbie needs help "addField"

2007-04-19 Thread jim shirreffs
Thanks to Karl and Donna, I followed your suggestions and was able to get a test driver (modified demo code) working, thanks again. jim s - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL

Newbie needs help "addField"

2007-04-18 Thread jim shirreffs
as a guide. If some could show me how to use fields add/retrieve (assuming that is the correct way to go) I would greatly appreciate it. thanks in advance jim s - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Searching missing documents after doing an addIndexes

2006-05-22 Thread Jim Wilson
= new IndexSearcher(current.getDirectory()); } } Jim Wilson Colorado Springs, CO 719-266-4431 (Home) 719-661-6768 (Cell) [EMAIL PROTECTED] IM:jwilsonsprings Registered Linux User # 302849 - To unsubscribe, e-mail: [EMAIL PROTECTED] For additio

Install problem

2006-03-28 Thread Jim Douglas
mmon-build.xml, and build-deprecated.xml?? Jim - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Install issue

2006-03-26 Thread Jim Douglas
mmon-build.xml, and build-deprecated.xml?? Jim - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Setting the COMMIT lock timeout.

2006-03-15 Thread Jim Bedford-roberts
: Setting the COMMIT lock timeout. On Dienstag 14 März 2006 10:52, Jim Bedford-roberts wrote: > Admittedly we could have developed a single central search component, > but given the way the code has evolved our current approach is simplest > for us. With separate application instances shari

RE: Setting the COMMIT lock timeout.

2006-03-14 Thread Jim Bedford-roberts
Thanks for your prompt response! You ask about the use case. We have a series of similar intranet sites, each represented by a separate tomcat application instance using the same code base but with different start-up parameters. The intranets all provide a common search function based on the sam

Setting the COMMIT lock timeout.

2006-03-13 Thread Jim Bedford-roberts
yone point me in the right direction? Thanks Jim

Re: grouping results by fields

2006-01-29 Thread Jim Powers
a bitset (you do this by traversing the index directly for each word). Call these bitsets WBS(n) (word bit set n. n spans 1..m where m is the total number of words you want to count against) Finally: to get a count per word bit-wise AND each WBS(n) with FRSBS and count up the 1s Jim Powers On Sun

Lucene does not delete old index files after re-indexing?

2005-10-09 Thread Jim Coble
o the IndexWriter constructor would cause it to remove the old index files once it had created the new ones. Have I misunderstood something? Or done something incorrect? Thanks. --Jim ===== Jim Coble Head, Instructional Technology Applicat