from:"Michael Prichard"

Re: Lucene Search Performance

2008-02-27 Thread Michael Prichard

I'm wondering if your date field's precision may be a little too much? What I mean is that you are going all the way down to seconds. Whenever you do a range query you are essentially spawning a BooleanQuery with a representation of that range. Do you really need to be that precise? I u

Re: is lucene fit for doing this?

2008-01-29 Thread Michael Prichard

I would say Lucene is a capable of helping you do this. Remember that it is a set of libraries and you have to build the functionality you need with that. BUT with a little planning and elbow grease you will be able to use it to create a great search engine. Good luck. On Jan 29, 2008,

Matching w/in X% ?

2008-01-21 Thread Michael Prichard

Say I have a field of To addresses from an email archive. I do a search and I get 10 To addresses for a single hit. Then I want to find similar email with the To addresses containing roughly 75% of those email addresses as well. How would I do this? In other words: I get a result with: To:

Highlighting with wildcards?

2008-01-18 Thread Michael Prichard

I have highlighter working out of the box but now I need to highlight wildcards (if they are there). I would imagine someone has done this on this group :) Any suggestions? Thanks! Michael - To unsubscribe, e-mail: [EMAIL

IndexWriter.deleteDocument()

2008-01-15 Thread Michael Prichard

When I run through and delete a few documents from my index, is it wise to call .flush() afterwards? Or is it better to close the index? Thanks! Michael - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-m

Re: Sorting on tokenized fields

2008-01-08 Thread Michael Prichard

rong field, that not only does not exist yet but will never exist. To prevent hidden bugs cases like this it is important to throw an exception in this case. Do you agree? On Jan 8, 2008 11:21 PM, Michael Prichard <[EMAIL PROTECTED]> wrote: Meaning that it says "field is no

Re: Sorting on tokenized fields

2008-01-08 Thread Michael Prichard

Meaning that it says "field is not indexed". Where is sortMissingLastAttribute? thanks. On Jan 8, 2008, at 4:13 PM, Ryan McKinley wrote: what do you mean by "fail"? -- there is the sortMissingLast attribute Michael Prichard wrote: ok... i should read the manual more

Re: Sorting on tokenized fields

2008-01-08 Thread Michael Prichard

yetI run a sort on BCC.I think it fails now...anyway around that? On Jan 8, 2008, at 1:22 PM, Michael Prichard wrote: Is it possible to sort on a tokenized field? For example, I break email address into pieces, i.e. [EMAIL PROTECTED] becomes [EMAIL PROTECTED] michael.prichard

Sorting on tokenized fields

2008-01-08 Thread Michael Prichard

Is it possible to sort on a tokenized field? For example, I break email address into pieces, i.e. [EMAIL PROTECTED] becomes [EMAIL PROTECTED] michael.prichard michael prichard email.com email so when sorting on this field I get some strange results. Do I need to create another field

Re: Office 2007

2007-11-08 Thread Michael Prichard

out it. -Grant On Nov 8, 2007, at 8:37 AM, Michael Prichard wrote: Hello, I know this has gone around a bit but anyone had any success with pulling text from Office 2007 files? Any recommendations? Thanks, Michael

Office 2007

2007-11-08 Thread Michael Prichard

Hello, I know this has gone around a bit but anyone had any success with pulling text from Office 2007 files? Any recommendations? Thanks, Michael - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail:

Re: Threading Indexing Processes : Can we write concurrently to Index?

2007-10-30 Thread Michael Prichard

So I should create a singe IndexWriter but allow multiple threads to use it? On Oct 30, 2007, at 12:49 AM, Karl Wettin wrote: 30 okt 2007 kl. 05.08 skrev Michael Prichard: If I multithread an indexing process can two or more processes write to the same index? Multiple threads can add

Threading Indexing Processes : Can we write concurrently to Index?

2007-10-29 Thread Michael Prichard

The subject pretty much describes the issue. If I multithread an indexing process can two or more processes write to the same index? My knowledge of Lucene says no but I am hoping that I am missing something. Also, what is a good solution? Thanks. Michael --

extracting non-english text from word, pdf, etc....??

2007-07-31 Thread Michael Prichard

I know how to do english text with POI and PDFBox and so on. Now, I want to start indexing non-english language such as french and spanish. Which extraction libs are available for me? I want to do: Excel Word PowerPoint PDF HTML RTF Thanks! Michael --

Re: Too many open files?!

2007-02-14 Thread Michael Prichard

://wiki.apache.org/jakarta-lucene/ LuceneFAQ#head-48921635adf2c968f79 36dc07d51dfb40d638b82 -Original Message- From: Michael Prichard [mailto:[EMAIL PROTECTED] Sent: Wednesday, February 14, 2007 5:02 PM To: java-user@lucene.apache.org Subject: Too many open files?! I am getting this exception

Too many open files?!

2007-02-14 Thread Michael Prichard

I am getting this exception: Exception in thread "main" java.io.FileNotFoundException: /index/_gna.f13 (Too many open files) This is happening on a SLES10 (64-bit) box when trying to index 18k items. I can run it on a much lesser SLES9 box without any issues. Any ideas?! Thanks, Michael ---

Re: Query does not work past 26 characters?!

2006-07-20 Thread Michael Prichard

ARRRGH!!! That's it. Darn, I was half asleep last night when I was experimenting. I totally feel like a dope. It worksThanks! -Michael On Thursday, July 20, 2006, at 00:36AM, Doron Cohen <[EMAIL PROTECTED]> wrote: >> doc.add(new Field("to", >> "[EMAIL PROTECTED]", >> ... >> PrefixQu

Query does not work past 26 characters?!

2006-07-19 Thread Michael Prichard

Tell me I am totally missing something here I created an index w/ StandardAnalyzer with two fields as follows: Document doc = new Document(); doc.add(new Field("to", "[EMAIL PROTECTED]", Field.Store.YES, Field.Index.TOKENIZED)); doc.add(new Field("content", "blah3 blah3 blah3", Field.Store.Y

Building easy to use search guis? How to save queries...

2006-07-17 Thread Michael Prichard

Not sure if anyone out is doing this, thought about doing this or is just plain curious. I want to figure out a way to build a search/rule gui's whereas the user can build searches much like building rules in some mail clients such as outlook or thunderbird. For example, the user could use dro

Re: Lucene Search Performance

Re: is lucene fit for doing this?

Matching w/in X% ?

Highlighting with wildcards?

IndexWriter.deleteDocument()

Re: Sorting on tokenized fields

Re: Sorting on tokenized fields

Re: Sorting on tokenized fields

Sorting on tokenized fields

Re: Office 2007

Office 2007

Re: Threading Indexing Processes : Can we write concurrently to Index?

Threading Indexing Processes : Can we write concurrently to Index?

extracting non-english text from word, pdf, etc....??

Re: Too many open files?!

Too many open files?!

Re: Query does not work past 26 characters?!

Query does not work past 26 characters?!

Building easy to use search guis? How to save queries...

19 matches

Site Navigation

Mail list logo

Footer information