Re: Lucene Search Performance

2008-02-27 Thread Michael Prichard
I'm wondering if your date field's precision may be a little too much? What I mean is that you are going all the way down to seconds. Whenever you do a range query you are essentially spawning a BooleanQuery with a representation of that range. Do you really need to be that precise? I u

Re: is lucene fit for doing this?

2008-01-29 Thread Michael Prichard
I would say Lucene is a capable of helping you do this. Remember that it is a set of libraries and you have to build the functionality you need with that. BUT with a little planning and elbow grease you will be able to use it to create a great search engine. Good luck. On Jan 29, 2008,

Matching w/in X% ?

2008-01-21 Thread Michael Prichard
Say I have a field of To addresses from an email archive. I do a search and I get 10 To addresses for a single hit. Then I want to find similar email with the To addresses containing roughly 75% of those email addresses as well. How would I do this? In other words: I get a result with: To:

Highlighting with wildcards?

2008-01-18 Thread Michael Prichard
I have highlighter working out of the box but now I need to highlight wildcards (if they are there). I would imagine someone has done this on this group :) Any suggestions? Thanks! Michael - To unsubscribe, e-mail: [EMAIL

IndexWriter.deleteDocument()

2008-01-15 Thread Michael Prichard
When I run through and delete a few documents from my index, is it wise to call .flush() afterwards? Or is it better to close the index? Thanks! Michael - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-m

Re: Sorting on tokenized fields

2008-01-08 Thread Michael Prichard
rong field, that not only does not exist yet but will never exist. To prevent hidden bugs cases like this it is important to throw an exception in this case. Do you agree? On Jan 8, 2008 11:21 PM, Michael Prichard <[EMAIL PROTECTED]> wrote: Meaning that it says "field is no

Re: Sorting on tokenized fields

2008-01-08 Thread Michael Prichard
Meaning that it says "field is not indexed". Where is sortMissingLastAttribute? thanks. On Jan 8, 2008, at 4:13 PM, Ryan McKinley wrote: what do you mean by "fail"? -- there is the sortMissingLast attribute Michael Prichard wrote: ok... i should read the manual more

Re: Sorting on tokenized fields

2008-01-08 Thread Michael Prichard
yetI run a sort on BCC.I think it fails now...anyway around that? On Jan 8, 2008, at 1:22 PM, Michael Prichard wrote: Is it possible to sort on a tokenized field? For example, I break email address into pieces, i.e. [EMAIL PROTECTED] becomes [EMAIL PROTECTED] michael.prichard

Sorting on tokenized fields

2008-01-08 Thread Michael Prichard
Is it possible to sort on a tokenized field? For example, I break email address into pieces, i.e. [EMAIL PROTECTED] becomes [EMAIL PROTECTED] michael.prichard michael prichard email.com email so when sorting on this field I get some strange results. Do I need to create another field

Re: Office 2007

2007-11-08 Thread Michael Prichard
out it. -Grant On Nov 8, 2007, at 8:37 AM, Michael Prichard wrote: Hello, I know this has gone around a bit but anyone had any success with pulling text from Office 2007 files? Any recommendations? Thanks, Michael

Office 2007

2007-11-08 Thread Michael Prichard
Hello, I know this has gone around a bit but anyone had any success with pulling text from Office 2007 files? Any recommendations? Thanks, Michael - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail:

Re: Threading Indexing Processes : Can we write concurrently to Index?

2007-10-30 Thread Michael Prichard
So I should create a singe IndexWriter but allow multiple threads to use it? On Oct 30, 2007, at 12:49 AM, Karl Wettin wrote: 30 okt 2007 kl. 05.08 skrev Michael Prichard: If I multithread an indexing process can two or more processes write to the same index? Multiple threads can add

Threading Indexing Processes : Can we write concurrently to Index?

2007-10-29 Thread Michael Prichard
The subject pretty much describes the issue. If I multithread an indexing process can two or more processes write to the same index? My knowledge of Lucene says no but I am hoping that I am missing something. Also, what is a good solution? Thanks. Michael --

extracting non-english text from word, pdf, etc....??

2007-07-31 Thread Michael Prichard
I know how to do english text with POI and PDFBox and so on. Now, I want to start indexing non-english language such as french and spanish. Which extraction libs are available for me? I want to do: Excel Word PowerPoint PDF HTML RTF Thanks! Michael --

Re: Too many open files?!

2007-02-14 Thread Michael Prichard
://wiki.apache.org/jakarta-lucene/ LuceneFAQ#head-48921635adf2c968f79 36dc07d51dfb40d638b82 -Original Message- From: Michael Prichard [mailto:[EMAIL PROTECTED] Sent: Wednesday, February 14, 2007 5:02 PM To: java-user@lucene.apache.org Subject: Too many open files?! I am getting this exception

Too many open files?!

2007-02-14 Thread Michael Prichard
I am getting this exception: Exception in thread "main" java.io.FileNotFoundException: /index/_gna.f13 (Too many open files) This is happening on a SLES10 (64-bit) box when trying to index 18k items. I can run it on a much lesser SLES9 box without any issues. Any ideas?! Thanks, Michael ---

Re: Query does not work past 26 characters?!

2006-07-20 Thread Michael Prichard
ARRRGH!!! That's it. Darn, I was half asleep last night when I was experimenting. I totally feel like a dope. It worksThanks! -Michael On Thursday, July 20, 2006, at 00:36AM, Doron Cohen <[EMAIL PROTECTED]> wrote: >> doc.add(new Field("to", >> "[EMAIL PROTECTED]", >> ... >> PrefixQu

Query does not work past 26 characters?!

2006-07-19 Thread Michael Prichard
Tell me I am totally missing something here I created an index w/ StandardAnalyzer with two fields as follows: Document doc = new Document(); doc.add(new Field("to", "[EMAIL PROTECTED]", Field.Store.YES, Field.Index.TOKENIZED)); doc.add(new Field("content", "blah3 blah3 blah3", Field.Store.Y

Building easy to use search guis? How to save queries...

2006-07-17 Thread Michael Prichard
Not sure if anyone out is doing this, thought about doing this or is just plain curious. I want to figure out a way to build a search/rule gui's whereas the user can build searches much like building rules in some mail clients such as outlook or thunderbird. For example, the user could use dro