Expanding 25+ GB per day is not ideal. If its possible to index the binary directly, as it sounds like it might, we'll just do that.
I think what I was missing was - I didn't see AbstractField which seems like it has the stuff I need (if indeed Field is used as I assume it is) ./paul Sent from my Verizon Wireless BlackBerry -----Original Message----- From: Shashi Kant <shashi_k...@yahoo.com> Date: Fri, 30 Jan 2009 09:38:16 To: <java-user@lucene.apache.org> Subject: Re: indexing binary files? Unless I am missing something, not sure I see the issue here. You can convert to Base64 purely for indexing purposes and leave the original binary as-is. ----- Original Message ---- From: Paul Feuer <paul...@gmail.com> To: Lucene User List <java-user@lucene.apache.org>; Shashi Kant <sk...@sloan.mit.edu> Sent: Friday, January 30, 2009 10:12:33 AM Subject: Re: indexing binary files? The binary events in the file are parsable by both our java server-side processes and the clients of these processes, so we need to keep the data in the binary format. ../paul Sent from my Verizon Wireless BlackBerry -----Original Message----- From: Shashi Kant <shashi_k...@yahoo.com> Date: Fri, 30 Jan 2009 06:32:19 To: <java-user@lucene.apache.org> Subject: Re: indexing binary files? Hi Paul, have you tried persisting the binaries in Base64 format and then indexing them? As you are aware, Base64 is a robust representation used in email attachments for example. Thanks Shashi ----- Original Message ---- From: Paul Feuer <paul...@gmail.com> To: java-user@lucene.apache.org Sent: Thursday, January 29, 2009 10:43:36 PM Subject: indexing binary files? Hi - I've looked on the FAQ, the Java Docs, and searched a little in google, but haven't been able to figure out if Lucene can index binary files. Our binary files can get up into the 20-30 gigabyte range. If it is possible, anyone have any pointers to what interfaces I should look at? Thanks, .../paul --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org