CLucene is one possibility: http://sourceforge.net/projects/clucene/
Since you are asking in the PostgreSQL group, why not use the built-in full text search:
http://www.postgresql.org/docs/8.4/static/textsearch.html

From: pgsql-general-ow...@postgresql.org [mailto:pgsql-general-ow...@postgresql.org] On Behalf Of Sandeep Srinivasa
Sent: Thursday, August 19, 2010 10:11 PM
To: Miguel Vaz
Cc: pgsql-general@postgresql.org
Subject: Re: [GENERAL] searchable book database

If you don't ever need to return the complete book text to a user (that is, you need the book text only for your search indexes), then keep the text on file and use Apache Solr to index it.

regards
Sandeep

On Fri, Aug 20, 2010 at 1:05 AM, Miguel Vaz <pagong...@gmail.com> wrote:

Hi,

I need to make a database of books: several specific-subject books that are to be searchable. Is it viable to have the complete book text in a database and search inside it? Or should I consider keeping only its metadata (name, author, filename, etc.) in the DB, keeping the book file on the HD, and using some sort of search algorithm on the file?

If you agree on the second option, what would you guys suggest for text file searching? It's for a web project, so how could I go about doing this? (PHP, Python...)

Thanks.

MV
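
For what it's worth, here is a rough sketch of what the built-in full text search option suggested above might look like from Python. Everything specific here is an assumption made for illustration: the table name books, its columns, the index name, the 'english' text search configuration and the helper build_search are not from the thread. The SQL uses only the stock to_tsvector, plainto_tsquery and ts_rank functions and the @@ match operator. Note that a tsvector has a size limit (about 1 MB per the docs), so very long books may need to be stored per chapter rather than as one row.

```python
# Hypothetical schema: one row per book, full text kept in "body".
CREATE_TABLE_SQL = """
CREATE TABLE books (
    id     serial PRIMARY KEY,
    title  text NOT NULL,
    author text,
    body   text NOT NULL
);
"""

# Expression index so a search does not have to re-parse every book.
# The indexed expression must match the one used in the WHERE clause.
CREATE_INDEX_SQL = """
CREATE INDEX books_body_fts_idx
    ON books USING gin (to_tsvector('english', body));
"""

# Ranked search; %s placeholders are the DB-API "format" paramstyle,
# which is what psycopg2 expects.
SEARCH_SQL = """
SELECT id, title, author,
       ts_rank(to_tsvector('english', body), query) AS rank
FROM books, plainto_tsquery('english', %s) AS query
WHERE to_tsvector('english', body) @@ query
ORDER BY rank DESC
LIMIT %s;
"""


def build_search(terms, limit=10):
    """Return (sql, params) ready to hand to cursor.execute()."""
    terms = terms.strip()
    if not terms:
        raise ValueError("search terms must not be empty")
    # The user's words travel as a bound parameter, never pasted into SQL.
    return SEARCH_SQL, (terms, limit)
```

With a driver such as psycopg2 this would be used roughly as cur.execute(*build_search("steam engine")) followed by cur.fetchall(). Passing the search words as parameters leaves the quoting to the driver, which matters for a web project where the words come straight from a form.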