The short answer is: not without writing some code. The longer answer is:
To search PDFs in RiakSearch you must write your own extractor[1], which requires writing some Erlang, and use some PDF extraction library. Or if you already have the text handy, manually index the data yourself using update commands. Another option, is to take a look at Yokozuna[2]. Since it uses Solr to index, it is possible. We have an issue[3] to integrate SolrCell. Again, this will require some coding, but potentially less. Or, write a yokozuna extractor. The community would certainly appreciate the help. [1]: http://docs.basho.com/riak/1.3.1/cookbooks/Riak-Search---Indexing-and-Querying-Riak-KV-Data/#Other-Data-Encodings [2]: http://github.com/basho/yokozuna [3]: https://github.com/basho/yokozuna/issues/17 On Jun 14, 2013, at 11:55 AM, Wagner Caixeta Rodrigues <wag...@assembleia.go.gov.br> wrote: > Hi Everyone, > > At 1st i want to say, Riak is a amazing project and i love open source. > > I was searching some kind of storage api to make attached files cross apps > and found Riak. > > In the other way i need full text search too. > > So Riak looks like to fits perfectly. > > But i have one question: > > What is the best way to store "searchable PDFs" in Riak and allow > full-text-search. Will i need to extractthe text from PDFs in my Ruby App and > store the text in Riak too? > > Looks like i need to have one bucket with the PDFs files and other with the > same key but in clear text. Ami right ? > > Would be perfect is could be possible just search directly in the PDFs files, > natively or using some kind ofplugin. > > Tks. > > > -- > Wagner Caixeta Rodrigues > Assembleia Legislativa do Estado de Goiás > Seção de Administração de Dados > (62) 3221-3255 / (62) 8412-6415 > > _______________________________________________ > riak-users mailing list > riak-users@lists.basho.com > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
_______________________________________________ riak-users mailing list riak-users@lists.basho.com http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com