The short answer is: not without writing some code.

The longer answer is:

To search PDFs in Riak Search you must write your own extractor[1], which 
means writing some Erlang and calling a PDF text-extraction library. Or, if 
you already have the text handy, you can index the data yourself using update 
commands.
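
As a rough illustration of the "already have the text" route, here is a 
minimal Ruby sketch of the two-bucket layout you describe: the raw PDF goes 
in one bucket and the extracted text goes in another under the same key. 
Everything named here is an assumption for illustration: MemoryStore is an 
in-memory stand-in and not the riak-client API, the bucket names "pdfs" and 
"pdf_text" are made up, and the extractor is a placeholder for whatever PDF 
library you choose.

```ruby
# In-memory stand-in for a key/value store (hypothetical, NOT the Riak client).
class MemoryStore
  attr_reader :objects

  def initialize
    @objects = {}
  end

  # Save a value and its content type under a bucket/key pair.
  def put(bucket, key, value, content_type)
    @objects[[bucket, key]] = { value: value, content_type: content_type }
  end

  # Fetch a stored object; raises KeyError if it does not exist.
  def get(bucket, key)
    @objects.fetch([bucket, key])
  end
end

# Store the PDF and its extracted text under the same key in two buckets.
# `extractor` is any callable that turns PDF bytes into a String.
def store_searchable_pdf(store, key, pdf_bytes, extractor)
  text = extractor.call(pdf_bytes)
  store.put("pdfs", key, pdf_bytes, "application/pdf")
  store.put("pdf_text", key, text, "text/plain")
  text
end

# Placeholder extractor; swap in a real PDF text-extraction library here.
placeholder_extractor = ->(bytes) { "extracted text (#{bytes.bytesize} bytes of PDF)" }

store = MemoryStore.new
store_searchable_pdf(store, "report-2013", "%PDF-1.4 ...", placeholder_extractor)
```

With a real client, only the text bucket would need search enabled. A search 
hit then gives you the key, which you use to fetch the PDF from the other 
bucket.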

Another option is to take a look at Yokozuna[2]. Since it uses Solr for 
indexing, searching PDFs is possible. We have an issue[3] to integrate 
SolrCell. Again, this will require some coding, but potentially less. Or you 
could write a Yokozuna extractor. The community would certainly appreciate 
the help.


[1]: http://docs.basho.com/riak/1.3.1/cookbooks/Riak-Search---Indexing-and-Querying-Riak-KV-Data/#Other-Data-Encodings
[2]: http://github.com/basho/yokozuna
[3]: https://github.com/basho/yokozuna/issues/17


On Jun 14, 2013, at 11:55 AM, Wagner Caixeta Rodrigues 
<wag...@assembleia.go.gov.br> wrote:

> Hi Everyone,
> 
> First, I want to say that Riak is an amazing project and I love open source.
> 
> I was searching for some kind of storage API to share attached files across 
> apps, and found Riak.
> 
> I also need full-text search.
> 
> So Riak looks like a perfect fit.
> 
> But i have one question:
> 
> What is the best way to store "searchable PDFs" in Riak and allow 
> full-text search? Will I need to extract the text from the PDFs in my Ruby 
> app and store the text in Riak too?
> 
> It looks like I need one bucket with the PDF files and another with the 
> same keys but in clear text. Am I right?
> 
> It would be perfect if it were possible to search directly in the PDF files, 
> natively or using some kind of plugin.
> 
> Tks.
> 
> 
> --
> Wagner Caixeta Rodrigues
> Assembleia Legislativa do Estado de Goiás
> Seção de Administração de Dados
> (62) 3221-3255 / (62) 8412-6415
> 
> _______________________________________________
> riak-users mailing list
> riak-users@lists.basho.com
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
