And is there some default parser to recursively index all files in archive? Can I just configure it somehow to work this way or do I have to write my own parser (out of the box it seems to index only file names in archive)?
-----Original Message----- From: Nick Burch [mailto:[email protected]] Sent: Saturday, January 05, 2013 6:00 AM To: [email protected] Subject: Re: fetching content from archives and images On 04/01/13 12:09, Maciej Liżewski wrote: > 1. does tika recursively fetch content from archives (zip, rar, etc)? If you ask it to. You need to attach the parser you want to use for recursion to the ParseContext, and it'll be called for any embedded resources. (If you want, you can give your own parser that only does it for certain source or target types, or anything else like that) Nick
