[ https://issues.apache.org/jira/browse/JSPWIKI-469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16802221#comment-16802221 ]
Juan Pablo Santos Rodríguez commented on JSPWIKI-469:
-----------------------------------------------------

Hi Ulf,

thanks a lot for your contribution! :-)

As noted in the comments above and yesterday on the ML, Apache Tika is a fairly large dependency (55MB), so maybe we could have the attached provider as another Maven module in the build, not included in the WAR by default. Using it would then involve setting the appropriate value in {{jspwiki[-custom].properties}} and bringing the dependency into the WAR. With appropriate instructions on how to install it, that shouldn't be a big deal.

Sounds reasonable?

best regards,
juan pablo

> Enhance LuceneSearchProvider for other Attachments
> ---------------------------------------------------
>
>                 Key: JSPWIKI-469
>                 URL: https://issues.apache.org/jira/browse/JSPWIKI-469
>             Project: JSPWiki
>          Issue Type: Improvement
>            Reporter: NicolaFischer
>            Assignee: Florian Holeczek
>            Priority: Minor
>             Fix For: FutureVersion
>
>         Attachments: TikaSearchProvider.java, patch.txt
>
>
> LuceneProvider should index more file types than only plain text. This is one
> attempt to index PDF files.
> Required jars:
> * [Apache POI|http://ftp.tpnet.pl/vol/d1/apache/poi/release/bin] (not tested
> with 3.0.1 final)
> * [PDFBox|http://www.pdfbox.org]
> * [FontBox|http://www.fontbox.org]
> * [OpenDocumentTextInputStream|http://books.evc-cit.info/odf_utils/index.html]
> Patch attached for 2.8.1
> Maybe we should check how to index more documents.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)