Rosenbaum, Larry M. wrote:

We can use antiword to render text from MSWord files, and unrtf to render text 
from RTF files.  What is the best tool to render text from PDF files?

I don't know what the best tool is, but I'm currently using pdftohtml in XML mode (and then stripping the XML) in my ExtractText plugin.

(For more info about the plugin, see my post with subject "ExtractText plugin", or download it from <http://whatever.frukt.org/graphdefang/ExtractText.zip>).

Regards
/Jonas
--
Jonas Eckerman
Fruktträdet & Förbundet Sveriges Dövblinda
http://www.fsdb.org/
http://www.frukt.org/
http://whatever.frukt.org/

Reply via email to