On Wed, Nov 2, 2011 at 6:27 AM, pacopyc <paco...@gmail.com> wrote:
> Hi, I have about 10000 files .doc and I want know the program used to
> create them: writer? word? abiword? else? I'd like develop a script
> python to do this. Is there a module to do it? Can you help me?
>

Technically, you can't find out just from the file what it was that
created it. But if you mean "figure out what type of file each one is"
(eg recognize an ODF, a PDF, a DOC, etc), then the easiest way is to
read in the first few bytes of the file and look for well-known magic
numbers[1]. As Dave says, Linux comes with a command that does exactly
that (and a bit more), called 'file'.

ChrisA
[1] http://en.wikipedia.org/wiki/Magic_number_(programming)
-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to