On Wed, Nov 2, 2011 at 6:27 AM, pacopyc <paco...@gmail.com> wrote: > Hi, I have about 10000 files .doc and I want know the program used to > create them: writer? word? abiword? else? I'd like develop a script > python to do this. Is there a module to do it? Can you help me? >
Technically, you can't find out just from the file what it was that created it. But if you mean "figure out what type of file each one is" (eg recognize an ODF, a PDF, a DOC, etc), then the easiest way is to read in the first few bytes of the file and look for well-known magic numbers[1]. As Dave says, Linux comes with a command that does exactly that (and a bit more), called 'file'. ChrisA [1] http://en.wikipedia.org/wiki/Magic_number_(programming) -- http://mail.python.org/mailman/listinfo/python-list