Hello, I need to determine programmatically a file type from its content/extension (much like the "file" UNIX command line utility)

I searched for a suitable Python library module, with little luck. Do you know something useful ?

Are you looking for something comprehensive? Or are you just looking for particular file-types that your application can handle? I'd start with the python mimetypes library[1] which does detection based on extensions (to which you can add your own mappings). For sniffing by content, there are a wide variety of document types, I don't know of any pre-existing library. The first couple bytes can often tell you something, but you'd have to go digging into the source for "file" to see what it does.

-tkc

[1]
http://docs.python.org/library/mimetypes.html



--
http://mail.python.org/mailman/listinfo/python-list

Reply via email to