New submission from disconnect3d <dominik.b.czarn...@gmail.com>:
The mimetype builtin module allows users to guess extension for a given mimetype through the `mimetypes.guess_extension` function. Default mimetypes are stored in `types_map` and `_types_map_default` dictionaries that maps extensions to mimetypes. Those dictionaries are created by `_default_mime_types` function in `cpython/Lib/mimetypes.py`. If a given extension have more than one mimetype, this information is lost. This happens currently for ".bmp" extension in CPython's codebase. This can be seen in the linked code below: https://github.com/python/cpython/blob/110a47c4f42cf4db88edc1876899fff8f05190fb/Lib/mimetypes.py#L490-L502 Here is an example in an interactive IPython session: ``` In [1]: import mimetypes In [2]: mimetypes.guess_extension('image/bmp') Out[2]: '.bmp' In [3]: mimetypes.guess_extension('image/x-ms-bmp') In [4]: ``` The issue has been found by using Semmle's LGTM: https://lgtm.com/projects/g/python/cpython/snapshot/d099f261c762ac81042e47b530d279f932d89e09/files/Lib/mimetypes.py?sort=name&dir=ASC&mode=heatmap PS / offtopic / loud thinking: Maybe there should be a debug build of CPython that would detect such key overwrites during dicts initialisation and warn about them? ---------- components: Library (Lib) messages: 347562 nosy: Dominik Czarnota priority: normal severity: normal status: open title: Mimetype module duplicates type: behavior versions: Python 2.7, Python 3.5, Python 3.6, Python 3.7, Python 3.8, Python 3.9 _______________________________________ Python tracker <rep...@bugs.python.org> <https://bugs.python.org/issue37529> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com