Hi, Copyright also covers databases, so we'll need to honor the license terms equally when copying file's code or detection patterns. Luckily file (from http://www.darwinsys.com/file/) comes under a BSD license, so reusing the code or data is quite simple from a licensing perspective. In fact we've already done some of that earlier, see https://github.com/apache/tika/commit/f807af0ee947affd34d84b334bbdc32c11576b2e for an example.
BR, Jukka