On 2014-09-12 17:57:39 -0700, Paul Eggert wrote:
> Currently, for example, the tz package <http://www.iana.org/time-zones> has
> a Make rule 'check_character_set' that verifies that the source files are
> all properly encoded.  It executes this shell command:
> 
> ! grep -nv '^.*$' file names
> 
> This relies on GNU grep's behavior that "." does not match an encoding
> error.  But it's a command that is not obvious.  It'd be simpler and clearer
> to write this:
> 
> ! grep -n '[[:error:]]' file names
> 
> if such a feature were available.

But both of these solutions have the drawback of working only in
UTF-8 locales. One may wonder whether grep is the right tool, as
"iconv -f UTF-8 -t UTF-8" can do such a check in any locale.

-- 
Vincent Lefèvre <vinc...@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)



Reply via email to