On Wed, Jun 01, 2016 at 07:23:49AM -0000, Karl Kastner wrote:
> Apparently the issue is not the umlauts (at least on my machine), but
> ligatures, &c. I've a script to rename files, but some always slip.
> Especially the incapability of the system to properly handle Russian
> file names and contents, due to different encodings, is a nuisance. And
> when processing strings in perl-scripts, it is a nightmare.

I had the impression that the Russian papers were encoded in KOI8 -- it
was odd that some of my tools showed the Cyrillic without trouble and
others showed the usual garbage on wrong encoding. Maybe recoding those
into utf8 would also help.

Almost nothing handles multiple file name encodings well; and ligatures in
filenames is an extremely unpleasant thing to find. The iconv(1) tool may
help renaming files, but I doubt it can help with ligatures.

> I was not aware of that grep switches from text to binary mode while
> parsing, and that it only does so if a grepped line contains a binary
> character. It would be good if the warning was send to stderr, so that
> it does not get lost in pipes. Anyway, I already added the alias grep
> --text to my ~/.bashrc.

Yes, I was surprised that it went to stdout too; however, it's an old
enough tool that they may not be able to make changes of that scope even
if they wanted to, for fear of what else might break.

> Just to continue the discussion, is there a similar switch for locate?
> 
> locate  comparison-of-turbulence-models
> Binary file (standard input) matches

That's a real bummer. :/ I don't know of any similar tool. If it's just
built on grep it may just take a -a flag there too... Probably it's best
to take that one up with upstream locate developers.

Thanks

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1587101

Title:
  Grep silently discards tails of long text streams

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/grep/+bug/1587101/+subscriptions

-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to