On Wed, Sep 23, 2009 at 02:37:21PM -0400, James Michael Fultz wrote:
> [ ... ]
> > When I download manual.txt.gz with wget and then unzip it and
> > vim the result, I again see alternating valid character and
> > <##> (I have "set display+=uhex in my .vimrc). This may very
> > well be a configuration problem on my side, that I don't see
> > non-utf8 files correctly.
>
> Try less and 'col -b manual.txt | less' if the former doesn't
> display cleanly. The col command will strip embedded backspace
> sequences.

That worked, thanks for the tip (although col complained "Invalid or
incomplete multibyte or wide character").

Also, when I zcat the downloaded gz and run it through col, the output
is truncated at exactly the same point (right after "Mutt is
Copyright"). If, however, I zcat the file directly to less without col,
everything displays correctly, with the bolding, and in its entirety,
all the way to the end of chapter 10.

Perhaps there is some character in manual.txt which is causing
truncation somewhere, perhaps server-side, or perhaps in wget, less,
and firefox (for the uncompressed manual.txt). Maybe some library on
our systems, common to all of those, does not like some special
character? Just a guess.

-- 
Noah Sheppard
Assistant Computer Resource Manager
Taylor University CSE Department
nshep...@cse.taylor.edu
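
P.S. For anyone who wants to rule out col's multibyte handling, here is a rough sketch of stripping the backspace sequences at the byte level instead. The function name strip_overstrike is made up for illustration, and it assumes the bold/underline in manual.txt is the usual nroff-style "character, backspace, character" encoding. It has not been tried against the actual file.

```shell
# Hypothetical helper (not part of mutt or col): remove overstrike pairs,
# i.e. any single byte that is immediately followed by a backspace.
# nroff-style output encodes bold as "X\bX" and underline as "_\bX".
strip_overstrike() {
    bs=$(printf '\b')            # a literal backspace byte (0x08)
    LC_ALL=C sed "s/.$bs//g"     # C locale: treat the input as raw bytes
}

# Example: bold "Mu" followed by underlined "tt".
printf 'M\bMu\bu_\bt_\bt\n' | strip_overstrike
# prints: Mutt
```

Something like 'zcat manual.txt.gz | strip_overstrike | less' would then take col out of the pipeline. Because sed runs in the C locale here, it should not stop on a byte sequence that is invalid in the current locale. The flip side is that an overstruck multibyte character would not be cleaned up properly, so this is only a diagnostic aid.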