On Wed, 26 May 2010 18:38:47 -0700, Gary Kline <kl...@thought.org> wrote:
> 
> 
> guys,
> 
> is there anything that can take these hex triplets such as
> 
> We Don\xe2\x80\x99t
> 
> and render them back to the ascii or keyboard equivalents?
> in this case, the \x99 would be an apostrophe.
> thus:
> 
> 
> We Don't
> 
> tia,
> 
> gsry
> 
> ps: even lynx -dump messes up, i believe.  i'm trying to go from
> DOC  back to typewriter.... 


Yes, even a typewriter is better than DOC. :-)

To process DOC files into ASCII, there are several ways, with
different complexity:

Most complex ones: Use OpenOffice or Abiword, open the file and
save it as ASCII. Included "special characters" should be in
regular ASCII representation now.

Better: Use (from ports) catdoc or antiword.

I'm not sure in how far conflicting codepages may be involved.
It is known that "Windows" does have problems supporting standards,
and this applies to character sets and language variations, too.



-- 
Polytropon
Magdeburg, Germany
Happy FreeBSD user since 4.0
Andra moi ennepe, Mousa, ...
_______________________________________________
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "freebsd-questions-unsubscr...@freebsd.org"

Reply via email to