On Tue, May 25, 2010 at 3:09 PM, Ted Harding <ted.hard...@manchester.ac.uk> wrote: > It is OpenOffice 2.0 (as updated to openoffice.org-core, > dfsg.2-7etch9 Sat Jan 16 2010) running on Linux (Debian Etch, > originally installed Sept 2007), so none of it is particularly > recent. The command to view the files is like: > > oocalc Prostatic_Neoplasm-miRNAs.DiseaseTargets.xls
I just tried running those HTML fragments through 'html2text'. Comes out quite neatly, although this may get messed up by your font settings in an email page: $ html2text Prostatic_Neoplasm-miRNAs.DiseaseTargets.xls Input recoding failed due to invalid input sequence. Unconverted part of text follows. � |________________|Name _______|________________|____________|_____________| |Prostatic |hsa-miR-222 |hsa-mir-222 |X |19351827 | |Neoplasms_______|_______________|________________|____________|_____________| |Prostatic |hsa-miR-106a |hsa-mir-106a |X |16461460 | |Neoplasms_______|_______________|________________|____________|_____________| |Prostatic |hsa-miR-221 |hsa-mir-221 |X |19107213 | |Neoplasms_______|_______________|________________|____________|_____________| etc etc. At least all the HTML tagging is gone. Even better, doing html2text -width 500 puts each record on one line, separated by | and padded out with _ chars. Barry ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.