On Tue, May 25, 2010 at 3:09 PM, Ted Harding
<ted.hard...@manchester.ac.uk> wrote:
> It is OpenOffice 2.0 (as updated to openoffice.org-core,
> dfsg.2-7etch9 Sat Jan 16 2010) running on Linux (Debian Etch,
> originally installed Sept 2007), so none of it is particularly
> recent. The command to view the files is like:
>
>  oocalc Prostatic_Neoplasm-miRNAs.DiseaseTargets.xls

I just tried running those HTML fragments through 'html2text'. The
output comes out quite neatly, although the alignment may get messed
up by your font settings when viewed in an email:

$ html2text Prostatic_Neoplasm-miRNAs.DiseaseTargets.xls
Input recoding failed due to invalid input sequence. Unconverted part
of text follows.
�
|________________|Name  _______|________________|____________|_____________|
|Prostatic       |hsa-miR-222    |hsa-mir-222     |X           |19351827     |
|Neoplasms_______|_______________|________________|____________|_____________|
|Prostatic       |hsa-miR-106a   |hsa-mir-106a    |X           |16461460     |
|Neoplasms_______|_______________|________________|____________|_____________|
|Prostatic       |hsa-miR-221    |hsa-mir-221     |X           |19107213     |
|Neoplasms_______|_______________|________________|____________|_____________|

etc etc. At least all the HTML tagging is gone.

Even better, doing html2text -width 500 puts each record on one line,
with the fields separated by | and padded out with _ chars.

Barry

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
