Hi, On Wed, Nov 17, 2010 at 01:42:40AM -0800, James Stroud wrote: > http://onlinelibrary.wiley.com/doi/10.1002/pmic.200700038/suppinfo > > You'll see in the available PDF file Tables S1-S3. Were I to look for any > significant amount of time, I could find much more egregious examples. > > For this particular example, your eyes may deceive you into thinking that the > PDF file can be parsed and the data represented in the tables extracted with > a script of some sort. But, if you have the patience, go to Table S3 and > start selecting text at "Accession Number" in the heading. You'll find that > the selection goes down that column only about half way and then begins > selecting at the next column, "Swissprot Identifier".
Pick a better PDF viewer: with my version of xpdf (on Ubuntu 10.04) I can easily select that table over three pages and get a reasonably good looking ASCII representation of it. Takes about 10 seconds ... Acrobat reader is not very good for selecting text in PDF files. I don't know about others, but xpdf is really good at it. Cheers Clemens -- *************************************************************** * Clemens Vonrhein, Ph.D. vonrhein AT GlobalPhasing DOT com * * Global Phasing Ltd. * Sheraton House, Castle Park * Cambridge CB3 0AX, UK *-------------------------------------------------------------- * BUSTER Development Group (http://www.globalphasing.com) ***************************************************************