Thanks Paul. For future reference: I ended up installing a demo version of a Windows-only Pdf editor (Infix Pdf Editor) in my VirtualBox environment. The demo is actually perfectly functional---you cannot save without getting the pdf files watermarked, but saving was useless to me. You can select any text snippet and obtain font information including typeface and size (size is rounded up, which made for some interesting headscratching before I realized it. I was converting a book from TeX point sizes to Adobe points (bp).) I think that's similar to what you can do in Acrobat Pro, but I 've never tried it. And no---I did not intend to wite any Java code. I am too old for learning how to write Java.... Cheers,
Stefano On Wed, Dec 15, 2010 at 5:35 PM, Paul Rubin <ru...@msu.edu> wrote: > Not sure how much help this will be, but pdfedit can convert a PDF file to > XML > format. Here's a sample font tag from the XML output of a PDF file: > > <font basename="VYKOIK+NimbusRomNo9L-Regu" > origname="VYKOIK+NimbusRomNo9L-Regu" > embeddedfontname="VYKOIK+NimbusRomNo9L-Regu" tag="F67" serif="false" > symbolic="true" italic="false" bold="false" ascent="0.678" descent="-0.216" > writemode="horizontal" fonttype="Type1 (8bit)"> > > You can identify the font from the basename attribute, and weight/slant > from > obvious attributes. Size is tricky. The difference ascent-descent maps to > the > font size, but the conversion is not entirely obvious to me. > > I also think there are some Java code libraries out there that can parse > XML > files, including font size, but I don't know off-hand which they are (and > wasn't > sure in any case that you wanted to write Java code). > > /Paul > >