Re: Extracting text from PDF with no embedded fonts

2014-03-13 Thread Tilman Hausherr
Hi, The best would be to create an issue with JIRA and upload the file there, if it isn't confidential. Re "the latest", did you use an 1.8 version or a 2.0 version? Tilman Am 10.03.2014 21:19, schrieb Craig Strong: I have been using PDFBox to extract text from several different PDF files f

Fw: Extracting text from PDF with no embedded fonts

2014-03-13 Thread Craig Strong
I found a solution to my issue.  I was able to install the latest XPdf RPM file for AIX so I can now use pdftotext from PASE on the IBM i.  I can also adjust font manipulation on the fly with a configuration file.  This converts this PDF to text on the same system which PDFBox can't do and I d

Re: Clickable links

2014-03-13 Thread Tres Finocchiaro
Do you have the proper amount of slashes? (Assuming you do) This answer doesn't really apply to PDFBox specifically, but from my experience with local file:// URLs, is that this is an OSX security measure which block the functionality. This is just a hunch but I've had java applets blocked in S

Re: Possible PDFont documentation correction needed for getFontWidth and getFontHeight

2014-03-13 Thread Maruan Sahyoun
fixed the online documentation (as this isn’t done automatically ) BR Maruan Sahyoun Am 13.03.2014 um 19:17 schrieb Tilman Hausherr : > Corrected in PDFBOX-1984 > in the source code for the javadoc only (not for the website, someone else > wi

Clickable links

2014-03-13 Thread HQS
Hello all, I’ve successfully implemented the string extraction and reference transformation into hyperlinks, except that, the links are clickable but the files don’t open. I’ve set the URI to be « file:/ » and nothing happens. If I set an http URL instead, it works (the navigator opens to the p

Re: Possible PDFont documentation correction needed for getFontWidth and getFontHeight

2014-03-13 Thread Tilman Hausherr
Corrected in PDFBOX-1984 in the source code for the javadoc only (not for the website, someone else will need to do this, unless this is done automatically) Tilman Am 11.03.2014 18:39, schrieb Andy Hodges: I can see inside the PDFont docume