Re: Missing unicode information from a font.

2025-04-14 Thread NH Rao
to make stuff public, because this brings up new > risks, more support requests and also prevents us to change that API. > > Tilman > > On 12.04.2025 02:52, NH Rao wrote: > > Greetings, > > > > Thank you for the reply. I managed to get it working using reflection

Re: Missing unicode information from a font.

2025-04-11 Thread NH Rao
;) > If it doesn't work, you may have to disable the cache or use your own. > > Tilman > > On 10.04.2025 23:48, NH Rao wrote: > > Greetings, > > > > Some of the PDF files we process do not have unicode information defined > > for its type 3 fonts. I am in the

Missing unicode information from a font.

2025-04-10 Thread NH Rao
Greetings, Some of the PDF files we process do not have unicode information defined for its type 3 fonts. I am in the process of migrating ancient code (based on version 1.8 to the latest version). Since the characters are imited to ASCII characters, we dumped checksum of a glyph and character to

Re: Change requests

2025-04-05 Thread NH Rao
Greetings, Thank you for your response. I'll extract out the test cases and will work with you. Here is one to start with. I tried running PDF debugger in eclipse and noticed that tooltips are broken because of missing ICC profile ISOcoated_v2_300_bas.icc. A quick repository search showed this fi

Text stripper with the known page

2025-01-29 Thread NH Rao
Greetings, PDFBox version: 3.0.3 I am trying to get the text of a certain page. Interesting page is found by looking up outline item and finding out page corresponding to my keyword from outline. As far as I could see, text stripper class allows page limits by page number, but I don't know what

Copying text

2025-01-29 Thread NH Rao
Greetings, This is not a direct PDF box question, but hoping someone here knows the answer. I've noticed many PDF viewers such as the built-in viewer in chrome/firefox have a concept of a block when you start selecting the text. Given a visually tabular structure, many times, multi row text selec