Hi Esteban,
Is there a way you can deidentify the original (slow version) so we can see the 
issue?  We are using PDFBox & PDF.js on millions of documents and I'd love to 
know if there is a way for me to detect the slowness issue and adjust the 
document for performance.  Seeing a slow version would be super-helpful to me.

Thanks,
John

-----Original Message-----
From: Esteban R <eru...@hotmail.com> 
Sent: Thursday, June 29, 2023 10:11 AM
To: users@pdfbox.apache.org
Subject: [EXTERNAL] RE: Slow rendering

Thanks for all the feedback. I understand that the pdf is complex and there is 
probably nothing to do.

I guess that the issue is somewhat related to transparency or groups too, 
because the rendering is really fast if you change the "/Group" entries by 
"/GGGGG" (so pdfbox just ignores it). See https://gofile.io/d/iWymz2


The original document (I cannot share it for privacy reasons) takes less than 5 
secs to show in Adobe Reader and more than 5 minutes to render at 72 dpi in 
pdfbox.
________________________________
De: Tilman Hausherr <thaush...@t-online.de>
Enviado: miƩrcoles, 28 de junio de 2023 18:39
Para: users@pdfbox.apache.org <users@pdfbox.apache.org>
Asunto: Re: Slow rendering

What do you mean "sequential scanning"? Text extraction is fast, but rendering 
is indeed slow.

I mentioned the website just to prove that PDFBox isn't the only one slow in 
rendering. The cause is the incredible complexity of that PDF. I suspect that 
it results in lots of intermediate images getting created.

Tilman

On 28.06.2023 20:02, William Melendez wrote:
> Seems to me we are redirecting the issue away from the original submission. 
> The program is not written to "display" anything except print to screen in a 
> stand-alone PC. No web version. The issue here is the inability to do 
> sequential scanning of each Interactive PDF. That was the issue I had sought 
> help with.
>
> I think if we can stay focused on that, we may be able to find out why the 
> problem exists. Anyways, thank you for your input.
>
> Will Melendez, MBA
> Product Development Engineer | Stratis IoT | RealPage, Inc.
> 2201 Lakeside Blvd | Richardson, TX 75082 
> william.melen...@realpage.com  |http://www.realpage.com/
> w:  1-972-810-2191
>
> -----Original Message-----
> From: Tilman Hausherr<thaush...@t-online.de>
> Sent: Wednesday, June 28, 2023 12:41 PM To:users@pdfbox.apache.org
> Subject: Re: Slow rendering
>
> [You don't often get email fromthaush...@t-online.de. Learn why this 
> is important athttps://aka.ms/LearnAboutSenderIdentification  ]
>
> CAUTION: This email originated from outside of the organization. Do not click 
> links or open attachments unless you recognize the sender and know the 
> content is safe.
>
>
> On 28.06.2023 19:24, Esteban R wrote:
>> Hello. The following pdf:https://gofile.io/d/mlmpoW  (simplification of a 
>> real life example) takes too long to render (PDFToImage or PDFDebugger) with 
>> pdfbox 2.0.28 (more than 30 secs) and less then 5 secs with other tools. I 
>> get the same results for pdfbox-app-3.0.0-alpha3.
> The file also takes a long time to render on that website, probably with 
> PDF.js. From what I see it goes very deep (but not endless), this is one of 
> the objects in that PDF:


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org

Reply via email to