Re: Apache PDFBox Licensing Costs for Federal Government (USDA)

2024-09-18 Thread Andreas Lehmkühler
Hi, there aren't any costs as Apache PDFBox can be used for free as long as your usage complies with the license terms. Apache PDFBox is published under the Apache License, Version 2.0. https://www.apache.org/licenses/LICENSE-2.0 Cheers Andreas Am 18.09.24 um 22:02 schrieb Stephen Midda: D

Re: Replace embedded font document-wide by PDType1Font Helvetica in PDF document

2024-11-18 Thread Andreas Lehmkühler
Hi, it sounds easier than it is. As Tilman already mentioned, those standard 14 Type 1 fonts are more or less limited to Latin text. Anything else won't work. There are additional issues to solve: - you have to deal with different well-known predefined encodings - when it comes to complex fo

[ANNOUNCE] Apache PDFBox 2.0.32 released

2024-07-24 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.32 The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.32 Int

[ANNOUNCE] Apache PDFBox 3.0.3 released

2024-08-08 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 3.0.3 The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 3.0.3 Intro

RE: creating fillable forms, possibly in/from existing PDF file?

2017-05-17 Thread Andreas Lehmkühler
> Gary Grosso hat am 18. Mai 2017 um 05:09 > geschrieben: > > > Thanks for your reply, Tilman. > > I see PDFBox allows for text field/area (single or multi-line), list box, > combo box, check box, push button, and radio button. > > Would it be reasonable to say that implementing a date pick

Re: Linearized dictionary

2017-05-22 Thread Andreas Lehmkühler
> karthick g hat am 22. Mai 2017 um 06:17 geschrieben: > > > Hi team, > > Here is the code, I am using COSName.getPDFName("Linearized"). The problem > is > > PDDocument pdDoc = PDDocument.load(new File("")); > COSDocument cosDoc = pdDoc.getDocument(); > List lObj = cosDoc.getObjects(); >

Re: Help identifying hair-lines in PDFs using PDFBox and tabula

2017-05-23 Thread Andreas Lehmkühler
> Gilad Denneboom hat am 22. Mai 2017 um 22:07 > geschrieben: > > > Hi all, > > So I'm trying to identify hair-lines in my PDFs. I came across tabula, > which seems to be able to do it, but I can't get it to quite work with my > files in the way I need it to, so any help is greatly appreciated

Re: PDPageContentStream#close() vs PDDocument#close()

2017-06-22 Thread Andreas Lehmkühler
> Thad Humphries hat am 21. Juni 2017 um 23:30 > geschrieben: > > > Is it necessary to call PDDocument#close() after calling > PDPageContentStream#close()? Does the answer apply all cases or only > certain cases? If the latter, what certain cases? > > For example, in the following code snippe

Re: catch(IOException | COSVisitorException e)

2017-06-26 Thread Andreas Lehmkühler
> Steve Carr hat am 26. Juni 2017 um 11:41 > geschrieben: > > > import java.io.IOException; > import org.apache.pdfbox.exceptions.COSVisitorException; > import org.apache.pdfbox.pdmodel.PDDocument; > import org.apache.pdfbox.pdmodel.PDPage; > > /** > * > * @author Azeem > * @Email az...@

Re: UTF16 encoded string to PDFDocEncoding

2017-07-11 Thread Andreas Lehmkühler
> Andreas Lehmkühler hat am 11. Juli 2017 um 12:17 > geschrieben: > > > > > Andrea Vacondio hat am 10. Juli 2017 um 19:22 > > geschrieben: > > > > > > Hi, we came across this case where we are basically cloning outline items > > where t

Re: UTF16 encoded string to PDFDocEncoding

2017-07-11 Thread Andreas Lehmkühler
> Andrea Vacondio hat am 10. Juli 2017 um 19:22 > geschrieben: > > > Hi, we came across this case where we are basically cloning outline items > where the original outline title is a UTF16BE encoded text string > containing the value 00A0 (non break space). We later use the string to > assign
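For readers unfamiliar with the case described in this thread: a PDF text string that starts with the byte order mark FE FF is UTF-16BE encoded. The following is only a minimal sketch using the plain JDK, not PDFBox code; the class and method names are hypothetical and chosen for illustration. It decodes such a string and shows that the value 00A0 survives as a non-breaking space.

```java
import java.nio.charset.StandardCharsets;

/** Hypothetical helper for illustration only; not part of PDFBox. */
final class Utf16TextStringDemo {

    /** Returns true if the bytes start with the UTF-16BE byte order mark FE FF. */
    static boolean hasUtf16BeBom(byte[] bytes) {
        return bytes.length >= 2
                && (bytes[0] & 0xFF) == 0xFE
                && (bytes[1] & 0xFF) == 0xFF;
    }

    /** Decodes a BOM-prefixed UTF-16BE byte sequence, skipping the two BOM bytes. */
    static String decode(byte[] bytes) {
        if (!hasUtf16BeBom(bytes)) {
            throw new IllegalArgumentException("missing FE FF byte order mark");
        }
        return new String(bytes, 2, bytes.length - 2, StandardCharsets.UTF_16BE);
    }

    public static void main(String[] args) {
        // "A", non-breaking space (00A0), "B", prefixed with the BOM
        byte[] title = {(byte) 0xFE, (byte) 0xFF, 0x00, 0x41, 0x00, (byte) 0xA0, 0x00, 0x42};
        String decoded = decode(title);
        System.out.println(decoded.length());        // prints 3
        System.out.println((int) decoded.charAt(1)); // prints 160, i.e. 0x00A0
    }
}
```

Whether a given string can afterwards be written in PDFDocEncoding depends on the characters it contains; the sketch makes no claim about how PDFBox decides that.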

Re: AW: Splitter.createNewDocument() always uses main memory only - this leads to out of memory when splitting large documents

2017-07-14 Thread Andreas Lehmkühler
You are looking at the wrong place. pdfbox-app is just a meta project to create a convenience binary of all relevant subprojects. It doesn't contain any source code. The source code you are looking for is here: https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox/2.0.7-

Re: PDFBox JPEG2000 and Tomcat

2017-07-25 Thread Andreas Lehmkühler
> Chris Gamache hat am 25. Juli 2017 um 03:10 geschrieben: > > > I also recall one thread on SO where the developer had kept the scope on the > imageio jars set to `test` as it is in PDFbox's pom. I wish it were a > contributing factor here because it is an easy fix. > > What do you know abo

Re: Regarding retrieving COSName.getPDFName(PreflightConstants.DICTIONARY_KEY_LINEARIZED

2017-07-25 Thread Andreas Lehmkühler
> karthick g hat am 25. Juli 2017 um 10:34 > geschrieben: > > > Hi team, > > Based on the analysis I have found one thing regarding Linearized PDF in > 2.0 and above versions of PDFBox. > > COSDocument cDoc = pdDoc.getDocument(); > List lObj = cDoc.getObjects(); > for (COSObject objec

Fwd: Apache in 2018 - By The Digits

2019-01-01 Thread Andreas Lehmkühler
Hi, Sally prepared some digits for 2018 and I was surprised to see one of our fellow PDFBox committers among the Top 5 committers as we are a small community compared to other ASF projects. Thanks Tilman for your ongoing efforts to improve PDFBox in the last year, the time before that and hop

Re: When will the next version from the 3.x line be available?

2023-06-27 Thread Andreas Lehmkühler
Hi, Am 27.06.23 um 15:10 schrieb Brangs, Erik: Hi, version 2.0.28 of PDFBox was released recently. Will there also be a new version from the 3.x line in the near future? First of all there will be another 2.0 release, hopefully tomorrow Andreas Lehmkühler mentioned a possible beta1

[ANNOUNCE] Apache PDFBox 2.0.29 released

2023-07-01 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.29. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.29 In

[ANNOUNCE] Apache PDFBox 3.0.0-beta1 released

2023-07-13 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of the first beta release for Apache PDFBox 3.0.0. It is available for download at: https://pdfbox.apache.org/download.html The Apache PDFBox library is an open source Java tool for working with PDF documents. This is the first b

Re: TextToPDF function removes the first char since 2.0.28

2023-07-24 Thread Andreas Lehmkühler
Hi, the question is, where does the char get lost, when creating the pdf or when extracting the text? Did you check the created pdf? Does it contain the whole text? Andreas Am 25.07.23 um 07:52 schrieb michael.a...@universa.de: Hi, the TextToPDF function worked without problems from 2.0.24

Re: TextToPDF function removes the first char since 2.0.28

2023-07-26 Thread Andreas Lehmkühler
I've run your shell script and got the same result, the first char is missing in the pdf. It seems to be related to the way you are calling TextToPDF. You are simply printing the text to the console and redirecting it to TextToPDF. I've changed that and echoed the text to a file and used that file

Re: Border / Box around images and form elements with backgrounds

2023-08-06 Thread Andreas Lehmkühler
Please provide the source pdf you used for rendering as well. Thanks in advance Andreas Am 01.08.23 um 22:30 schrieb JJ Blodgett: It looks like the attachments were stripped out of the email. I'll try to include Google doc links and hope these work: Example of bad behavior: https://drive.goo

[ANNOUNCE] Apache PDFBox 3.0.0 released

2023-08-17 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox 3.0.0. It is available for download at: https://pdfbox.apache.org/download.html The Apache PDFBox library is an open source Java tool for working with PDF documents. This is the new major release 3.0.0 of PDFBox.

Re: [ANNOUNCE] Apache PDFBox 3.0.0 released

2023-08-19 Thread Andreas Lehmkühler
Hi, @Erik thanks for the report but I guess there is a misunderstanding, see inline Am 18.08.23 um 11:32 schrieb Brangs, Erik: Hi, -Ursprüngliche Nachricht- Von: Andreas Lehmkühler [mailto:andr...@lehmi.de.INVALID] Gesendet: Freitag, 18. August 2023 07:42 An: users

[ANNOUNCE] Apache PDFBox 1.8.x End-Of-Life (EOL) Announcement

2023-08-19 Thread Andreas Lehmkühler
The Apache PDFBox Team would like to inform you that PDFBox 1.8.17 is the last release of the 1.8 branch, which has reached its end of life and will no longer be officially supported. The current community mainly maintains the 2.0.x branch and the brand new 3.0.x branch. We recommend everyone to

Re: RandomAccessReadBuffer performance issues with inputStreams in 3.0

2023-09-17 Thread Andreas Lehmkühler
Am 28.08.23 um 13:30 schrieb bnncdv: When migrating from 2.0 to 3.0 I noticed some operations were very slow, mainly the Splitter tool. With a big-ish file it would take *a lot* more memory/cpu (jdk8). What exactly are you doing? I've tried to reproduce the issue and I've been successful with re

Re: how to replace MemoryUsageSetting.setupMixed(100mb) ?

2023-10-07 Thread Andreas Lehmkühler
Am 06.10.23 um 00:07 schrieb Pados Attila: I am using something like this: PDDocument a1doc = Loader.loadPDF(new RandomAccessReadBuffer(resourceAsStream), () -> new ScratchFile(MemoryUsageSetting.setupMixed(100))); (I use it with tempFileOnly, but the rest are the same) Be aware that all o

Re: Looking for a Debugger that can show which incremental save an object belongs to

2023-10-07 Thread Andreas Lehmkühler
Am 07.10.23 um 06:43 schrieb John Lussmyer: I doubt there is a way. It's most likely that the signing code makes an MD5 checksum (or similar) of the file when it is signed. If the file is changed, checking the signature will recalculate the checksum and find that it is different. There isn't a
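The general idea in the quoted message, recording a checksum and recalculating it later to detect a change, can be sketched with the plain JDK. This is only an illustration of that idea under stated assumptions: it uses SHA-256 rather than MD5, the class and method names are hypothetical, and it does not show how PDF signatures or PDFBox actually validate a document.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

/** Hypothetical helper for illustration only; not part of PDFBox. */
final class ChecksumDemo {

    /** Computes the SHA-256 digest of the given bytes. */
    static byte[] sha256(byte[] data) {
        try {
            return MessageDigest.getInstance("SHA-256").digest(data);
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 must be supported by every Java platform implementation
            throw new IllegalStateException(e);
        }
    }

    /** Recalculates the digest of the current data and compares it with the recorded one. */
    static boolean unchanged(byte[] recordedDigest, byte[] currentData) {
        return MessageDigest.isEqual(recordedDigest, sha256(currentData));
    }

    public static void main(String[] args) {
        byte[] original = "signed content".getBytes(StandardCharsets.UTF_8);
        byte[] recorded = sha256(original);

        byte[] modified = "signed content!".getBytes(StandardCharsets.UTF_8);
        System.out.println(unchanged(recorded, original)); // prints true
        System.out.println(unchanged(recorded, modified)); // prints false
    }
}
```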

Re: empty/missing pdf content

2023-10-16 Thread Andreas Lehmkühler
Am 16.10.23 um 23:43 schrieb Pados Attila: I fixed the issue with missing input pdf file, and also re-run this test project with the most fresh 3.0.1-SNAPSHOT version. (oct 5th) So far, the character distortion remains, That isn't the most recent version. The ticket was created on Oct 7th an

Re: PII data

2023-10-16 Thread Andreas Lehmkühler
PDFBox doesn't send any information anywhere. Everything is done locally on your machine. Am 16.10.23 um 23:14 schrieb Ward Dixon: Hello, does anyone know if PDF Box sends any information outside of my network from the PDF it is creating? I'm concerned about Personal Identifiable Information

[ANNOUNCE] Apache PDFBox 2.0.30 released

2023-11-05 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.30. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.30 In

Re: Odd OCG error

2023-11-21 Thread Andreas Lehmkühler
Am 21.11.23 um 21:26 schrieb John Lussmyer: Ugh, formatting mess. For more info, this is the "addOCGs:OCG" log line just before the error message: 10:53:09.765 [etrix SwingWorker[0]] DEBUG ImposedPDFEngine - addOCGs: OCG COSDictionary{COSName{Name}:COSObject{COSNull{}};COSName{Type}:COSOb

[ANNOUNCE] Apache PDFBox 3.0.1 released

2023-11-30 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 3.0.1. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 3.0.1 Intr

Re: PDFBox 3.0.1 compile dependency on junit-jupiter

2023-12-02 Thread Andreas Lehmkühler
Hi, Am 01.12.23 um 17:14 schrieb Dan Rabe: It looks like a compile dependency on junit-jupiter snuck into the 3.0.1 release. If I look at the maven page for 3.0.0 at https://mvnrepository.com/artifact/org.apache.pdfbox/pdfbox/3.0.0, junit-jupiter is listed as a test dependency. If I look at

Re: PDFBox 3.0.1 compile dependency on junit-jupiter

2023-12-03 Thread Andreas Lehmkühler
solved, see [1] for further details. Andreas [1] https://issues.apache.org/jira/browse/PDFBOX-5722 Am 02.12.23 um 09:05 schrieb Andreas Lehmkühler: Hi, Am 01.12.23 um 17:14 schrieb Dan Rabe: It looks like a compile dependency on junit-jupiter snuck into the 3.0.1 release. If I look at the

Re: Text extraction from a certain PDF uses up multiple GB of memory

2023-12-14 Thread Andreas Lehmkühler
Looks like https://issues.apache.org/jira/browse/PDFBOX-5479 Am 13.12.23 um 14:50 schrieb Tilman Hausherr: On 13.12.2023 11:23, Brangs, Erik wrote: Hi, we ran into problems when doing text extraction from the PDF at https://d-nb.info/1312454512/34 . We were using PDFBox 3.0.0 to extract the

Re: RE: Re: [External Sender] Re: PDFBox 3.0.1 compile dependency on junit-jupiter

2024-01-10 Thread Andreas Lehmkühler
at, thank you! We’ll look forward to seeing this in the next release! --Dan From: Andreas Lehmkühler Date: Sunday, December 3, 2023 at 1:58 PM To:users@pdfbox.apache.org Subject: [External Sender] Re: PDFBox 3.0.1 compile dependency on junit-jupiter solved, see [1] for further details

Re: Importing landscape format and portrait format oriented pages into the same PDF causes PDF corruption

2024-03-02 Thread Andreas Lehmkühler
Hi, I guess I've fixed https://issues.apache.org/jira/browse/PDFBOX-5752 and the fix works for PDFBOX-5775 as well. @Fabian please give the newest SNAPSHOT build of 3.0.2 a try Andreas Am 23.02.24 um 11:43 schrieb Tilman Hausherr: On 21.02.2024 16:07, Fabian Zünd SI-Solutions Gmbh wrote: He

Re: Help with NullPointerException org.apache.io.IOUtils.LOG

2024-03-12 Thread Andreas Lehmkühler
Hi Matthew, this is a known issue with 3.0.1, see [1] for further details. The upcoming version 3.0.2 includes a fix. Unless something unforeseen happens, the new version will be available in about 2 days from now. Andreas [1] https://issues.apache.org/jira/browse/PDFBOX-5758 Am 12.03.24 um

[ANNOUNCE] Apache PDFBox 3.0.2 released

2024-03-14 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 3.0.2. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 3.0.2 Intr

Re: Type 0 font - Text extraction X PDF Debugger

2024-03-24 Thread Andreas Lehmkühler
Am 15.03.24 um 05:35 schrieb Tilman Hausherr: You are correct that it's the "fb" parts that are missing. (And some of the other tools you tried also mention this) Just adding true results in text extraction of several files no longer being correct, 433525-p1.pdf O226ORR4SMIKRGPWC6PXUYAYMSBB

[ANNOUNCE] Apache PDFBox 2.0.31 released

2024-03-24 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.31 The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.31 Int

Re: Type 0 font - Text extraction X PDF Debugger

2024-03-24 Thread Andreas Lehmkühler
for my own tests. Tilman On 24.03.2024 16:19, Andreas Lehmkühler wrote: Am 15.03.24 um 05:35 schrieb Tilman Hausherr: You are correct that it's the "fb" parts that are missing. (And some of the other tools you tried also mention this) Just adding true results in text extrac

Re: Type 0 font - Text extraction X PDF Debugger

2024-03-25 Thread Andreas Lehmkühler
Am 25.03.24 um 10:07 schrieb Tilman Hausherr: On 25.03.2024 07:48, Andreas Lehmkühler wrote: Thanks for the URLs. All of them are working with my change. See https://issues.apache.org/jira/browse/PDFBOX-5790 for further details. @Tilman Please run your tests if possible No regressions

Re: Lost xref table on two PDF merge

2024-04-04 Thread Andreas Lehmkühler
Hi, which version of PDFBox are you using? Did you save the merged pdf before you try to fix the signature? The resulting pdf should have a valid xref table. Andreas Am 04.04.24 um 15:09 schrieb František Šimon: Hello, I encounter a problem when trying to fix some problem with invalid

Re: Text extraction from a certain PDF does not seem to terminate

2024-04-06 Thread Andreas Lehmkühler
Hi, Am 03.04.24 um 15:53 schrieb Brangs, Erik: Hi, when attempting text extraction from the PDF at https://d-nb.info/1324982411/34 , either using PDFBox 3.0.0 or PDFBox 4.0.0-SNAPSHOT, the extraction uses about 1,8 GB heap memory and does not seem to terminate. I cancelled the extraction att

Re: Issue with PDFBox 3.0.0 - Unable to Extract and Add Pages

2024-04-07 Thread Andreas Lehmkühler
The issue was fixed and is part of the current 3.0.2 version of PDFBox. Andreas Am 27.02.24 um 10:11 schrieb Tilman Hausherr: Hi, It's like Fabian said. Btw neither the code here nor the different(!) code in https://stackoverflow.com/questions/78065676/ would enable anybody to reproduce suc

Re: PDFBox Loader Issue

2024-06-04 Thread Andreas Lehmkühler
Hi, all IDEs I know work similarly. Create a new project using File -> New -> Java Project, follow the instructions. Add all needed jars to your environment using Project -> Properties -> Java Build Path -> Libraries. That's it for a simple project without using any build tool like mave

Re: PDFBox Loader Issue

2024-06-05 Thread Andreas Lehmkühler
ed to PDFBox to see if I can learn anything useful. Thank you for your reply! On Wed, Jun 5, 2024, 02:17 Andreas Lehmkühler wrote: Hi, all IDEs I know are working similar. Create a new project using File -> New -> Java Project, follow the instructions Add all needed jars to your envir

Re: Pooled memory resources

2024-06-05 Thread Andreas Lehmkühler
Hi, Am 04.06.24 um 10:44 schrieb Constantine Dokolas: Hi all! I have a requirement for PDFBox memory management where a multi-threaded process that is generating PDF files (one per thread, at most) should share a certain total amount of RAM (any excess should use scratch files). This is because
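The requirement in the question, several threads sharing one total amount of RAM and falling back to scratch files for any excess, can be sketched independently of PDFBox. The class below is hypothetical and only models the bookkeeping with the plain JDK; it is not a PDFBox API and makes no claim about how PDFBox manages memory.

```java
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical shared budget for illustration only; not part of PDFBox. */
final class SharedMemoryBudget {

    private final long capacityBytes;
    private final AtomicLong usedBytes = new AtomicLong();

    SharedMemoryBudget(long capacityBytes) {
        if (capacityBytes < 0) {
            throw new IllegalArgumentException("capacity must not be negative");
        }
        this.capacityBytes = capacityBytes;
    }

    /**
     * Tries to reserve main memory for one worker thread.
     * A false result means the caller should use a scratch file instead.
     */
    boolean tryReserve(long bytes) {
        if (bytes < 0) {
            throw new IllegalArgumentException("bytes must not be negative");
        }
        while (true) {
            long used = usedBytes.get();
            if (bytes > capacityBytes - used) {
                return false;
            }
            // retry if another thread changed the counter in the meantime
            if (usedBytes.compareAndSet(used, used + bytes)) {
                return true;
            }
        }
    }

    /** Returns a previously reserved amount to the shared budget. */
    void release(long bytes) {
        usedBytes.addAndGet(-bytes);
    }

    long used() {
        return usedBytes.get();
    }

    public static void main(String[] args) {
        SharedMemoryBudget budget = new SharedMemoryBudget(100);
        System.out.println(budget.tryReserve(60)); // prints true
        System.out.println(budget.tryReserve(60)); // prints false, only 40 left
        budget.release(60);
        System.out.println(budget.tryReserve(60)); // prints true
    }
}
```

A compare-and-set loop is used here so that concurrent reservations can never exceed the capacity; a lock or a semaphore would be an equally valid choice.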

Re: simple PDF render differences, SVG and JPG

2024-06-17 Thread Andreas Lehmkühler
Hi, Am 18.06.24 um 01:10 schrieb Rich Stafford: I have a simplified PDF case that renders (using PDFBox) differently between JPG and SVG output modes. PDFBox doesn't have an SVG output mode. I guess you are using some code/tool from the Apache XMLGraphics project so that you might ask them for

Re: Content of pdf moved around

2015-01-11 Thread Andreas Lehmkühler
Hi Ray, to unsubscribe you have to write an email to users-unsubscr...@pdfbox.apache.org. See [1] for further details. BR Andreas Lehmkühler [1] http://pdfbox.apache.org/mailinglists.html > Ray Morris hat am 10. Januar 2015 um 22:48 > geschrieben: > > > Please unsubscribe r

Re: unsubscribe [SEC=UNOFFICIAL]

2015-01-12 Thread Andreas Lehmkühler
Hi James, to unsubscribe you have to write an email to users-unsubscr...@pdfbox.apache.org. See [1] for further details. BR Andreas Lehmkühler [1] http://pdfbox.apache.org/mailinglists.html > "Weatherly, James" hat am 12. Januar > 2015 um 0

Re: Error on PDDocument.load

2015-01-19 Thread Andreas Lehmkühler
u can send it to me. Sounds like something for me ;-) So, please send it directly to me. I'll share it with the other PDFBox members if necessary and if it is ok for you. BR Andreas Lehmkühler > BR > > Maruan > > Am 19.01.2015 um 12:54 schrieb Kevin Morin : > > > Hi Maruan

Re: Error on PDDocument.load

2015-01-21 Thread Andreas Lehmkühler
guess, the issue has to be something else BR Andreas Lehmkühler > Thanks for your help and for all your work. > > Kevin > > On 21/01/2015 11:54, Maruan Sahyoun wrote: > > Hi Kevin > > > > works for me - what's your Java Version? > > >

Re: [PDFBOX-2.0] Signature Issue

2015-01-29 Thread Andreas Lehmkühler
ause? What exactly did you do to sign the pdf? Did you have a look at the provided testcase [1], which demonstrates all necessary steps to sign a pdf. > Best regards > > -- > Isaías Barroso > Belo Horizonte - MG BR Andreas Lehmkühler [1] http://svn.apache.org/viewvc/pdfbox/trunk

Re: Aw: Re: Type1Glyph2D No glyph for 41 (.notdef) in font Helvetica

2015-01-29 Thread Andreas Lehmkühler
with our font mapping if the fonts aren't embedded. Besides, IMO you have to think about your font handling, especially as you're creating the pdfs yourself. It's always a bad idea not to include the used fonts, as the used reader has to map the missing fonts somehow and such a

Re: [PdfBox 2.0] Page rendered as a blank image

2015-02-03 Thread Andreas Lehmkühler
ase send it to me. > Thanks > BR > > Kevin > > - > To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org > For additional commands, e-mail: users-h...@pdfbox.apache.org > BR Andreas Lehmkühler -

Re: [PDFBOX-2.0] Signature Issue

2015-02-04 Thread Andreas Lehmkühler
e. Attachments are allowed. Either you'll upload the file to a public place or send it to me directly. BR Andreas Lehmkühler > Best regards > > On Mon, Feb 2, 2015 at 6:13 PM, Isaias Barroso > wrote: > > > Thank you, > > > > After test I'll give a

Re: Rejection of the confirmation e-mail

2015-02-12 Thread Andreas Lehmkühler
nd it to the given link, a message text isn't needed. This should lower the spam score concerning two of the given reasons: HTML_MESSAGE,MIME_HTML_MOSTLY HTH Andreas Lehmkühler > Original message headers: > > Received: from DM2PR0701MB1066.namprd07.prod.outlook.com (25.160.26.19) by &

Re: https://issues.apache.org/jira/browse/PDFBOX-2523 still present (or variation of it still present)

2015-02-16 Thread Andreas Lehmkühler
kXRefOffset(prev); >     if (fixedOffset > -1 && fixedOffset != prev) >     { >     prev = fixedOffset; >     COSDictionary trailer = > xrefTrailerResolver.getCurrentTrailer(); >  

Re: How to attach files to messages sent to users@pdfbox.apache.org?

2015-02-17 Thread Andreas Lehmkühler
Hi Alan, (most kind of) attachments are not allowed. Either attach the file(s) to the related JIRA ticket or provide it using a sharehoster/public webspace/etc. BR Andreas Lehmkühler > Alan Masters hat am 17. Februar 2015 um 11:10 > geschrieben: > > > Please could someone he

Re: https://issues.apache.org/jira/browse/PDFBOX-2523 still present (or variation of it still present)

2015-02-23 Thread Andreas Lehmkühler
Hi, I've improved the self-repair mechanism of the trunk based on Steve's report. @Steve Please give the newest trunk version/SNAPSHOT a try. Does the issue still persist? BR Andreas Lehmkühler > Steve Antoch hat am 17. Februar 2015 um 00:05 geschrieben: > > > > Andre

Re: https://issues.apache.org/jira/browse/PDFBOX-2523 still present (or variation of it still present)

2015-02-24 Thread Andreas Lehmkühler
lpful. I wish I could have > given you the files, but unfortunately, they are proprietary and we cannot > release them. :-( No need to worry, you are not the only one who is not allowed to share a specific pdf. > Best regards- > Steve BR Andreas Lehmkühler > > __

Re: [PDFBOX-2.0] PDF Size after Signature

2015-02-26 Thread Andreas Lehmkühler
EOF. In the 2.0 one, all objects are there ?! Correct, something went wrong when appending the changed objects only. It worked for me when I fixed the encryption stuff. It seems as if some recent change introduced this regression. @Isaias Which exact version/revision of the trunk are you using? B

Re: https://issues.apache.org/jira/browse/PDFBOX-2523 still present (or variation of it still present)

2015-02-26 Thread Andreas Lehmkühler
vel and see > if it worked. Since it did, I wanted to run those changes by you for your > comments. Is there any chance to get hold of a sample pdf? It would be good enough to send it via private mail to me: BR Andreas Lehmkühler > > Thanks- > Steve > >

Re: [PDFBOX-2.0] PDF Size after Signature

2015-02-27 Thread Andreas Lehmkühler
PDFBOX-2685. Saying that, r1659998 could introduce a regression. I had a quick look and maybe the changes made to COSWriter are the root cause. But we have to debug that first to be sure. BR Andreas Lehmkühler > BR > Maruan > > Am 27.02.2015 um 08:45 schrieb Andreas Lehmkühler : > &g

Re: PDFBox 2.0.0 and UTF8 chars

2015-03-02 Thread Andreas Lehmkühler
there is an issue with that specific font. Either a malformed font or a fontbox issue. BR Andreas Lehmkühler > > Tilman > > Am 01.03.2015 um 18:53 schrieb Andreas Lehmkuehler: > > Hi, > > > > Am 28.02.2015 um 11:52 schrieb Ivan Klaric: > >> Hello good PDFB

Re: Question about PDDocument.setVersion

2015-03-04 Thread Andreas Lehmkühler
t the header else set both header > and catalog so something like: > if (newVersion >= 1.4f) > { > > getDocumentCatalog().setVersion(Float.toString(newVersion)); > } > getDocument().setVersion(newVersion); > > Am I missing so

Re: Problem building the project with Eclipse and m2e

2015-03-17 Thread Andreas Lehmkühler
e of software and I guess I'm not alone. > > Agreed. But what's the alternative? :-{ I don't know any. > Best >Martin BR Andreas Lehmkühler - To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org

Re: Text removal

2015-03-24 Thread Andreas Lehmkühler
ently stuck in the PDF specifications 1.5 and really running out > of time. > > I'd so much appreciate any help or any idea on what's going on. > > Notes: > 1. I use PDFBox 1.7.1 1.7.1 is more than 2 years old (released in July 2012). I strongly recommend to use

Re: Blank page rendered with wrong xref start objid (batch 1.8)

2015-03-26 Thread Andreas Lehmkühler
or different reasons. If someone wants to do so, patches are welcome :-) BR Andreas Lehmkühler > thank you > Jerome > > > > - > To unsubscribe, e-mail: users-unsubscr...@pdfbox.apach

Re: java source in PDFBox snapshot jars?

2015-04-21 Thread Andreas Lehmkühler
ore the PDFBox jars are quite small compared to other projects, so that infra most likely won't have any headache if we put those source jars into the repo as well. I'll try to find out what we have to do to publish the sources as well, see PDFBOX-2770 BR Andreas Lehmkühler > > BR

Re: How to flatedecode and find all acroform fields in a compressed PDF

2015-05-21 Thread Andreas Lehmkühler
> PDFs. For that, I would like to thank you for your time and pointers. Maybe it's worth to try to share the file "visually" with us. Open both files (compressed and decompressed) with PDFDebugger [1] and post a screenshot of both somewhere (dropbox etc.) and share the link

Re: IllegalArgumentException when using PDType1Font.HELVETICA

2015-05-27 Thread Andreas Lehmkühler
000A sounds like line feed. Does your string contain any newline characters like CR or LF? You have to remove those, as you have to manage line breaks yourself. > Thanks :) > > > Johanneke Lamberink BR Andreas Lehmkühler
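Managing line breaks yourself, as suggested above, usually starts with splitting the string at CR and LF so that each piece can be drawn as a separate line. The following is a minimal sketch with the plain JDK (Java 8 or later for the `\R` line-break matcher); the class and method names are hypothetical, and the actual drawing calls are left out because they depend on the PDFBox version in use.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper for illustration only; not part of PDFBox. */
final class LineBreakDemo {

    /**
     * Splits text at any line break (CR LF, CR or LF among others),
     * so that none of the returned lines contains a newline character.
     */
    static List<String> splitLines(String text) {
        // limit -1 keeps trailing empty lines instead of silently dropping them
        return Arrays.asList(text.split("\\R", -1));
    }

    public static void main(String[] args) {
        for (String line : splitLines("Total:\r\n42\nEUR")) {
            // each line would be drawn separately, moving the text position in between
            System.out.println("[" + line + "]");
        }
    }
}
```

Each returned line is free of 000A and 000D, so it can be passed to the text-drawing call on its own.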

Re: Scratch files - too many files open

2015-06-03 Thread Andreas Lehmkühler
Hi, > Jesse Long hat am 3. Juni 2015 um 08:45 geschrieben: > > > On 02/06/2015 17:48, Andreas Lehmkuehler wrote: > > Hi, > > > > Am 02.06.2015 um 16:15 schrieb Jesse Long: > >> Hi All, > >> > >> Regarding PDFBOX-2301, and the use of scratch files: right now, each > >> COSStream > >> uses one o

Re: Scratch files - too many files open

2015-06-05 Thread Andreas Lehmkühler
Hi, > Jesse Long hat am 3. Juni 2015 um 13:20 geschrieben: > > > On 03/06/2015 12:46, Andreas Lehmkühler wrote: > > Hi, > > > >> Jesse Long hat am 3. Juni 2015 um 08:45 > >> geschrieben: > >> > >> > >> On 02/06/2015 17:48,

Re: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
> Alex Sviridov hat am 1. Juli 2015 um 10:16 geschrieben: > > > I want to display all page thumbnails. However I came across memory size > problem with PDFRenderer or PDDocument - I don't know which one. > > I have the following code: >    >     private PDDocument pdfDocument; >      >

Re: Re[2]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
> I am sure that the problem is in PdfBox. Please, help me. Maybe, but I'm not sure at all. Try to use the scratch file. > Среда, 1 июля 2015, 12:48 +02:00 от Andreas Lehmkühler : > > > > > >> Alex Sviridov < ooo_satu...@mail.ru > hat am 1. Juli 2015 um 10

Re: Re[4]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
rry, I don't understand you. PDFBox holds a lot of temporary data in the memory. To reduce the memory footprint one can choose to use a scratch file instead, so that some/most of that data will be held in a file. To do so, simply use another load method, e.g. load(File file, boolean useScrat

Re: Re[6]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
9) >     at java.util.zip.Inflater.inflate(Inflater.java:280) >     at org.apache.pdfbox.filter.FlateFilter.decompress(FlateFilter.java:101) >     at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:74) >     ... 20 more > > How to solve these problems? PDFBox isn't supposed to be thre

Re: Re[8]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
> > > Среда, 1 июля 2015, 13:54 +02:00 от Andreas Lehmkühler : > >> Alex Sviridov < ooo_satu...@mail.ru > hat am 1. Juli 2015 um 13:38 > >> geschrieben: > >> > >> > >> The file is here https://yadi.sk/i/Y0fTuvHmhbZiE > >Ah,

Re: Re[10]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
erefore the code to be extended is more or less complex. > Среда, 1 июля 2015, 14:08 +02:00 от Andreas Lehmkühler : > > > > > >> Alex Sviridov < ooo_satu...@mail.ru > hat am 1. Juli 2015 um 13:59 > >> geschrieben: > >> > >> > >&

Re: Bouncycastle Provider Suddenly Stopped Working when I went to PDFBox 2.0

2015-07-01 Thread Andreas Lehmkühler
Hi, > Torgeir Veimo hat am 2. Juli 2015 um 05:54 > geschrieben: > > > I recently had some unexpected bouncy castle issues and didn't realise > until I read this that the 2.0.0-SNAPSHOT actually includes the bouncy > castle class files! Will the practice be abandoned for release jars? > It will

Re: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
> John Hewson hat am 2. Juli 2015 um 06:10 geschrieben: > > > > > On 1 Jul 2015, at 07:52, Tilman Hausherr wrote: > > > >> Am 01.07.2015 um 10:16 schrieb Alex Sviridov: > >> In my application I have real time memory graphs and they show that memory > >> is very fast filled. > >> When there

Re: How to configure Maven POM to include latest SNAPSHOT of PDFbox

2015-07-07 Thread Andreas Lehmkühler
Hi, > Roberto Nibali hat am 7. Juli 2015 um 11:43 geschrieben: > > > Hi > > How do I properly set the dependencies in my Maven POM, so I can use the > latest SNAPSHOT of pdfbox? > > I tried the following (https://pdfbox.apache.org/2.0/getting-started.html), > which does not work at all: > >

[ANNOUNCE] Apache PDFBox 1.8.10 released

2015-07-22 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 1.8.10. The release is available for download at: http://pdfbox.apache.org/download.cgi See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 1.8.10 I

Re: Last commit in SVN HEAD broke the PDFWriter

2015-07-27 Thread Andreas Lehmkühler
Hi Roberto, > Roberto Nibali hat am 27. Juli 2015 um 09:28 geschrieben: > > > Dear developers > > The last commit 1692730 by lehmi, 18:36, broke the PDFWriter. The result > is: The file xx.pdf cannot be open; It may be damaged or use a file format > that Preview doesn’t recognize. The same whe

Re: Last commit in SVN HEAD broke the PDFWriter

2015-07-27 Thread Andreas Lehmkühler
> Andreas Lehmkühler wrote on 27 July 2015 at 09:37: > > > Hi Roberto, > > > Roberto Nibali wrote on 27 July 2015 at 09:28: > > > > > > Dear developers > > > > The last commit 1692730 by lehmi, 18

Re: Last commit in SVN HEAD broke the PDFWriter

2015-07-27 Thread Andreas Lehmkühler
> Roberto Nibali wrote on 27 July 2015 at 10:29: > > > Hi Andreas > > Thanks for the quick reply. > > On Mon, Jul 27, 2015 at 9:55 AM, Andreas Lehmkühler > wrote: > > > > > > > > Andreas Lehmkühler wrote on 27 July 2015 at 09

Re: Major differences between PDFTextStripper and PrintTextLocations

2015-08-10 Thread Andreas Lehmkühler
Hi Gilad, sorry for the late answer. I'm not sure what you're expecting. You are using two totally different approaches to process a PDF. PrintTextLocations provides a lot of additional information for every piece of text, which may vary from a single character up to whole words or lines of text. Co

Re: Something weird with PDFMergerUtility?

2015-08-14 Thread Andreas Lehmkühler
> Magnus Evensberget wrote on 14 August 2015 at 10:21: > > > Rolled back to the commit c343a3f and then it works. We are working with svn and I guess you are referring to the PDFBox GitHub mirror, aren't you? That git commit refers to rev 1693855. BR Andreas > > On Fri, 14 Aug

Re: Anyone know how to set up a bouncycastle?

2015-09-24 Thread Andreas Lehmkühler
Hi, > Eric Douglas wrote on 18 September 2015 at 16:53: > > > I'm trying to read a PDF using pdfbox, and on one system I get this error: > > cannot create instance of > org.bouncycastle.jcajce.provider.digest.GOST3411$Mappings > : java.security.AccessControlException: access denie

Re: Failure to close files on parse error

2015-11-02 Thread Andreas Lehmkühler
Hi, > Jesse Long wrote on 2 November 2015 at 12:26: > > > Hi All, > > The changes to PDDocument in eb83a299bbe39c2e59735aca2b39bca312c1ddc4 > were insufficient, please include attached patch. Please provide a JIRA ticket number or an svn revision as a reference. TIA, Andreas > Th

Re: Shell Can't Find pdfbox

2015-11-03 Thread Andreas Lehmkühler
Hi, > Jonathan Levi wrote on 3 November 2015 at 03:25: > > > I'm finding that shell commands to use pdfbox-app-1.8.10.jar won't work unless > the full path is used. Example: > > drj-air:Desktop jonathan$ ls /usr/local/bin/pdfb* > /usr/local/bin/pdfbox-app-1.8.10.jar > drj-air:Deskto

[ANNOUNCE] Apache PDFBox 2.0.0 RC3 released

2016-01-15 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.0 RC3. The release is available for download at: http://pdfbox.apache.org/download.cgi The extensive feedback on our second release candidate has helped us make this release candidate better again, e.g. opti

Re: Rotating a new annotation to match the page's rotation

2016-02-24 Thread Andreas Lehmkühler
Hi, > Gilad Denneboom wrote on 24 February 2016 at 09:34: > > > No one has any ideas? ... > > On Sun, Feb 21, 2016 at 12:30 AM, Gilad Denneboom > wrote: > > > Hi all, > > > > Hoping someone can help me with this issue... > > I have a tool that adds new highlight annotations to a

Re: memory consumption PDFBox 2.0.0

2016-03-01 Thread Andreas Lehmkühler
Hi, > Felix Benz-Baldas wrote on 1 March 2016 at 12:35: > > > Hello, > > we plan to use PDFBox 2.0.0 for converting PDFs to JPEG. We want to convert a > very large number of documents (more than one million). > > One question: Is it possible to control the memory consumption? When

Re: Fields and "]" + Checkboxes

2016-03-09 Thread Andreas Lehmkühler
Hi, > Al Grant wrote on 8 March 2016 at 18:57: > > > Morning All, > > I have been writing some Java with PDFBox for a few weeks now. It's been > very good so far. > > My goal is to loop through all the fields in a form, grab the values and > write the value to a corresponding field i

Re: PrintTextLocations 1.8 vs 2.0

2016-03-16 Thread Andreas Lehmkühler
Hi, > Peter Prusinowski wrote on 16 March 2016 at 09:52: > > > Good morning, > > thank you for the hints, now I am overriding showGlyph() and trying to > get the value with > > PDSimpleFont sf = (PDSimpleFont) font; > String name = sf.getEncoding().getN

Re: Spaces are ignored when reading a PDF file

2016-03-18 Thread Andreas Lehmkühler
> "Hesham G." wrote on 17 March 2016 at 11:20: > > > Andreas, > > That is very helpful. > > I can get the x location of each character using TextPosition.getX(), e.g.: > W: 102.88399 > i: 114.18165 > t: 117.660614 > h: 121.55801 > d: 133.09477 > u: 140.3994 > e: 147.60838 > > So to
