Re: UTF16 encoded string to PDFDocEncoding

2017-07-11 Thread Tilman Hausherr
fixed in https://issues.apache.org/jira/browse/PDFBOX-3864 Tilman Am 11.07.2017 um 16:06 schrieb Tilman Hausherr: The cause are "gaps" in the PDFDocEncoding specification that have been missed in the implementation. I'll create an issue later. Tilman Am 10.07.2017 um 19:22 schrieb Andrea Vac

Re: UTF16 encoded string to PDFDocEncoding

2017-07-11 Thread Tilman Hausherr
The cause are "gaps" in the PDFDocEncoding specification that have been missed in the implementation. I'll create an issue later. Tilman Am 10.07.2017 um 19:22 schrieb Andrea Vacondio: Hi, we came across this case where we are basically cloning outline items where the original outline title is

Re: UTF16 encoded string to PDFDocEncoding

2017-07-11 Thread Andrea Vacondio
I'm talking about the node dictionary, try adding this: System.out.println(node.getTitle()); On Tue, Jul 11, 2017 at 12:20 PM, Andreas Lehmkühler wrote: > > > Andreas Lehmkühler hat am 11. Juli 2017 um 12:17 > geschrieben: > > > > > > > > > Andrea Vacondio hat am 10. Juli 2017 um > 19:22 gesch

Re: UTF16 encoded string to PDFDocEncoding

2017-07-11 Thread Andreas Lehmkühler
> Andrea Vacondio hat am 10. Juli 2017 um 19:22 > geschrieben: > > > Hi, we came across this case where we are basically cloning outline items > where the original outline title is a UTF16BE encoded text string > containing the value 00A0 (non break space). We later use the string to > assign

Re: UTF16 encoded string to PDFDocEncoding

2017-07-11 Thread Andreas Lehmkühler
> Andreas Lehmkühler hat am 11. Juli 2017 um 12:17 > geschrieben: > > > > > Andrea Vacondio hat am 10. Juli 2017 um 19:22 > > geschrieben: > > > > > > Hi, we came across this case where we are basically cloning outline items > > where the original outline title is a UTF16BE encoded text s

UTF16 encoded string to PDFDocEncoding

2017-07-10 Thread Andrea Vacondio
Hi, we came across this case where we are basically cloning outline items where the original outline title is a UTF16BE encoded text string containing the value 00A0 (non break space). We later use the string to assign the title in a new outline item and the A0 is recognised as a € sign. Here is a