[jira] [Updated] (PDFBOX-6020) mix of subscript and superscript can lead to unnecessary new lines during text extraction

2025-06-13 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-6020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-6020: Attachment: image-2025-06-13-11-08-06-836.png > mix of subscript and superscript can lead

[jira] [Commented] (PDFBOX-6020) mix of subscript and superscript can lead to unnecessary new lines during text extraction

2025-06-13 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-6020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17970197#comment-17970197 ] Tilman Hausherr commented on PDFBOX-6020: - The OCR itself isn't really good here

[jira] [Updated] (PDFBOX-6020) mix of subscript and superscript can lead to unnecessary new lines during text extraction

2025-06-13 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-6020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-6020: Attachment: image-2025-06-13-11-10-27-514.png > mix of subscript and superscript can lead

[jira] [Commented] (PDFBOX-6020) mix of subscript and superscript can lead to unnecessary new lines during text extraction

2025-06-13 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-6020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17972763#comment-17972763 ] Tilman Hausherr commented on PDFBOX-6020: - Nevertheless your change makes sense,

[jira] [Updated] (PDFBOX-6020) mix of subscript and superscript can lead to unnecessary new lines during text extraction

2025-06-13 Thread Markus Seifert (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-6020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Seifert updated PDFBOX-6020: --- Attachment: DE102016007628A1-Page2.pdf > mix of subscript and superscript can lead to unnece

[jira] [Commented] (PDFBOX-6020) mix of subscript and superscript can lead to unnecessary new lines during text extraction

2025-06-13 Thread Markus Seifert (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-6020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17968811#comment-17968811 ] Markus Seifert commented on PDFBOX-6020: Just saw, that the PDF is split into si