Messages by Thread
-
-
[jira] [Commented] (TIKA-4409) Run the 2.9.4 release process
Tim Allison (Jira)
-
[jira] [Created] (TIKA-4409) Run the 2.9.4 release process
Tim Allison (Jira)
-
[jira] [Updated] (TIKA-4409) Run the 2.9.4 release process
Tim Allison (Jira)
-
[PR] Bump protobuf.version from 3.25.6 to 3.25.7 [tika]
via GitHub
-
Re: [PR] Tika-2820: detection of Unix dump files (includes test files) [tika]
via GitHub
-
[jira] [Created] (TIKA-4408) python file identified as application/x-sh under several circumstances
Carol Alexandru (Jira)
-
[jira] [Closed] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Comment Edited] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Updated] (TIKA-4407) Docker: Could not find or load main class org.apache.tika.server.core.TikaServerCli
Alexander Skwar (Jira)
-
[jira] [Created] (TIKA-4407) Docker: Could not find or load main class org.apache.tika.server.core.TikaServerCli
Alexander Skwar (Jira)
-
[PR] TIKA-4406: add missing backslash [tika-docker]
via GitHub
-
[jira] [Resolved] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'
ASF GitHub Bot (Jira)
-
[jira] [Created] (TIKA-4406) docker-compose-tika-customocr.yml - yaml: line 22: did not find expected ',' or ']'
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4405) XWPFEventBasedWordExtractor does not support run text that is marked as capitalized
PJ Fanning (Jira)
-
[jira] [Commented] (TIKA-4244) Tika idenifies MIME type of ics files with html content as text/html
Andreas Hubold (Jira)
-
[jira] [Created] (TIKA-4405) XWPFEventBasedWordExtractor does not support run text that is marked as capitalized
PJ Fanning (Jira)
-
[PR] [MINOR] mark some fields as final [tika]
via GitHub
-
[jira] [Created] (TIKA-4402) Support jacoco
Tilman Hausherr (Jira)
-
[jira] [Created] (TIKA-4399) RUnpackExtractor -- improve stream wrapping
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4404) PDFX conformance is never used
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4404) PDFX conformance is never used
Hudson (Jira)
-
"TODO: find an example where basic.getThumbNail is not null"
Tilman Hausherr
-
[jira] [Created] (TIKA-4404) PDFX conformance is never used
Tilman Hausherr (Jira)
-
[jira] [Resolved] (TIKA-4401) Catch jempbox's NumberFormatException
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4395) cannot get any slide content for pptx file
Tim Allison (Jira)
-
[jira] [Resolved] (TIKA-4403) Implement transferTo in BoundedInputStream
Tim Allison (Jira)
-
[PR] TIKA-4395 -- improve handling and logging in container detection [tika]
via GitHub
-
[jira] [Commented] (TIKA-4403) Implement transferTo in BoundedInputStream
Hudson (Jira)
-
[jira] [Created] (TIKA-4403) Implement transferTo in BoundedInputStream
Tim Allison (Jira)
-
[jira] [Comment Edited] (TIKA-4399) RUnpackExtractor -- improve stream wrapping
Tilman Hausherr (Jira)
-
[jira] [Resolved] (TIKA-4402) Support jacoco
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4402) Support jacoco
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4402) Support jacoco
Tilman Hausherr (Jira)
-
[jira] [Comment Edited] (TIKA-4395) cannot get any slide content for pptx file
Tim Allison (Jira)
-
[PR] TIKA-4401 -- catch jempbox's numberformat exception [tika]
via GitHub
-
[jira] [Commented] (TIKA-4401) Catch jempbox's NumberFormatException
ASF GitHub Bot (Jira)
-
[jira] [Resolved] (TIKA-4399) RUnpackExtractor -- improve stream wrapping
Tim Allison (Jira)
-
[jira] [Created] (TIKA-4401) Catch jempbox's NumberFormatException
Tim Allison (Jira)
-
[PR] TIKA-4400 -- move some modules to the sandbox profile [tika]
via GitHub
-
[jira] [Commented] (TIKA-4400) Consider simplifying the build with a sandbox profile
ASF GitHub Bot (Jira)
-
[jira] [Created] (TIKA-4400) Consider simplifying the build with a sandbox profile
Tim Allison (Jira)
-
next releases?
Tim Allison
-
[jira] [Commented] (TIKA-4399) RUnpackExtractor -- improve stream wrapping
ASF GitHub Bot (Jira)
-
[PR] TIKA-4399 -- require TikaInputStream for embedded documents [tika]
via GitHub
-
[jira] [Reopened] (TIKA-4395) cannot get any slide content for pptx file
Tim Allison (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tim Allison (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
Tilman Hausherr (Jira)
-
[jira] [Commented] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
[jira] [Resolved] (TIKA-3604) Upgrade pdfbox3
Tilman Hausherr (Jira)
-
[jira] [Created] (TIKA-4398) When extracting a docx file with Tika 3.1.0, the package parser was detected instead of the OOXML parser
mannixli (Jira)
-
OpenJDK Quality Outreach: Java 24 Is Now Available
David Delabassee
-
EmbeddedDocumentExtractor or OCR module for extracting images?
Cristian Zamfir
-
[PR] Bump io.netty:netty-bom from 4.2.0.RC4 to 4.2.0.Final [tika]
via GitHub
-
[PR] Bump org.codehaus.plexus:plexus-classworlds from 2.8.0 to 2.9.0 [tika]
via GitHub
-
[PR] Bump org.ops4j.pax.url:pax-url-aether from 2.6.16 to 3.0.0 [tika]
via GitHub
-
[PR] Bump poi.version from 5.4.0 to 5.4.1 [tika]
via GitHub
-
[PR] Bump com.microsoft.graph:microsoft-graph from 6.33.0 to 6.34.0 [tika]
via GitHub
-
[PR] Bump org.apache:apache from 33 to 34 [tika]
via GitHub
-
[PR] Bump org.apache.maven.plugins:maven-failsafe-plugin from 3.5.2 to 3.5.3 [tika]
via GitHub
-
[jira] [Commented] (TIKA-4396) Copies of config files not deleted after build tests
Hudson (Jira)
-
[jira] [Commented] (TIKA-4397) Small refactorings
Hudson (Jira)
-
[jira] [Updated] (TIKA-4397) Small refactorings
Tilman Hausherr (Jira)
-
[jira] [Created] (TIKA-4397) Small refactorings
Tilman Hausherr (Jira)
-
[jira] [Resolved] (TIKA-4396) Copies of config files not deleted after build tests
Tilman Hausherr (Jira)
-
[jira] [Updated] (TIKA-4396) Copies of config files not deleted after build tests
Tilman Hausherr (Jira)
-
[jira] [Created] (TIKA-4396) Copies of config files not deleted after build tests
Tilman Hausherr (Jira)
-
[PR] Bump com.google.guava:guava from 33.4.5-jre to 33.4.6-jre [tika]
via GitHub
-
[PR] Bump joda-time:joda-time from 2.13.1 to 2.14.0 [tika]
via GitHub
-
[PR] Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.2 to 3.5.3 [tika]
via GitHub
-
[PR] Bump org.ow2.asm:asm from 9.7.1 to 9.8 [tika]
via GitHub
-
[PR] [TIKA-XXXX] Refactor core logic: extract methods, simplify conditionals, and clarify UTF-8 byte checks. [tika]
via GitHub
-
Re: [PR] [TIKA-XXXX] Refactor core logic: extract methods, simplify conditionals, and clarify UTF-8 byte checks. [tika]
via GitHub
-
Re: [PR] [TIKA-XXXX] Refactor(core): Modularize Classes, Methods, and Associations for Clarity. [tika]
via GitHub
-
Re: [PR] [TIKA-XXXX] Refactor(core): Modularize Classes, Methods, and Associations for Clarity. [tika]
via GitHub
-
Re: [PR] [TIKA-XXXX] Refactor(core): Modularize Classes, Methods, and Associations for Clarity. [tika]
via GitHub
-
Re: [PR] [TIKA-XXXX] Refactor(core): Modularize Classes, Methods, and Associations for Clarity. [tika]
via GitHub
-
Re: [PR] [TIKA-XXXX] Refactor(core): Modularize Classes, Methods, and Associations for Clarity. [tika]
via GitHub
-
Re: [PR] [TIKA-XXXX] Refactor(core): Modularize Classes, Methods, and Associations for Clarity. [tika]
via GitHub
-
Re: [PR] [TIKA-XXXX] Refactor(core): Modularize Classes, Methods, and Associations for Clarity. [tika]
via GitHub
-
[jira] [Closed] (TIKA-4395) cannot get any slide content for pptx file
james (Jira)
-
[jira] [Updated] (TIKA-4395) cannot get any slide content for pptx file
james (Jira)
-
[jira] [Commented] (TIKA-4395) cannot get any slide content for pptx file
Tim Allison (Jira)
-
[jira] [Created] (TIKA-4395) cannot get any slide content for pptx file
james (Jira)
-
Extractous
Tim Allison
-
[PR] Bump net.bytebuddy:byte-buddy from 1.17.2 to 1.17.4 [tika]
via GitHub
-
[jira] [Created] (TIKA-4394) DXF files with a comment in the first two lines are sometimes detected as text/plain
Sandro Lackner (Jira)