dev
Thread
Date
Earlier messages
Messages by Thread
[jira] [Commented] (TIKA-4678) Create GitHub Action automation to publish tika-helm to Artfactory on each merge to main branch
ASF GitHub Bot (Jira)
[PR] TIKA-4678 Create GitHub Action automation to publish tika-helm to Artfactory on each merge to main branch [tika-helm]
via GitHub
[jira] [Created] (TIKA-4678) Create GitHub Action automation to publish tika-helm to Artfactory on each merge to main branch
Lewis John McGibbney (Jira)
[jira] [Commented] (TIKA-4677) Migrate tika-helm Helm Chart tooling from belitre/helm-push-artifactory-plugin to jfrog cli
ASF GitHub Bot (Jira)
[PR] TIKA-4677 Migrate tika-helm Helm Chart tooling from belitre/helm-push-artifactory-plugin to jfrog cli [tika-helm]
via GitHub
[jira] [Created] (TIKA-4677) Migrate tika-helm Helm Chart tooling from belitre/helm-push-artifactory-plugin to jfrog cli
Lewis John McGibbney (Jira)
[jira] [Commented] (TIKA-4676) Add inference and vlm resources to tika-server-standard and tika-app
Hudson (Jira)
[jira] [Commented] (TIKA-4674) Add a progress timeout feature
Hudson (Jira)
[jira] [Created] (TIKA-4676) Add inference and vlm resources to tika-server-standard and tika-app
Tim Allison (Jira)
[jira] [Commented] (TIKA-4675) Add an encoding detector for UTF-16 and UTF-32
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4675) Add an encoding detector for UTF-16 and UTF-32
Hudson (Jira)
[PR] TIKA-4675 -- improve wide unicode detection [tika]
via GitHub
[jira] [Created] (TIKA-4675) Add an encoding detector for UTF-16 and UTF-32
Tim Allison (Jira)
[jira] [Created] (TIKA-4674) Add a progress timeout feature
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4497) Allow per parse timeouts via ParseContext in tika-pipes
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4647) Use argfile for PipesServer
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4670) Improve early crash detection in pipesserver in 4.x
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4666) Add VLM/modern OCR options parsers in 4.x
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4669) Improve runtime serialization updates in 4.x
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4668) Simplify version with maven $revision in 4.x
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4667) Add tess4j wrapper in 4.x
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4664) Add poppler renderer and remove mutool rendering wrapper in 4.x
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4663) Switch default handler type to markdown in 4.x
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4665) Add chunking and inference handling poc in 4.x
Tim Allison (Jira)
[PR] upgrade okhttp [tika]
via GitHub
Re: [PR] upgrade okhttp [tika]
via GitHub
[jira] [Commented] (TIKA-4673) Add a parser that's a hook for Jina Reader in 4.x
ASF GitHub Bot (Jira)
[PR] TIKA-4673 -- add a parser to wrap Jina Reader [tika]
via GitHub
[jira] [Created] (TIKA-4673) Add a parser that's a hook for Jina Reader in 4.x
Tim Allison (Jira)
[PR] Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.4 to 3.5.5 [tika]
via GitHub
Re: [PR] Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.4 to 3.5.5 [tika]
via GitHub
[PR] Bump org.springframework:spring-context from 7.0.4 to 7.0.5 [tika]
via GitHub
Re: [PR] Bump org.springframework:spring-context from 7.0.4 to 7.0.5 [tika]
via GitHub
[PR] Bump org.codehaus.mojo:flatten-maven-plugin from 1.6.0 to 1.7.3 [tika]
via GitHub
Re: [PR] Bump org.codehaus.mojo:flatten-maven-plugin from 1.6.0 to 1.7.3 [tika]
via GitHub
[PR] Bump org.apache.maven.plugins:maven-failsafe-plugin from 3.5.4 to 3.5.5 [tika]
via GitHub
Re: [PR] Bump org.apache.maven.plugins:maven-failsafe-plugin from 3.5.4 to 3.5.5 [tika]
via GitHub
[PR] Bump com.nimbusds:nimbus-jose-jwt from 10.7 to 10.8 [tika]
via GitHub
Re: [PR] Bump com.nimbusds:nimbus-jose-jwt from 10.7 to 10.8 [tika]
via GitHub
[PR] Bump org.jetbrains.kotlin:kotlin-stdlib-jdk7 from 1.9.10 to 2.3.10 [tika]
via GitHub
Re: [PR] Bump org.jetbrains.kotlin:kotlin-stdlib-jdk7 from 1.9.10 to 2.3.10 [tika]
via GitHub
[PR] Bump commonmark.version from 0.24.0 to 0.27.1 [tika]
via GitHub
Re: [PR] Bump commonmark.version from 0.24.0 to 0.27.1 [tika]
via GitHub
[PR] Bump org.jetbrains.kotlin:kotlin-stdlib-common from 1.9.10 to 2.0.21 [tika]
via GitHub
Re: [PR] Bump org.jetbrains.kotlin:kotlin-stdlib-common from 1.9.10 to 2.0.21 [tika]
via GitHub
[PR] Bump org.apache.kafka:kafka-clients from 4.1.1 to 4.2.0 [tika]
via GitHub
Re: [PR] Bump org.apache.kafka:kafka-clients from 4.1.1 to 4.2.0 [tika]
via GitHub
[PR] Bump org.jetbrains:annotations from 26.0.2-1 to 26.1.0 [tika]
via GitHub
Re: [PR] Bump org.jetbrains:annotations from 26.0.2-1 to 26.1.0 [tika]
via GitHub
[PR] Bump okhttp.version from 4.12.0 to 5.3.2 [tika]
via GitHub
Re: [PR] Bump okhttp.version from 4.12.0 to 5.3.2 [tika]
via GitHub
Re: [PR] Bump okhttp.version from 4.12.0 to 5.3.2 [tika]
via GitHub
Re: [PR] Bump okhttp.version from 4.12.0 to 5.3.2 [tika]
via GitHub
[PR] Bump org.jetbrains.kotlin:kotlin-stdlib from 1.9.10 to 2.3.10 [tika]
via GitHub
Re: [PR] Bump org.jetbrains.kotlin:kotlin-stdlib from 1.9.10 to 2.3.10 [tika]
via GitHub
[PR] Bump org.jetbrains.kotlin:kotlin-stdlib-jdk8 from 1.9.10 to 2.3.10 [tika]
via GitHub
Re: [PR] Bump org.jetbrains.kotlin:kotlin-stdlib-jdk8 from 1.9.10 to 2.3.10 [tika]
via GitHub
[PR] Bump com.fasterxml.jackson:jackson-bom from 2.21.0 to 2.21.1 [tika]
via GitHub
Re: [PR] Bump com.fasterxml.jackson:jackson-bom from 2.21.0 to 2.21.1 [tika]
via GitHub
[PR] Bump twelvemonkeys.version from 3.13.0 to 3.13.1 [tika]
via GitHub
Re: [PR] Bump twelvemonkeys.version from 3.13.0 to 3.13.1 [tika]
via GitHub
[PR] Bump software.amazon.awssdk:bom from 2.41.29 to 2.41.34 [tika]
via GitHub
Re: [PR] Bump software.amazon.awssdk:bom from 2.41.29 to 2.41.34 [tika]
via GitHub
[PR] Bump net.sourceforge.tess4j:tess4j from 5.16.0 to 5.18.0 [tika]
via GitHub
Re: [PR] Bump net.sourceforge.tess4j:tess4j from 5.16.0 to 5.18.0 [tika]
via GitHub
[PR] Bump google-auth-library-oauth2-http.version from 1.42.1 to 1.43.0 [tika]
via GitHub
Re: [PR] Bump google-auth-library-oauth2-http.version from 1.42.1 to 1.43.0 [tika]
via GitHub
[PR] Bump com.mchange:mchange-commons-java from 0.3.2 to 0.4.0 [tika]
via GitHub
Re: [PR] Bump com.mchange:mchange-commons-java from 0.3.2 to 0.4.0 [tika]
via GitHub
[PR] Bump com.googlecode.plist:dd-plist from 1.28 to 1.29 [tika]
via GitHub
Re: [PR] Bump com.googlecode.plist:dd-plist from 1.28 to 1.29 [tika]
via GitHub
[PR] TIKA-4663 -- add cli option for markdown in 3.x to include tika-batch [tika]
via GitHub
Re: [PR] TIKA-4663 -- add cli option for markdown in 3.x to include tika-batch [tika]
via GitHub
[PR] TIKA-4662 [tika]
via GitHub
[jira] [Commented] (TIKA-4672) Add an emitter for Elasticsearch in 4.x
ASF GitHub Bot (Jira)
[PR] TIKA-4672 - add an Elasticsearch emitter [tika]
via GitHub
[jira] [Created] (TIKA-4672) Add an emitter for Elasticsearch in 4.x
Tim Allison (Jira)
FYI dead-end with TikaPipes and Docker
Mikhail Khludnev
Re: FYI dead-end with TikaPipes and Docker
Tim Allison
Re: FYI dead-end with TikaPipes and Docker
Mikhail Khludnev
Re: FYI dead-end with TikaPipes and Docker
Mikhail Khludnev
Re: FYI dead-end with TikaPipes and Docker
Mikhail Khludnev
[PR] TIKA-4671-lang-aware-charset-detection [tika]
via GitHub
Re: [PR] TIKA-4671-lang-aware-charset-detection [tika]
via GitHub
[jira] [Commented] (TIKA-4671) Use langid to adjudicate charset detector disagreements
Hudson (Jira)
[jira] [Commented] (TIKA-4671) Use langid to adjudicate charset detector disagreements
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4671) Use langid to adjudicate charset detector disagreements
ASF GitHub Bot (Jira)
[jira] [Created] (TIKA-4671) Use langid to adjudicate charset detector disagreements
Tim Allison (Jira)
[PR] fix flaky windows timeouts on ci/cd [tika]
via GitHub
Re: [PR] fix flaky windows timeouts on ci/cd [tika]
via GitHub
[PR] TIKA-4663 -- add cli option for markdown in 3.x [tika]
via GitHub
Re: [PR] TIKA-4663 -- add cli option for markdown in 3.x [tika]
via GitHub
Re: [PR] TIKA-4663 -- add cli option for markdown in 3.x [tika]
via GitHub
Re: [PR] TIKA-4663 -- add cli option for markdown in 3.x [tika]
via GitHub
[jira] [Commented] (TIKA-4670) Improve early crash detection in pipesserver in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4670) Improve early crash detection in pipesserver in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4670) Improve early crash detection in pipesserver in 4.x
Hudson (Jira)
[PR] TIKA-4670 -- improve exit handling btwn pipesclient and pipesserver [tika]
via GitHub
Re: [PR] TIKA-4670 -- improve exit handling btwn pipesclient and pipesserver [tika]
via GitHub
[jira] [Created] (TIKA-4670) Improve early crash detection in pipesserver in 4.x
Tim Allison (Jira)
[jira] [Commented] (TIKA-4669) Improve runtime serialization updates in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4669) Improve runtime serialization updates in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4669) Improve runtime serialization updates in 4.x
Hudson (Jira)
[PR] TIKA-4669 -- improve serdes [tika]
via GitHub
Re: [PR] TIKA-4669 -- improve serdes [tika]
via GitHub
[jira] [Created] (TIKA-4669) Improve runtime serialization updates in 4.x
Tim Allison (Jira)
[jira] [Commented] (TIKA-4668) Simplify version with maven $revision in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4668) Simplify version with maven $revision in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4668) Simplify version with maven $revision in 4.x
Hudson (Jira)
[PR] TIKA-4668 -- modernize versioning with $revision [tika]
via GitHub
Re: [PR] TIKA-4668 -- modernize versioning with $revision [tika]
via GitHub
[jira] [Created] (TIKA-4668) Simplify version with maven $revision in 4.x
Tim Allison (Jira)
[jira] [Resolved] (TIKA-4661) Automate tika-helm release on Tika version update
Lewis John McGibbney (Jira)
[jira] [Commented] (TIKA-4667) Add tess4j wrapper in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4667) Add tess4j wrapper in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4667) Add tess4j wrapper in 4.x
Hudson (Jira)
[PR] TIKA-4667 - add Tess4J in-process OCR parser and docs [tika]
via GitHub
Re: [PR] TIKA-4667 - add Tess4J in-process OCR parser and docs [tika]
via GitHub
[jira] [Commented] (TIKA-4666) Add VLM/modern OCR options parsers in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4666) Add VLM/modern OCR options parsers in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4666) Add VLM/modern OCR options parsers in 4.x
Hudson (Jira)
[PR] TIKA-4665-inference-module [tika]
via GitHub
Re: [PR] TIKA-4665-inference-module [tika]
via GitHub
[PR] TIKA-4666 - add VLM parsers (Claude, Gemini, OpenAI) [tika]
via GitHub
Re: [PR] TIKA-4666 - add VLM parsers (Claude, Gemini, OpenAI) [tika]
via GitHub
[jira] [Commented] (TIKA-4665) Add chunking and inference handling poc in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4665) Add chunking and inference handling poc in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4665) Add chunking and inference handling poc in 4.x
Hudson (Jira)
[jira] [Commented] (TIKA-4664) Add poppler renderer and remove mutool rendering wrapper in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4664) Add poppler renderer and remove mutool rendering wrapper in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4664) Add poppler renderer and remove mutool rendering wrapper in 4.x
Hudson (Jira)
[PR] TIKA-4664 - add Poppler renderer, replace MuPDF, add OCR safety limits [tika]
via GitHub
Re: [PR] TIKA-4664 - add Poppler renderer, replace MuPDF, add OCR safety limits [tika]
via GitHub
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
Hudson (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
Hudson (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4663) Switch default handler type to markdown in 4.x
Hudson (Jira)
[PR] TIKA-4663 - add content handler type metadata and switch default to markdown [tika]
via GitHub
Re: [PR] TIKA-4663 - add content handler type metadata and switch default to markdown [tika]
via GitHub
[jira] [Resolved] (TIKA-4630) embeddedRelationshipId is missing from tar files that are children of gzip files (i.e. tarballs)
Tim Allison (Jira)
[jira] [Updated] (TIKA-4665) Add chunking and inference handling poc in 4.x
Tim Allison (Jira)
[jira] [Created] (TIKA-4667) Add tess4j wrapper in 4.x
Tim Allison (Jira)
[jira] [Created] (TIKA-4666) Add VLM/modern OCR options parsers in 4.x
Tim Allison (Jira)
[jira] [Created] (TIKA-4665) Add chunking and inference handling poc in 4.x
Tim Allison (Jira)
[jira] [Created] (TIKA-4664) Add poppler renderer and remove mutool rendering wrapper in 4.x
Tim Allison (Jira)
[jira] [Created] (TIKA-4663) Switch default handler type to markdown in 4.x
Tim Allison (Jira)
[jira] [Commented] (TIKA-4662) Modernize lang-detector for at least 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4662) Modernize lang-detector for at least 4.x
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4662) Modernize lang-detector for at least 4.x
Hudson (Jira)
[jira] [Commented] (TIKA-4662) Modernize lang-detector for at least 4.x
ASF GitHub Bot (Jira)
[PR] TIKA-4662 -- update language detection [tika]
via GitHub
Re: [PR] TIKA-4662 -- update language detection [tika]
via GitHub
[jira] [Created] (TIKA-4662) Modernize lang-detector for at least 4.x
Tim Allison (Jira)
[PR] Bump software.amazon.awssdk:bom from 2.41.28 to 2.41.29 [tika]
via GitHub
Re: [PR] Bump software.amazon.awssdk:bom from 2.41.28 to 2.41.29 [tika]
via GitHub
[PR] Bump org.springframework:spring-context from 7.0.3 to 7.0.4 [tika]
via GitHub
Re: [PR] Bump org.springframework:spring-context from 7.0.3 to 7.0.4 [tika]
via GitHub
[PR] Bump org.xerial:sqlite-jdbc from 3.51.1.0 to 3.51.2.0 [tika]
via GitHub
Re: [PR] Bump org.xerial:sqlite-jdbc from 3.51.1.0 to 3.51.2.0 [tika]
via GitHub
[jira] [Commented] (TIKA-4661) Automate tika-helm release on Tika version update
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4661) Automate tika-helm release on Tika version update
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4661) Automate tika-helm release on Tika version update
ASF GitHub Bot (Jira)
[jira] [Updated] (TIKA-4661) Automate tika-helm release on Tika version update
Lewis John McGibbney (Jira)
[PR] TIKA-4661 Automate tika-helm release on Tika version update [tika-helm]
via GitHub
Re: [PR] TIKA-4661 Automate tika-helm release on Tika version update [tika-helm]
via GitHub
Re: [PR] TIKA-4661 Automate tika-helm release on Tika version update [tika-helm]
via GitHub
[jira] [Created] (TIKA-4661) Automate tika-helm release on Tika version update Description:
Lewis John McGibbney (Jira)
[jira] [Resolved] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
Lewis John McGibbney (Jira)
[PR] Bump Apache Tika Docker image to 3.2.3.0-full [tika-helm]
via GitHub
Re: [PR] Bump Apache Tika Docker image to 3.2.3.0-full [tika-helm]
via GitHub
[jira] [Commented] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
ASF GitHub Bot (Jira)
[jira] [Commented] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
Lewis John McGibbney (Jira)
[PR] TIKA-4660 Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions [tika-helm]
via GitHub
Re: [PR] TIKA-4660 Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions [tika-helm]
via GitHub
Re: [PR] TIKA-4660 Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions [tika-helm]
via GitHub
[PR] TIKA-4660 Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions [tika-helm]
via GitHub
Re: [PR] TIKA-4660 Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions [tika-helm]
via GitHub
[jira] [Updated] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
Lewis John McGibbney (Jira)
[jira] [Updated] (TIKA-4660) Add automated Tika Docker image version bump workflow and upgrade all GitHub Actions
Lewis John McGibbney (Jira)
[jira] [Created] (TIKA-4660) dd automated Tika Docker image version bump workflow and upgrade all GitHub Actions
Lewis John McGibbney (Jira)
[jira] [Resolved] (TIKA-4657) Endnote content in tables omitted from .docx text
Tim Allison (Jira)
[PR] remove bean setters/getters on parsers and detectors; other fixes to work when tesseract is installed [tika]
via GitHub
Re: [PR] remove bean setters/getters on parsers and detectors; other fixes to work when tesseract is installed [tika]
via GitHub
[PR] TIKA-4657 -- improve extraction from footnote/endnotes in xwpf [tika]
via GitHub
Re: [PR] TIKA-4657 -- improve extraction from footnote/endnotes in xwpf [tika]
via GitHub
[jira] [Commented] (TIKA-4659) Add tika-eval-lite for embedded junk detection
ASF GitHub Bot (Jira)
[PR] TIKA-4659 -- tika-eval-lite [tika]
via GitHub
[jira] [Created] (TIKA-4659) Add tika-eval-lite for embedded junk detection
Tim Allison (Jira)
[jira] [Commented] (TIKA-4657) Endnote content in tables omitted from .docx text
Tilman Hausherr (Jira)
Earlier messages