[jira] [Commented] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread William Miller (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924297#comment-17924297 ] William Miller commented on TIKA-4380: -- I'll try that when I have some time later ton

[jira] [Commented] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread William Miller (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924285#comment-17924285 ] William Miller commented on TIKA-4380: -- Not sure if it matters, but what worked for m

[jira] [Commented] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924287#comment-17924287 ] Tim Allison commented on TIKA-4380: --- This branch works locally: https://github.com/apach

[jira] [Comment Edited] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924284#comment-17924284 ] Tim Allison edited comment on TIKA-4380 at 2/5/25 9:49 PM: --- Do w

[jira] [Comment Edited] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924282#comment-17924282 ] Tim Allison edited comment on TIKA-4380 at 2/5/25 9:45 PM: --- So,

[jira] [Commented] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924284#comment-17924284 ] Tim Allison commented on TIKA-4380: --- Do we need to add it conditionally? > Java crash o

[jira] [Commented] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924282#comment-17924282 ] Tim Allison commented on TIKA-4380: --- So, when I do this: {noformat} ENV JAVA_OPTS="-XX:U

[jira] [Commented] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread William Miller (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924256#comment-17924256 ] William Miller commented on TIKA-4380: -- I can't speak to [~tallison]'s question regar

[jira] [Commented] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924244#comment-17924244 ] Tim Allison commented on TIKA-4380: --- Can anyone more familiar {{tika-docker}} than I am

[jira] [Updated] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-4380: -- Component/s: tika-docker > Java crash on M4 processor > -- > > K

[jira] [Created] (TIKA-4380) Java crash on M4 processor

2025-02-05 Thread William Miller (Jira)
William Miller created TIKA-4380: Summary: Java crash on M4 processor Key: TIKA-4380 URL: https://issues.apache.org/jira/browse/TIKA-4380 Project: Tika Issue Type: Bug Reporter: W

[jira] [Comment Edited] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924076#comment-17924076 ] Subbu edited comment on TIKA-4370 at 2/5/25 1:49 PM: - Appreciate your

[jira] [Comment Edited] (TIKA-1180) Better Matroska MKV and WEBM Detection

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924145#comment-17924145 ] Subbu edited comment on TIKA-1180 at 2/5/25 3:46 PM: - [~tallison]  Sur

[jira] [Commented] (TIKA-1180) Better Matroska MKV and WEBM Detection

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924145#comment-17924145 ] Subbu commented on TIKA-1180: - [~tallison]  Sure thing. I can help with PR as above. But just

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924087#comment-17924087 ] Tim Allison commented on TIKA-4370: --- And, we currently hardcode MimeTypes as the last de

[jira] [Comment Edited] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924076#comment-17924076 ] Subbu edited comment on TIKA-4370 at 2/5/25 1:52 PM: - Appreciate your

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924085#comment-17924085 ] Tim Allison commented on TIKA-4370: --- My proposal is definitely application layer, not ba

[jira] [Comment Edited] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924076#comment-17924076 ] Subbu edited comment on TIKA-4370 at 2/5/25 1:55 PM: - Appreciate your

[jira] [Comment Edited] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924076#comment-17924076 ] Subbu edited comment on TIKA-4370 at 2/5/25 1:48 PM: - Appreciate your

[jira] [Commented] (TIKA-1180) Better Matroska MKV and WEBM Detection

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924075#comment-17924075 ] Tim Allison commented on TIKA-1180: --- A PR would be helpful. > Better Matroska MKV and W

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924076#comment-17924076 ] Subbu commented on TIKA-4370: - Appreciate your responses here. _Perhaps, if the file is appli

[jira] [Comment Edited] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924073#comment-17924073 ] Tim Allison edited comment on TIKA-4370 at 2/5/25 1:45 PM: --- bq.

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924069#comment-17924069 ] Subbu commented on TIKA-4370: - And this shouldn't be file specific but all SJIS encoded files.

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924073#comment-17924073 ] Tim Allison commented on TIKA-4370: --- bq. Haha new thing called TextAndCSVParser Sorry. I

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924066#comment-17924066 ] Subbu commented on TIKA-4370: - Haha new thing called TextAndCSVParser, :D I was talking about

[jira] [Comment Edited] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924064#comment-17924064 ] Tim Allison edited comment on TIKA-4370 at 2/5/25 1:23 PM: --- Not

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924064#comment-17924064 ] Tim Allison commented on TIKA-4370: --- Not sure precisely what your proposal is. If you're

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924052#comment-17924052 ] Subbu commented on TIKA-4370: - But TextDetector is in core and TXTParser comes later in parser

[jira] [Comment Edited] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924032#comment-17924032 ] Subbu edited comment on TIKA-4370 at 2/5/25 12:05 PM: -- [~tallison]  I

[jira] [Comment Edited] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17923606#comment-17923606 ] Subbu edited comment on TIKA-4370 at 2/5/25 11:25 AM: -- _Unless I misu

[jira] [Commented] (TIKA-4370) SJIS Encoded Files Can't be Detected

2025-02-05 Thread Subbu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17924032#comment-17924032 ] Subbu commented on TIKA-4370: - [~tallison]  I reviewed this further and see that in UTF8 files

Re: [VOTE] Release Apache Tika 2.9.3 Candidate #1

2025-02-05 Thread TvT
+1 Am Mo., 3. Feb. 2025 um 15:48 Uhr schrieb Nicholas DiPiazza < nicholas.dipia...@gmail.com>: > +1 > > On Mon, Feb 3, 2025, 7:56 AM Tilman Hausherr > wrote: > > > +1 > > > > builds on Windows 10, oracle jdk8 > > > > Tilman > > > > On 03.02.2025 13:43, Tim Allison wrote: > > > A candidate for th