[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13889317#comment-13889317
]
Prashanth Ramaswamy commented on TIKA-245:
--
Nick, Thanks for your response. Unfort
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13888787#comment-13888787
]
Nick Burch commented on TIKA-245:
-
Prashanth - you might be best off opening a new bug for t
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13888778#comment-13888778
]
Prashanth Ramaswamy commented on TIKA-245:
--
Hi, I still get the Array index excepti
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13857520#comment-13857520
]
Jukka Zitting commented on TIKA-245:
bq. tika is not able to extract contents from chm
Tika chm support has its limitations, can you provide such file(s) for
further investigation ?
BR,
Oleg
On Wed, Mar 6, 2013 at 1:10 AM, Tejas Patil (JIRA) wrote:
>
> [
> https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&foc
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13594074#comment-13594074
]
Tejas Patil commented on TIKA-245:
--
I am working on NUTCH-1454 and I am observing that tika
Hi Tran Nam Quang,
Currently our CHM extractor skips all entities that are not HTML.
It would be great if you could write a list of desired entities to be
extracted. In addition, if you can, please attach the CHM files you're
working with.
BR,
Oleg
On Sat, Oct 22, 2011 at 8:08 AM, Tran Nam Quan
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133260#comment-13133260
]
Tran Nam Quang commented on TIKA-245:
-
@ Oleg
I tested the CHM parser from Tika 0.10 on
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13046247#comment-13046247
]
Mattmann, Chris A (388J) commented on TIKA-245:
---
Awesome!
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13046231#comment-13046231
]
Oleg Tikhonov commented on TIKA-245:
Committed revision 1133556.
> Support of CHM Forma
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13045506#comment-13045506
]
Jukka Zitting commented on TIKA-245:
Nice!
Next time, Oleg, feel free to commit your ch
Hey Jukka,
Thanks for the motivation. I put my money where my mouth was :-)
Oleg, your patch rox. That's all I had to say. My improvement was simply to
commit it to the Tika sources. Feel free to mod/add/whatever on it after that,
per Jukka's comments.
I am going to make one more update, just
Hey Jukka,
On Jun 7, 2011, at 6:55 AM, Jukka Zitting wrote:
> Hi,
>
> On Tue, Jun 7, 2011 at 3:52 PM, Mattmann, Chris A (388J)
> wrote:
>> Please revert r1132997, and then just modify your patch to make sure that
>> your java classes and files fit into the appropriate Tika source code area.
>>
Hi,
On Tue, Jun 7, 2011 at 3:52 PM, Mattmann, Chris A (388J)
wrote:
> Please revert r1132997, and then just modify your patch to make sure that
> your java classes and files fit into the appropriate Tika source code area.
> Then please attach a new patch real quick so I (or some other committer)
Hi Oleg,
On Jun 7, 2011, at 6:28 AM, Oleg Tikhonov wrote:
> Hi Chris,
>
> I've applied the patch to the
> tika-parsers/src/main/java/org/apache/tika/parser/chm, also added 3 chm
> files to the tika-parsers\src\test\resources\test-documents and the tests.
Thanks sorry I think I confused you with
Hi Chris,
I've applied the patch to the
tika-parsers/src/main/java/org/apache/tika/parser/chm, also added 3 chm
files to the tika-parsers\src\test\resources\test-documents and the tests.
BR,
Oleg
On Sun, Jun 5, 2011 at 1:32 AM, Chris A. Mattmann (JIRA) wrote:
>
>[
> https://issues.apache.or
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13044403#comment-13044403
]
Chris A. Mattmann commented on TIKA-245:
Hi Oleg,
Looking over this patch, I have a
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13014247#comment-13014247
]
Chris A. Mattmann commented on TIKA-245:
Guys, we'd prefer to not use LGPL (and ther
Hello Tran Nam Quang,
It uses CHMLIB C library, i.e. JNI. From my previous experience, it works
for limited amount of os'es. It does not work in Solaris or AIX.
The really good library with limitations mentioned above is
http://sevenzipjbind.sourceforge.net/ and also LGPL (I would say, the best
on
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13014110#comment-13014110
]
Tran Nam Quang commented on TIKA-245:
-
Hello guys,
Here's another CHM library for Java,
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008802#comment-13008802
]
Nick Burch commented on TIKA-245:
-
Eventually we might need to separate out some of the file
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008797#comment-13008797
]
Oleg Tikhonov commented on TIKA-245:
I've implemented chm extractor, based on the same e
I've written to the sourceforge.org's support having asked to take over the
project or help me to contact the owner. Here is their response:
"Comment:
Hello,
We have attempted contact with the current project administrator. We will
let you know once they approve or reject this takeover request
[
https://issues.apache.org/jira/browse/TIKA-245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894574#action_12894574
]
Nick Burch commented on TIKA-245:
-
JCHM seems to be under the CDDL license, so we're fine to
24 matches
Mail list logo