Build failed in Jenkins: Tika-trunk #837

2012-04-26 Thread Apache Jenkins Server
See Changes: [mattmann] - apply patch from TIKA-901: Provide version number in tika-server contributed by Ingo Renner -- [...truncated 2255 lines...] [WARNING] We have a duplicate org/apache/tika/fork

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Mattmann, Chris A (388J)
Hi Antoni, Precisely! :) That would be awesome huh. And, my goal there too is to turn Any23 parsers into Tika parsers too as I think they could be one and the same (with an RDF or XMP or RSS ContentHandler transforming the Tika intermediate SAX output the same). Cheers, Chris On Apr 26, 2012, a

[jira] [Resolved] (TIKA-901) Provide version number in tika-server

2012-04-26 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved TIKA-901. Resolution: Fixed Committed in r1331147, thanks Ingo YOU ROCK! > Provide ver

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Antoni Mylka
2012/04/26 Mattmann, Chris A (388J) wrote: Hi Guys, One comment RE: the below too -- this is precisely where I see Any23 coming into play and why there is a strong relationship between it and Tika: http://incubator.apache.org/any23/ I'm the current Champion for the project and the Tika

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Antoni Mylka
2012/04/25 Joerg Ehrlich wrote: Hi, I have put a proposal of a roadmap for the metadata features in Tika on the wiki: http://wiki.apache.org/tika/MetadataRoadmap The proposal is based on a discussion around this topic I have had with Jukka. Please review and feel free to edit the wiki

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Mattmann, Chris A (388J)
Hi Guys, One comment RE: the below too -- this is precisely where I see Any23 coming into play and why there is a strong relationship between it and Tika: http://incubator.apache.org/any23/ I'm the current Champion for the project and the Tika PMC is sponsoring the podling. Please check it out

RE: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Joerg Ehrlich
Yes, that is exactly my biggest concern. Another nice example is regional metadata like from a face detection (taken from MWG guidance V2): Face John Doe ... And I also definitely meant to keep the current

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Ray Gauss II
I think besides the namespaces, one of the issues Jörg is trying to tackle is the structured metadata and the extra time and effort referred to is dealing with serialization of structured data to and from a hashmap. For example I may have metadata similar to: Contact1 |-- First Name |-- Last Na
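
To make the "serialization of structured data to and from a hashmap" point concrete, here is a minimal sketch in plain Java. It is not Tika's Metadata API and not anything proposed in the thread. The class name `FlatMetadataSketch`, the dotted-key convention, the `Email` field, and the sample values are all assumptions made for illustration; only `Contact1` and `First Name` come from the message above.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: round-trips nested metadata through a flat String map.
// Assumes field names never contain '.', and that no key is both a leaf and a parent.
public class FlatMetadataSketch {

    // Flatten nested maps into dotted keys, e.g. "Contact1.First Name".
    public static Map<String, String> flatten(Map<String, Object> nested) {
        Map<String, String> flat = new LinkedHashMap<>();
        flattenInto("", nested, flat);
        return flat;
    }

    private static void flattenInto(String prefix, Map<String, Object> node,
                                    Map<String, String> out) {
        for (Map.Entry<String, Object> e : node.entrySet()) {
            String key = prefix.isEmpty() ? e.getKey() : prefix + "." + e.getKey();
            Object value = e.getValue();
            if (value instanceof Map) {
                @SuppressWarnings("unchecked")
                Map<String, Object> child = (Map<String, Object>) value;
                flattenInto(key, child, out);
            } else {
                out.put(key, String.valueOf(value));
            }
        }
    }

    // Rebuild the nested structure by splitting each flat key on '.'.
    public static Map<String, Object> unflatten(Map<String, String> flat) {
        Map<String, Object> root = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : flat.entrySet()) {
            String[] parts = e.getKey().split("\\.");
            Map<String, Object> node = root;
            for (int i = 0; i < parts.length - 1; i++) {
                @SuppressWarnings("unchecked")
                Map<String, Object> child = (Map<String, Object>) node.computeIfAbsent(
                        parts[i], k -> new LinkedHashMap<String, Object>());
                node = child;
            }
            node.put(parts[parts.length - 1], e.getValue());
        }
        return root;
    }

    public static void main(String[] args) {
        Map<String, Object> contact = new LinkedHashMap<>();
        contact.put("First Name", "Jane");          // sample value, hypothetical
        contact.put("Email", "jane@example.org");   // hypothetical field
        Map<String, Object> nested = new LinkedHashMap<>();
        nested.put("Contact1", contact);

        Map<String, String> flat = flatten(nested);
        System.out.println(flat);
        System.out.println(unflatten(flat));
    }
}
```

Even in this small form, every consumer of the flat map has to agree on the key convention and repeat the split-and-rebuild step, which is the kind of extra effort the message refers to.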

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Mattmann, Chris A (388J)
Hi Jörg, Thanks for your email, comments below: On Apr 26, 2012, at 3:35 AM, Joerg Ehrlich wrote: > Hi Chris, > > Those are all valid points and I agree that you could do everything with a > Hashmap. > Having the parsers fill the Metadata class and its Hashmap with all needed > information w

RE: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Joerg Ehrlich
Hi Chris, Those are all valid points and I agree that you could do everything with a Hashmap. Having the parsers fill the Metadata class and its Hashmap with all needed information which is then consumed by an XMP component sitting on top of Tika-Core is definitely an interesting solution whic