Re: renaming master?

2020-06-17 Thread Ray Gauss II
Hi all, Apologies for not being able to be very involved over the past few years, but still trying to follow along and hoping to get time to contribute in the future. Another option might be ‘stable’? - Ray > On Jun 16, 2020, at 1:31 PM, Tim Allison wrote: > > All, > > As you may have see

Embed and ExifTool Contributions

2011-11-03 Thread Ray Gauss II
estion was to raise JIRA issues for each concept which I'll do shortly. Please let me know if you have any questions or suggestions. Thanks. Ray Gauss II http://rightspro.com

ExifTool Parser Conventions

2012-01-17 Thread Ray Gauss II
The ExifTool parser found here ( https://issues.apache.org/jira/browse/TIKA-774 ) has been refactored a bit and pulled out into a separate project in light of the coming modularity changes. Is there any convention to be followed yet in terms of naming of such a project or where it should live?

Re: Sharing metadata logic between parsers

2012-01-30 Thread Ray Gauss II
I personally like Nick's 3rd idea: Extending the Property class to support a converter. Even in the case described here the Metadata property setters could be modified to something like: public void set(Property property, int value) { if(property.getPropertyType() != Property.PropertyTy

Re: Metadata situation and XMP support in Tika

2012-04-10 Thread Ray Gauss II
rward if you'd like. Ray Gauss II DAM Architect, Alfresco On Apr 5, 2012, at 8:58 AM, Joerg Ehrlich wrote: > Hi everyone, > > I am an engineer in the XMP/Metadata team at Adobe and we would like to > leverage Tika in current projects for metadata extraction (and mimetype >

Re: Metadata situation and XMP support in Tika

2012-04-13 Thread Ray Gauss II
For the IPTC example specifically, all properties are defined using their respective namespaces, but some are defined 'inline' while others are an alias to the referenced standard, i.e. Property KEYWORDS = DublinCore.DC_SUBJECT; If I'm understanding you correctly your proposal is to do that

Re: Metadata situation and XMP support in Tika

2012-04-13 Thread Ray Gauss II
"standard" > properties available in the Metadata class would be a good help for clients > who don't care which "title" or "author" they read. They could just say > "Metadata.title" instead of "DublinCore.title". > > Rega

Re: Metadata situation and XMP support in Tika

2012-04-24 Thread Ray Gauss II
I think the aliasing approach supports both use cases nicely, i.e.: Metatadata.java: ... Property TITLE = DublinCore.DC_TITLE; ... Users then only have to concern themselves with "give me the metadata that best fits the idea of Title, as defined by Tika", and not even have to know about Dubl

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Ray Gauss II
I think besides the namespaces, one of the issues Jörg is trying to tackle is the structured metadata and the extra time and effort referred to is dealing with serialization of structured data to and from a hashmap. For example I may have metadata similar to: Contact1 |-- First Name |-- Last Na

Re: [DISCUSS] Apache Tika 1.2 RC?

2012-05-28 Thread Ray Gauss II
It would be nice to get https://issues.apache.org/jira/browse/TIKA-930 resolved before people start using some of those properties which may be dropped. On May 28, 2012, at 9:19 AM, Michael McCandless wrote: > +1 to release 1.2! > > The more frequent releases the better :) > > Thanks Chris.

Re: Welcome Ray Gauss as a Tika committer/PMC

2012-06-10 Thread Ray Gauss II
Hello everyone, and thank you! I consider it a great honor and privilege to be part of a community with such forethought and ingenuity. I've been contributing to open source projects for some time and I'm now at Alfresco implementing features around rich media, including metadata, where Tika o

Re: JAX-RS overhead in tika-server

2012-07-01 Thread Ray Gauss II
Is there any consensus on TIKA-930 [1]? I don't want to wipe out properties that others feel are critical or include non-ratified standards if that's outside of policy. If there's no objection to what's outlined in that issue I can commit those changes tomorrow morning (GMT -4). Regards, Ray

Re: Can't build javadocs for 1.2 API site docs

2012-07-17 Thread Ray Gauss II
Should be fixed in r1362429. Regards, Ray On Jul 17, 2012, at 3:57 AM, Joerg Ehrlich wrote: > Hi, > > Unfortunately I am currently in a whole week workshop. I will try to have a > look at it as soon as possible. > > Regards > jörg > > -Original Message- > From: Mattmann, Chris A (3

Re: Can't build javadocs for 1.2 API site docs

2012-07-17 Thread Ray Gauss II
Should I merge this to tags/1.2? On Jul 17, 2012, at 5:40 AM, Ray Gauss II wrote: > Should be fixed in r1362429. > > Regards, > > Ray > > > On Jul 17, 2012, at 3:57 AM, Joerg Ehrlich wrote: > >> Hi, >> >> Unfortunately I am currently in a whole

Re: [DISCUSS] Including tika-server WAR in 1.3 artifacts?

2012-07-27 Thread Ray Gauss II
+1 from me. On Jul 20, 2012, at 11:32 AM, "Mattmann, Chris A (388J)" wrote: > Hey Guys, > > Now that we have tika-server, etc., I was thinking of including it like we do > tika-app > as a release artifact in 1.3-on. > > That sound OK? > > Cheers, > Chris > > +++

Re: Build failed in Jenkins: Tika-trunk #906

2012-08-01 Thread Ray Gauss II
Anyone have ideas on this one? Is it really something I did? On Aug 1, 2012, at 3:17 PM, Apache Jenkins Server wrote: > See > > Changes: > > [rgauss] TIKA-965: Text Detection Fails on Mostly Non-ASCII UTF-8 Files > - Added looksLikeU

Re: Build failed in Jenkins: Tika-trunk #934

2012-10-29 Thread Ray Gauss II
DecimalFormatSymbol.getInstance(Locale locale) definitely is Java 6 only which is why the first commit broke the build. I attempted to fix that in r1403221 with a call to 'new DecimalFormatSymbols(Locale)' which looks like it was around in 5 [1]. Regards, Ray [1] http://docs.oracle.com/jav

Re: [DISCUSS] Release Candidate for 1.3?

2013-01-08 Thread Ray Gauss II
The code for TIKA-775 [1] is on trunk but it was re-opened with some concerns, some of which were addressed and some of which are still open discussions, though I think minor enough to create separate issues if need be and resolve TIKA-775 as fixed. [1] https://issues.apache.org/jira/browse/TIK

Re: [VOTE] Apache Tika 1.3 Release Candidate #1

2013-01-20 Thread Ray Gauss II
Built on OS X, updated tika-exiftool to depend on 1.3 which compiled and passed tests. +1 for release! Cheers, Ray On Jan 18, 2013, at 11:30 PM, Dave Meikle wrote: > Hi Guys, > > A candidate for the Tika 1.3 release is available at: > >http://people.apache.org/~dmeikle/apache-tika-1.3

Re: Build failed in Jenkins: Tika-trunk #994

2013-05-01 Thread Ray Gauss II
Looks like a possible build server problem. Does anyone have access to manually trigger another build? Regards, Ray On May 1, 2013, at 5:01 PM, Apache Jenkins Server wrote: > See

Re: Build failed in Jenkins: Tika-trunk #994

2013-05-01 Thread Ray Gauss II
> Subject: Jenkins build is back to normal : Tika-trunk #995 Yay, thanks! On May 1, 2013, at 5:24 PM, Michael McCandless wrote: > I just kicked off another build ... (it's queued). > > Mike McCandless > > http://blog.mikemccandless.com > > > On Wed, May

Re: MP4Parser triggers .... something betwwen an exception and endDocument() from the Contenthandlers point of view?

2013-06-07 Thread Ray Gauss II
I think the Parser interface Javadoc would make sense as a place to document, but I don't know if there is an existing policy. We'll certainly need to consider things like DelegatingParsers which may be using other parsers to do portions of the work. Not the principle comment you were looking f

Re: RFC822Parser build error on gump

2013-06-28 Thread Ray Gauss II
I know very little about gump, but looking at the log the build seems to have skipped the mime4j artifacts altogether. On Jun 25, 2013, at 6:25 PM, Nick Burch wrote: > Hi All > > Anyone have any idea about this compiler error on the tika parsers project as > hit by gump? > http://vmgump.apac

Tika Core and Parsers Test Artifacts

2013-07-18 Thread Ray Gauss II
I don't recall if we've discussed this already (I did do a brief search and didn't see anything). Is there any opposition to adding test-jar Maven artifacts for tika-core and tika-parsers? Seems like it would be good to allow others to extend from tests there if need be.

Re: Tika Core and Parsers Test Artifacts

2013-07-18 Thread Ray Gauss II
3, at 9:19 AM, Ken Krugler wrote: > Hi Ray, > > On Jul 18, 2013, at 5:14am, Ray Gauss II wrote: > >> I don't recall if we've discussed this already (I did do a brief search and >> didn't see anything). >> >> Is there any opposition to ad

Re: Tika Core and Parsers Test Artifacts

2013-07-22 Thread Ray Gauss II
neral I'll create a JIRA issue where we can discuss the details. Regards, Ray On Jul 21, 2013, at 3:25 PM, Ken Krugler wrote: > Hi Ray, > > On Jul 18, 2013, at 6:37am, Ray Gauss II wrote: > >> Hi Ken, >> >> They recommend test-jar instead of classifier no

Re: Extract thumbnail from openxml office files

2014-01-08 Thread Ray Gauss II
Hi Hong-Thai, It’s certainly worth investigating.  Several other formats can have embedded thumbnails as well so we could implement a generic thumbnail property. We could probably store as something like a Base64 encoded string, but we’d likely want to place limits on the size and may need a th

Re: [DISCUSS] Centralizing JSON handling of Metadata

2014-05-28 Thread Ray Gauss II
Hi Tim, 1) Sounds good to me. 2) I do think we want core as lean as possible, so my vote would be for a separate project/module, similar to what was done with tika-xmp.  Perhaps something like tika-serialization-json to indicate other formats may follow in the same precedence? 3) Similar to a

RE: [DISCUSS] Centralizing JSON handling of Metadata

2014-05-28 Thread Ray Gauss II
hat would be blckbelt), so I'm happy to go with either. > > A new compilation unit makes sense. I'm wondering if we want to be that > specific? tika-serialization? > Or, maybe just tika-utils? > > Package name looks good to me. > > Thanks, again! > &g

[jira] [Commented] (TIKA-93) OCR support

2014-08-19 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-93?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14102175#comment-14102175 ] Ray Gauss II commented on TIKA-93: -- Can you create a config object and pass that in

[jira] [Commented] (TIKA-93) OCR support

2014-08-19 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-93?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14102193#comment-14102193 ] Ray Gauss II commented on TIKA-93: -- Apologies, jumped in late and only glanced at

[jira] [Commented] (TIKA-93) OCR support

2014-09-15 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-93?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134822#comment-14134822 ] Ray Gauss II commented on TIKA-93: -- You could use [{{org.junit.Assume}}|

[jira] [Commented] (TIKA-1510) FFMpeg installed but not parsing video files

2015-01-11 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273049#comment-14273049 ] Ray Gauss II commented on TIKA-1510: In that project there

[jira] [Commented] (TIKA-1510) FFMpeg installed but not parsing video files

2015-01-12 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273520#comment-14273520 ] Ray Gauss II commented on TIKA-1510: Yes. The only reason I haven't mysel

[jira] [Commented] (TIKA-634) Command Line Parser for Metadata Extraction

2015-03-01 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14342547#comment-14342547 ] Ray Gauss II commented on TIKA-634: --- Also see the [tika-ffmpeg project|https://github

[jira] [Commented] (TIKA-1594) Webp parsing support

2015-04-07 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14484463#comment-14484463 ] Ray Gauss II commented on TIKA-1594: I'd recommend that for now we t

[jira] [Commented] (TIKA-1607) Introduce new HashMap data structure for persitsence of Tika Metadata

2015-04-21 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14505054#comment-14505054 ] Ray Gauss II commented on TIKA-1607: We've had a few discussions on s

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2015-08-06 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14660441#comment-14660441 ] Ray Gauss II commented on TIKA-1607: To clarify, the work mentioned above that use

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2015-08-19 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14703924#comment-14703924 ] Ray Gauss II commented on TIKA-1607: I've put together the start of the DOM

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2015-08-19 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14704108#comment-14704108 ] Ray Gauss II commented on TIKA-1607: [~chrismattmann], I did. It seemed more sim

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2015-08-20 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14704880#comment-14704880 ] Ray Gauss II commented on TIKA-1607: I did see that, but I was after full

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2015-08-21 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706706#comment-14706706 ] Ray Gauss II commented on TIKA-1607: Yes, by shoehorn I meant that the inde

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2015-09-15 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14746719#comment-14746719 ] Ray Gauss II commented on TIKA-1607: Hi [~talli...@mitre.org], apologies for the d

[jira] [Commented] (TIKA-1824) Tika 2.0 - Create Initial Parser Modules

2016-02-03 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130386#comment-15130386 ] Ray Gauss II commented on TIKA-1824: bq. Thank you, Bob Paulin! Again, thi

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2016-02-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15149231#comment-15149231 ] Ray Gauss II commented on TIKA-1607: Are we opening a can of worms by encouraging

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2016-02-19 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15154205#comment-15154205 ] Ray Gauss II commented on TIKA-1607: In my experience people gravitate towards &#

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2016-02-25 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15167135#comment-15167135 ] Ray Gauss II commented on TIKA-1607: I know there can be multiple XMP packets

[jira] [Commented] (TIKA-1894) Add XMPMM metadata extraction to JempboxExtractor

2016-03-14 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15193622#comment-15193622 ] Ray Gauss II commented on TIKA-1894: The {{tika-xmp}} project deals with converti

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2016-03-14 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15193845#comment-15193845 ] Ray Gauss II commented on TIKA-1607: Have we already considered treating the

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2016-03-15 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15195326#comment-15195326 ] Ray Gauss II commented on TIKA-1607: Sorry, I meant {{EmbeddedDocumentExtra

[jira] [Comment Edited] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2016-03-15 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15193845#comment-15193845 ] Ray Gauss II edited comment on TIKA-1607 at 3/15/16 1:5

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2016-03-15 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15196030#comment-15196030 ] Ray Gauss II commented on TIKA-1607: bq. It might be more easily configurable to

[jira] [Created] (TIKA-1906) ExternalParser No Longer Supports Commands in Array Format

2016-03-21 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-1906: -- Summary: ExternalParser No Longer Supports Commands in Array Format Key: TIKA-1906 URL: https://issues.apache.org/jira/browse/TIKA-1906 Project: Tika Issue Type

[jira] [Commented] (TIKA-1906) ExternalParser No Longer Supports Commands in Array Format

2016-03-22 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15206138#comment-15206138 ] Ray Gauss II commented on TIKA-1906: bq. agreed, sorry must have missed that

[jira] [Comment Edited] (TIKA-1906) ExternalParser No Longer Supports Commands in Array Format

2016-03-22 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15206138#comment-15206138 ] Ray Gauss II edited comment on TIKA-1906 at 3/22/16 2:3

[jira] [Resolved] (TIKA-1906) ExternalParser No Longer Supports Commands in Array Format

2016-03-23 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II resolved TIKA-1906. Resolution: Fixed > ExternalParser No Longer Supports Commands in Array For

[jira] [Updated] (TIKA-1906) ExternalParser No Longer Supports Commands in Array Format

2016-03-23 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-1906: --- Fix Version/s: 1.13 2.0 > ExternalParser No Longer Supports Commands in Ar

[jira] [Commented] (TIKA-774) ExifTool Parser

2016-03-23 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209162#comment-15209162 ] Ray Gauss II commented on TIKA-774: --- bq. we should add a static check for whe

[jira] [Commented] (TIKA-2056) Installing exiftool causes ForkParserIntegration test errors

2016-08-25 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15436705#comment-15436705 ] Ray Gauss II commented on TIKA-2056: My guess is that when Exiftool is availabl

[jira] [Commented] (TIKA-915) Image geodata being rounded to integers

2012-05-09 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13271395#comment-13271395 ] Ray Gauss II commented on TIKA-915: --- Unfortunately this looks like an issue with the

[jira] [Updated] (TIKA-915) Image geodata being rounded to integers

2012-05-09 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-915: -- Attachment: problem_jpeg_geo_test.diff testJPEG_GEO_2.jpg Attached is a patch which

[jira] [Commented] (TIKA-915) Image geodata being rounded to integers

2012-05-14 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13274632#comment-13274632 ] Ray Gauss II commented on TIKA-915: --- I've investigated the latest release, 2.5.

[jira] [Created] (TIKA-925) Remove DublinCore From Metadata and Deprecate String Properties

2012-05-16 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-925: - Summary: Remove DublinCore From Metadata and Deprecate String Properties Key: TIKA-925 URL: https://issues.apache.org/jira/browse/TIKA-925 Project: Tika Issue

[jira] [Updated] (TIKA-925) Remove DublinCore From Metadata and Deprecate String Properties

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-925: -- Attachment: tika-dublincore-changes-parsers.diff tika-dublincore-changes-core.diff

[jira] [Updated] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-926: -- Affects Version/s: 1.1 > Data Typed Metadata.set(...) Value Methods Should Call > Metada

[jira] [Updated] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-926: -- Attachment: tika-metadata-set-core.diff > Data Typed Metadata.set(...) Value Methods Should C

[jira] [Created] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-926: - Summary: Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...) Key: TIKA-926 URL: https://issues.apache.org/jira/browse/TIKA-926 Project: Tika

[jira] [Created] (TIKA-927) Composite Properties

2012-05-16 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-927: - Summary: Composite Properties Key: TIKA-927 URL: https://issues.apache.org/jira/browse/TIKA-927 Project: Tika Issue Type: Improvement Components

[jira] [Updated] (TIKA-927) Composite Properties

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-927: -- Attachment: tika-composite-properties-core.diff Apply to tika-core > Compos

[jira] [Created] (TIKA-928) Separation of Tika Core Properties From Metadata Processing

2012-05-16 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-928: - Summary: Separation of Tika Core Properties From Metadata Processing Key: TIKA-928 URL: https://issues.apache.org/jira/browse/TIKA-928 Project: Tika Issue Type

[jira] [Updated] (TIKA-928) Separation of Tika Core Properties From Metadata Processing

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-928: -- Attachment: tika-core-properties.diff Apply to tika-core. > Separation of Tika C

[jira] [Updated] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-926: -- Attachment: tika-add-by-property.diff Changes to allow for adding by Property and setting an array of

[jira] [Updated] (TIKA-928) Separation of Tika Core Properties From Metadata Processing

2012-05-17 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-928: -- Attachment: tika-core-properties-metadata-refactor-parsers.diff tika-core-properties

[jira] [Updated] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library

2012-05-17 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-842: -- Attachment: tika-iptc-refactoring-core.diff Adds Photoshop and XMPRights interfaces and updates IPTC to

[jira] [Updated] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library

2012-05-17 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-842: -- Attachment: tika-iptc-property-rename.diff Minor rename of properties. Apply to core

[jira] [Created] (TIKA-930) Consolidation of Some Tika Core Properties

2012-05-21 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-930: - Summary: Consolidation of Some Tika Core Properties Key: TIKA-930 URL: https://issues.apache.org/jira/browse/TIKA-930 Project: Tika Issue Type: Improvement

[jira] [Commented] (TIKA-915) Image geodata being rounded to integers

2012-05-23 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13281910#comment-13281910 ] Ray Gauss II commented on TIKA-915: --- Still no response from Mr. Noakes but he has

[jira] [Commented] (TIKA-930) Consolidation of Some Tika Core Properties

2012-05-29 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284794#comment-13284794 ] Ray Gauss II commented on TIKA-930: --- I'm not sure what our policy is on using

[jira] [Commented] (TIKA-930) Consolidation of Some Tika Core Properties

2012-05-29 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284968#comment-13284968 ] Ray Gauss II commented on TIKA-930: --- Creator: "The DublinCore creator i

[jira] [Commented] (TIKA-915) Image geodata being rounded to integers

2012-05-31 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13286565#comment-13286565 ] Ray Gauss II commented on TIKA-915: --- Great news. Mr. Noakes is in the process of get

[jira] [Created] (TIKA-947) AbstractMetadataHandler addMetadata Does not Check Property.isMultiValuePermitted

2012-07-02 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-947: - Summary: AbstractMetadataHandler addMetadata Does not Check Property.isMultiValuePermitted Key: TIKA-947 URL: https://issues.apache.org/jira/browse/TIKA-947 Project: Tika

[jira] [Resolved] (TIKA-947) AbstractMetadataHandler addMetadata Does not Check Property.isMultiValuePermitted

2012-07-02 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II resolved TIKA-947. --- Resolution: Fixed Fixed in r1356406 > AbstractMetadataHandler addMetadata Does

[jira] [Resolved] (TIKA-930) Consolidation of Some Tika Core Properties

2012-07-02 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II resolved TIKA-930. --- Resolution: Fixed Fix Version/s: 1.2 Fixed in r1356560. This ended up being a fairly large

[jira] [Commented] (TIKA-915) Image geodata being rounded to integers

2012-07-20 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419568#comment-13419568 ] Ray Gauss II commented on TIKA-915: --- I ended up pushing version 2.6.2 of the Drew No

[jira] [Commented] (TIKA-915) Image geodata being rounded to integers

2012-07-29 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13424643#comment-13424643 ] Ray Gauss II commented on TIKA-915: --- Resolved in r1366967 >

[jira] [Commented] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support

2012-07-29 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13424642#comment-13424642 ] Ray Gauss II commented on TIKA-811: --- Thanks for the patch Emmanuel, I wish I had see

[jira] [Resolved] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support

2012-07-30 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II resolved TIKA-811. --- Resolution: Fixed Fix Version/s: 1.3 Assignee: Ray Gauss II Resolved by r1366967

[jira] [Resolved] (TIKA-915) Image geodata being rounded to integers

2012-07-30 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II resolved TIKA-915. --- Resolution: Fixed Fix Version/s: 1.3 Assignee: Ray Gauss II Resolved by r1366967

[jira] [Commented] (TIKA-906) Headers, footers, and footnotes not extracted from Pages documents

2012-07-30 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13424828#comment-13424828 ] Ray Gauss II commented on TIKA-906: --- AutoPageNumberUtilsTest,java is missing a lic

[jira] [Reopened] (TIKA-915) Image geodata being rounded to integers

2012-07-30 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II reopened TIKA-915: --- Reopening this as the metadata-extractor library is adding false precision and we should be rounding its

[jira] [Resolved] (TIKA-915) Image geodata being rounded to integers

2012-07-30 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II resolved TIKA-915. --- Resolution: Fixed Moved the decimal formatting from the JpegParserTest to the GeotagHandler in

[jira] [Created] (TIKA-962) Backwards Compatibility for Metadata.LAST_AUTHOR is Broken

2012-07-30 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-962: - Summary: Backwards Compatibility for Metadata.LAST_AUTHOR is Broken Key: TIKA-962 URL: https://issues.apache.org/jira/browse/TIKA-962 Project: Tika Issue Type

[jira] [Commented] (TIKA-962) Backwards Compatibility for Metadata.LAST_AUTHOR is Broken

2012-07-30 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13425137#comment-13425137 ] Ray Gauss II commented on TIKA-962: --- This is fixed in r1367225 but I'll ho

[jira] [Created] (TIKA-963) Backwards Compatibility for Metadata.DATE is Incorrect

2012-07-30 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-963: - Summary: Backwards Compatibility for Metadata.DATE is Incorrect Key: TIKA-963 URL: https://issues.apache.org/jira/browse/TIKA-963 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-963) Backwards Compatibility for Metadata.DATE is Incorrect

2012-07-30 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13425157#comment-13425157 ] Ray Gauss II commented on TIKA-963: --- This is fixed in r1367249. We may want to inc

[jira] [Created] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-07-31 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-965: - Summary: Text Detection Fails on Mostly Non-ASCII UTF-8 Files Key: TIKA-965 URL: https://issues.apache.org/jira/browse/TIKA-965 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-07-31 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13425790#comment-13425790 ] Ray Gauss II commented on TIKA-965: --- I do have a test file and it's more than a

[jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-07-31 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13425981#comment-13425981 ] Ray Gauss II commented on TIKA-965: --- That's the solution I was looking into and

[jira] [Comment Edited] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-07-31 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13425981#comment-13425981 ] Ray Gauss II edited comment on TIKA-965 at 7/31/12 6:1

[jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-08-01 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426525#comment-13426525 ] Ray Gauss II commented on TIKA-965: --- Are we likely to run into similar issues with o

  1   2   3   >