[ 
https://issues.apache.org/jira/browse/TIKA-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated TIKA-1014:
--------------------------------

    Attachment: TIKA-1014-CustomMimeTypesReader.patch

updated patch with an error handling example.

In this example, the custom reader reads two files that have conflicting glob 
patters.  The custom handler just logs the warning without failing:

{code:java}
@Override
    protected void handleError(MimeType type, String err, Exception ex, String 
qName, Attributes attributes) throws SAXException {
      if(ERROR_BAD_GLOB.equals(err)) {
        ignorePatterns.add( type.toString() + ">>" + 
attributes.getValue(PATTERN_ATTR));
      }
      else {
        throw new SAXException(ex);
      }
    }
{code}


                
> Allow custom MimeTypesReader
> ----------------------------
>
>                 Key: TIKA-1014
>                 URL: https://issues.apache.org/jira/browse/TIKA-1014
>             Project: Tika
>          Issue Type: New Feature
>          Components: mime
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: TIKA-1014-CustomErrorHandling.patch, 
> TIKA-1014-CustomMimeTypesReader.patch, TIKA-1014-CustomMimeTypesReader.patch, 
> TIKA-1014-CustomMimeTypesReader.patch
>
>
> The current MimeTypesReader is package protected with private fields.  It 
> would be great to allow subclassing this Reader to support custom behavior.  
> In particular:
>  - error handling
>  - custom XML tags

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to