Thank you!

                        -Kerry

From: Sean Kalynuk <[email protected]>
Sent: Wednesday, May 5, 2021 3:42 PM
To: Bouchard, Kerry <[email protected]>; DSpace Technical Support 
<[email protected]>
Subject: Re: [dspace-tech] How do I create an exclusion list for filter-media?

Hi Kerry,

There is a Skip mode option (-s) for the filter-media command:

https://wiki.lyrasis.org/display/DSDOC6x/Mediafilters+for+Transforming+DSpace+Content#MediafiltersforTransformingDSpaceContent-Executing(viaCommandLine)
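
As a rough sketch of how skip mode could serve as an exclusion list: -s takes a comma-separated list of identifiers (handles) to skip, so the list can be kept in a file and joined on the command line. The helper name skip_list_from_file, the file name filter-skiplist.txt, and the one-handle-per-line layout are assumptions for illustration, and the handles shown are placeholders rather than real items.

```shell
#!/bin/sh
# Sketch only: build the comma-separated identifier list that
# filter-media's skip mode (-s) expects, from a file that holds
# one handle per line. The helper name and file layout are hypothetical.

skip_list_from_file() {
    # Drop blank lines and '#' comment lines, then join the rest with commas.
    grep -v -e '^[[:space:]]*$' -e '^[[:space:]]*#' "$1" | paste -sd, -
}

# Example usage (placeholder handles; [dspace] is the installation directory):
#   printf '123456789/9\n123456789/100\n' > filter-skiplist.txt
#   [dspace]/bin/dspace filter-media -s "$(skip_list_from_file filter-skiplist.txt)"
```

Keeping the handles in a file means the problem PDFs can be added to or removed from the list without editing the scheduled filter-media command itself.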

--
Sean

From: [email protected] <[email protected]> on behalf 
of Kerry Bouchard <[email protected]>
Date: Wednesday, May 5, 2021 at 3:20 PM
To: DSpace Technical Support <[email protected]>
Subject: [dspace-tech] How do I create an exclusion list for filter-media?

We are running into the problem described here: 
http://dspace.2283337.n4.nabble.com/Filter-media-on-PDFs-exported-from-Outlook-causes-a-TikaException-error-and-prevents-Items-from-inde-td4683489.html
The *.pdf.txt files output by the PDF Text Extractor media filter for a couple 
of PDFs in our repository cause indexing to fail, not just for the PDF full 
text but for all the associated metadata as well. (In our case, the PDFs were 
not output from Microsoft Outlook mail folders, but I'm seeing the same 
"org.apache.tika.exception.TikaException: Failed to parse an email message" in 
the dspace log file.)

The posting at the URL above refers to a workaround that involves creating an 
exclusion list for filter-media, but I can't find any documentation on how to 
create one. Can someone point me to that?

Thanks, Kerry
--
All messages to this mailing list should adhere to the Code of Conduct: 
https://duraspace.org/about/policies/code-of-conduct/
---
You received this message because you are subscribed to the Google Groups 
"DSpace Technical Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/dspace-tech/85e9b754-31d4-4558-8bde-071facdf9d0bn%40googlegroups.com.

To view this discussion on the web visit 
https://groups.google.com/d/msgid/dspace-tech/ca7904aadba04262a99a2592446fbf61%40tcu.edu.
