It seems as if the xml parser ClamAV is has some parsing errors in regard
to this document variant. You could submit a bug report at
bugzilla.clamav.net; attaching a sample would also help.

-Kevin

On Fri, Apr 1, 2016 at 6:30 PM, David Shaw <ds...@jabberwocky.com> wrote:

> Hello,
>
> I am using ClamAV 0.99 on CentOS 7 (so clamav-0.99-2.el7.x86_64.rpm).  I
> occasionally see MS Office files (in the new, XML format) that cannot be
> scanned, with this error:
>
> clamd[7726]: msxml.xml:1: parser error : Document labelled UTF-16 but has
> UTF-8 content
> clamd[7726]: <?xml version="1.0" encoding="utf-16"?><?mso-application
> progid="Excel.Sheet"?><
> clamd[7726]: ^
> clamd[7726]: fd[14]: Can't parse data ERROR
>
> Any suggestions where to go from here?  The error itself seems fairly
> straightforward, but these are standard MS Office files, generated by MS
> Office, so it's not clear what, if anything, I can change on that side.
>
> David
>
> _______________________________________________
> Help us build a comprehensive ClamAV guide:
> https://github.com/vrtadmin/clamav-faq
>
> http://www.clamav.net/contact.html#ml
>
_______________________________________________
Help us build a comprehensive ClamAV guide:
https://github.com/vrtadmin/clamav-faq

http://www.clamav.net/contact.html#ml

Reply via email to