It seems as if the XML parser ClamAV is using has some parsing errors with this document variant. You could submit a bug report at bugzilla.clamav.net; attaching a sample would also help.
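
Before filing the report, it may be worth confirming that a sample really declares one encoding and is stored in another, since that is what the "Document labelled UTF-16 but has UTF-8 content" message describes. Below is a rough Python sketch of such a check. The function names are hypothetical, the detection is a simple heuristic (byte-order mark, then the byte pattern of the leading "<?xml"), and it is not how ClamAV or its XML library decides; it only looks at the first 256 bytes of the XML data.

```python
import re

# Byte-order marks are checked first; a BOM is the strongest hint
# about how the bytes are really encoded.
_BOMS = (
    (b"\xef\xbb\xbf", "utf-8"),
    (b"\xff\xfe", "utf-16"),
    (b"\xfe\xff", "utf-16"),
)

# Matches the encoding pseudo-attribute of an XML declaration.
_DECL_RE = re.compile(rb'<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z0-9._-]+)["\']')


def sniff_xml_encoding(data: bytes):
    """Return (declared, detected) for the start of an XML document.

    Either value is None when it cannot be determined.
    """
    head = data[:256]

    detected = None
    for bom, name in _BOMS:
        if head.startswith(bom):
            detected = name
            break
    if detected is None:
        # No BOM: UTF-16 text shows NUL bytes interleaved with "<?".
        if head[:4] in (b"<\x00?\x00", b"\x00<\x00?"):
            detected = "utf-16"
        elif head.startswith(b"<?xml"):
            # Plain ASCII start: UTF-8 or another ASCII-compatible encoding.
            detected = "utf-8"

    # Dropping NUL bytes lets one regex read the declaration in both cases.
    match = _DECL_RE.search(head.replace(b"\x00", b""))
    declared = match.group(1).decode("ascii").lower() if match else None
    return declared, detected


def has_encoding_mismatch(data: bytes) -> bool:
    """True when exactly one of declared/detected says UTF-16."""
    declared, detected = sniff_xml_encoding(data)
    if declared is None or detected is None:
        return False
    return declared.startswith("utf-16") != (detected == "utf-16")


# Example built from the declaration quoted in the log below.
sample = (b'<?xml version="1.0" encoding="utf-16"?>'
          b'<?mso-application progid="Excel.Sheet"?><Workbook/>')
print(sniff_xml_encoding(sample))      # ('utf-16', 'utf-8')
print(has_encoding_mismatch(sample))   # True
```

If the check reports a mismatch for the file, mentioning that in the bug report alongside the sample should make the problem easier to reproduce.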

-Kevin

On Fri, Apr 1, 2016 at 6:30 PM, David Shaw <ds...@jabberwocky.com> wrote:

> Hello,
>
> I am using ClamAV 0.99 on CentOS 7 (so clamav-0.99-2.el7.x86_64.rpm). I
> occasionally see MS Office files (in the new, XML format) that cannot be
> scanned, with this error:
>
> clamd[7726]: msxml.xml:1: parser error : Document labelled UTF-16 but has
> UTF-8 content
> clamd[7726]: <?xml version="1.0" encoding="utf-16"?><?mso-application
> progid="Excel.Sheet"?><
> clamd[7726]: ^
> clamd[7726]: fd[14]: Can't parse data ERROR
>
> Any suggestions where to go from here? The error itself seems fairly
> straightforward, but these are standard MS Office files, generated by MS
> Office, so it's not clear what, if anything, I can change on that side.
>
> David
>
> _______________________________________________
> Help us build a comprehensive ClamAV guide:
> https://github.com/vrtadmin/clamav-faq
>
> http://www.clamav.net/contact.html#ml