Re: piping input to an external script

Dave Angel Tue, 12 May 2009 00:25:58 -0700

Tim Arnold wrote:

Hi, I have some html files that I want to validate by using an externalscript 'validate'. The html files need a doctype header attached beforevalidation. The files are in utf8 encoding. My code:
---------------
import os,sys
import codecs,subprocess
HEADER = '<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">'
filename  = 'mytest.html'
fd = codecs.open(filename,'rb',encoding='utf8')
s = HEADER + fd.read()
fd.close()

p = subprocess.Popen(['validate'],
                    stdin=subprocess.PIPE,
                    stdout=subprocess.PIPE,
                    stderr=subprocess.STDOUT)
validate = p.communicate(unicode(s,encoding='utf8'))
print validate
---------------

I get lots of lines like this:
Error at line 1, character 66:\tillegal character number 0
etc etc.
But I can give the command in a terminal 'cat mytest.html | validate' andget reasonable output. My subprocess code must be wrong, but I could usesome help to see what the problem is.
python2.5.1, freebsd6
thanks,
--Tim

The usual rule in debugging: split the problem into two parts, and testeach one separately, starting with the one you think most likely to bethe culprit

In this case the obvious place to split is with the data you're passingto the communicate call. I expect it's already wrong, long before youhand it to the subprocess. So write it to a file instead, and inspectit with a binary file viewer. And of course test it manually with yourvalidate program. Is validate really expecting a Unicode stream in stdin ?



--
http://mail.python.org/mailman/listinfo/python-list

Re: piping input to an external script

Reply via email to