Re: Help with pipes, buffering and pseudoterminals

Cameron Simpson Sun, 05 Apr 2015 17:14:42 -0700

On 05Apr2015 12:20, Daniel Ellis <ellis...@gmail.com> wrote:

I have a small little tool I'd like to make.  It essentially takes piped input, 
modifies the text in some way, and immediately prints the output.  The problem 
I'm having is that any output I pipe to the program seems to be buffered, 
removing the desired effect.

That depends on the upstream program; it does the buffering. The pipe itselfpresents received data downstream immediately.

However, as you've seen, almost every program buffers its standard output ifthe output is not a tty; this is automatic in the stdio C library and resultsin more fficient use of I/O.

From what I understand, I need to somehow have the input be retrieved via a 
pseudoterminal.

This is a rather gross hack, though sometimes all you can do. While someprograms have an option to force unbuffered output, most do not. Attachingtheir output to a pty is one way to encourage them to at least line buffertheir output.

However, you should bear in mind that the reason that programs line buffer to aterminal is that they presume they are in an interactive situation with aperson watching. The program _may_ act differently in other ways as well, suchas asking question it might not otherwise ask in "batch" mode (where it mightcautiously not ask and presume "no").

Also, output sent through a pty is subject to the line discipline in theterminal; temrinals are funny things with much historical behaviour. At theleast you pobably want your pty in "raw" mode to avoid all sorts of stuff thatcan be done to your data.

The problem that I'm having is that most examples on the internet seem to 
assume I would like to launch a program in a forked pty process, which doesn't 
really fit my use case.


Indeed not, but not to worry. You don't need to fork.

I've tried a number of things, but I seem to be unable to get even a basic 
understanding of how to use the pty module.

Have you every used a pty from C? Do you know how ptys work? (master side,slave side, etc).

Here's a piece of code I whipped up just to try to get a feel for what is going 
on when I use pty.fork, but it doesn't seem to do what I think it should:

   import pty
   import os
   import sys

   pid, fd = pty.fork()
   print pid, fd
   sys.stdout.flush()
   os.read(fd, 1024)

This only seems to print from the parent process.


The documentation for pty.fork says:

 Return value is (pid, fd). Note that the child gets pid 0, and the fd is 
invalid.

So the child cannot used "fd". It further says that the child has its stdin andstdout attached to the pty, and that the pty is the child's controllingterminal (this means it is affected by things like "typing" ^C at the pty,etc).

I read that I need to do the os.read call for the fork to happen. I've also

tried printing *after* the os.read call.

Don't try to adapt fork-based tutorials to your needs. Understand ptys directlyfirst.

I realize this does very little to solve my overall goal, but I figure 
understanding what is going on is probably a worthwhile first step.

What you probably want to use is pty.openpty() instead. No fork. You will getback file descriptors for the master and slave sides of the pty. Then you canuse these with the subprocess module to connect your input program. Or,guessing from your opening sentence, you can write a wrapper script whose wholepurpose is to run a program on a pty.

Regarding terminology: a pseudoterminal (pty) is a device that looks like atraditional serial terminal. All terminal emulators like xterm use one, and sodo other programs presenting a terminal session such as the sshd processhandling an interactive remote login.

When you call pty.openpty() you are handed two file descriptors: one for themaster side of the pty and one for the slave side. The slave side is the sidethat looks like a terminal, and is what a typical use would connect a childprocess to. The master side is the other side of the pty. When a program writesto the "slave" side, the output is available for read on the master side, muchlike a pipe. When a program writes to the master side, the output is availablefor read on the slave side, _as_ _if_ _typed_ at the terminal.

A pty is not necessarily going to solve your problem unless you can get yourinput via the pty. From the sounds of it you're in this situation:


 command-generating-output | your-program

such that your input is attached to a pipe, and because"command-generating-output" is attached to a pipe it is block buffering itsoutput, hence your problem.


You can't undo that situation after the fact.

To solve your problem via a pty you need to contrive to set up"command-generating-output" already attached to a pty. One way to do that isfor "your-program" to open a pty and itself invoke "command-generating-output"with its output via the pty, which is why so many tutorials suppose a "fork"situation.

One typical away to do that is to pass the "command-generating-output" commandname and args to your program, eg:


 your-program command-generating-output [args...]

Then your main program can gather that up:

 import sys
 command_generating_output = sys.argv[1:]

Then you can call pty.openpty(), and then use the slave file descriptor withsubprocess.Popen to invoke command_generating_output. Thus the generatingcommand will be talking to you via a pty instead of a pipe.


Cheers,
Cameron Simpson <c...@zip.com.au>

It is interesting to think of the great blaze of heaven that we winnow
down to animal shapes and kitchen tools.        - Don DeLillo
--
https://mail.python.org/mailman/listinfo/python-list

Re: Help with pipes, buffering and pseudoterminals

Reply via email to