On 11/27/13 2:40 PM, magnus.ly...@gmail.com wrote:
When I run e.g. compile('sin(5) * cos(6)', '<string>', 'eval').co_names, I get 
('sin', 'cos'), which is just what I expected.

But when I have a list comprehension in the expression, I get a little surprise:
compile('[x*x for x in y]',  '<string>', 'eval').co_names
('_[1]', 'y', 'x')


This happens in Python 2.6.6 on Red Hat Linux, but not when I run Python 2.7.3 
in Windows. Unfortunately I'm stuck with 2.6.

* Are there more surprises similar to this one that I can expect from 
compile(...).co_names? Is this "behaviour" documented somewhere?


That name is the name of the list being built by the comprehension, which I found out by disassembling the code object to see the bytecodes:

    >>> co = compile("[x*x for x in y]", "<s>", "eval")
    >>> co.co_names
    ('_[1]', 'y', 'x')
    >>> import dis
    >>> dis.dis(co)
      1           0 BUILD_LIST               0
                  3 DUP_TOP
                  4 STORE_NAME               0 (_[1])
                  7 LOAD_NAME                1 (y)
                 10 GET_ITER
            >>   11 FOR_ITER                17 (to 31)
                 14 STORE_NAME               2 (x)
                 17 LOAD_NAME                0 (_[1])
                 20 LOAD_NAME                2 (x)
                 23 LOAD_NAME                2 (x)
                 26 BINARY_MULTIPLY
                 27 LIST_APPEND
                 28 JUMP_ABSOLUTE           11
            >>   31 DELETE_NAME              0 (_[1])
                 34 RETURN_VALUE

The same list comprehension in 2.7 uses an unnamed list on the stack:

      1           0 BUILD_LIST               0
                  3 LOAD_NAME                0 (y)
                  6 GET_ITER
            >>    7 FOR_ITER                16 (to 26)
                 10 STORE_NAME               1 (x)
                 13 LOAD_NAME                1 (x)
                 16 LOAD_NAME                1 (x)
                 19 BINARY_MULTIPLY
                 20 LIST_APPEND              2
                 23 JUMP_ABSOLUTE            7
            >>   26 RETURN_VALUE

I don't know whether such facts are documented. They are deep implementation details, and change from version to version, as you've seen.

* Is there perhaps a better way to achieve what I'm trying to do?

What I'm really after, is to check that python expressions embedded in text 
files are:
- well behaved (no syntax errors etc)
- don't accidentally access anything it shouldn't
- I serve them with the values they need on execution

I hope you aren't trying to prevent malice this way: you cannot examine a piece of Python code to prove that it's safe to execute. For an extreme example, see: Eval Really Is Dangerous: http://nedbatchelder.com/blog/201206/eval_really_is_dangerous.html

In your environment it looks like you have a whitelist of identifiers, so you're probably ok.


So, in the case of "a.b + x" I'm really just interested in a and x, not b. So 
the (almost) whole story is that I do:

     # Find names not starting with ".", i.e a & b in "a.c + b"
     abbr_expr = re.sub(r"\.\w+", "", expr)
     names = compile(abbr_expr, '<string>', 'eval').co_names
     # Python 2.6 returns '_[1]' in co_names for list comprehension. Bug?
     names = [name for name in names if re.match(r'\w+$', name)]

     for name in names:
         if name not in allowed_names:
             raise NameError('Name: %s not permitted in expression: %s' % 
(name, expr))


I don't know of a better way to determine the real names in the expression. I doubt Python will insert a valid name into the namespace, since it doesn't want to step on real user names. The simplest way to do that is to autogenerate invalid names, like "_[1]" (I wonder why it isn't "_[0]"?)

--Ned.

--
https://mail.python.org/mailman/listinfo/python-list

Reply via email to