Hello everyone, 

my name is Michael, I'm the lead developer of a Python GUI automation library 
for Windows called Automa: http://www.getautoma.com. We want to add some 
features to our library but are unsure how to best expose them via our API. It 
would be extremely helpful for us if you could let us know which API design 
feels "right" to you.

Our API already offers very simple commands for automating the GUI of a Windows 
computer. For example:

        from automa.api import *
        start("Notepad")
        write("Hello World!")
        press(CTRL + 's')
        write("test.txt", into="File name")
        click("Save")
        click("Close")

When you execute this script, Automa starts Notepad and simulates key strokes, 
mouse movements and clicks to perform the required commands. At the moment, 
each action is performed in the currently active window. 

We do not (yet) have a functionality that allows you to explicitly switch to a 
specific window. Such a functionality would for instance make it possible to 
open two Notepad windows using the start(...) command, and copy text between 
them.

One API design would be to have our start(...) function return a "Window" (say) 
object, whose methods allow you to do the same operations as the global 
functions write(...), press(...), click(...) etc., but in the respective 
window. In this design, the example of operating two Notepad windows could be 
written as

        notepad_1 = start("Notepad")
        notepad_2 = start("Notepad")
        notepad_1.write("Hello World!")
        notepad_1.press(CTRL + 'a', CTRL + 'c')
        notepad_2.press(CTRL + 'v')

The problem with this design is that it effectively duplicates our API: We want 
to keep our "global" functions because they are so easy to read. If we add 
methods to a new "Window" class that do more or less the same, we feel that we 
are violating Python's principle that "There should be one - and preferably 
only one - obvious way to do it."

An alternative design would be to make the window switching an explicit action. 
One way of doing this would be to add a new global function, say "switch_to" or 
"activate", that takes a single parameter that identifies the window to be 
switched to. We could still have start(...) return a Window object, that could 
then be passed to our function:

        notepad_1 = start("Notepad")
        notepad_2 = start("Notepad")
        switch_to(notepad_1)
        write("Hello World!")
        press(CTRL + 'a', CTRL + 'c')
        switch_to(notepad_2)
        press(CTRL + 'v')

Maybe our Window objects could also be used as context managers:

        notepad_1 = start("Notepad")
        notepad_2 = start("Notepad")
        with notepad_1:
                write("Hello World!")
                press(CTRL + 'a', CTRL + 'c')
        with notepad_2:
                press(CTRL + 'v')

As a final idea, switching could also be done as a method of the Window class:

        notepad_1 = start("Notepad")
        notepad_2 = start("Notepad")
        notepad_1.activate()
        write("Hello World!")
        press(CTRL + 'a', CTRL + 'c')
        notepad_2.activate()
        press(CTRL + 'v')

It would be extremely helpful for us if you could let me know which way of 
using the API you would prefer. If you opt for an explicit version, how would 
you call the respective method? "activate" / "switch_to" / "focus" or something 
else?

Thank you so much!

Best wishes,
Michael
-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to