Question about asyncio and blocking operations

Frank Millman Sat, 23 Jan 2016 06:40:55 -0800

Hi all

I am developing a typical accounting/business application which involves afront-end allowing clients to access the system, a back-end connecting to adatabase, and a middle layer that glues it all together.

Some time ago I converted the front-end from a multi-threaded approach to anasyncio approach. It was surprisingly easy, and did not require me to delveinto asyncio too deeply.

There was one aspect that I deliberately ignored at that stage. I did notchange the database access to an asyncio approach, so all readingfrom/writing to the database involved a blocking operation. I am now readyto tackle that.

I find I am bumping my head more that I expected, so I thought I would tryto get some feedback here to see if I have some flaw in my approach, or ifit is just in the nature of writing an asynchronous-style application.

Here is the difficulty. The recommended way to handle a blocking operationis to run it as task in a different thread, using run_in_executor(). Thismethod is a coroutine. An implication of this is that any method that callsit must also be a coroutine, so I end up with a chain of coroutinesstretching all the way back to the initial event that triggered it. I canunderstand why this is necessary, but it does lead to some awkwardprogramming.

I use a cache to store frequently used objects, but I wait for the firstrequest before I actually retrieve it from the database. This is how itworked -


# cache of database objects for each company
class DbObject(dict):
   def __missing__(self, company):
       db_object = self[company] = get_db_object _from_database()
       return db_object
db_objects = DbObjects()

Any function could ask for db_cache.db_objects[company]. The first time itwould be read from the database, on subsequent requests it would be returnedfrom the dictionary.


Now get_db_object_from_database() is a coroutine, so I have to change it to
       db_object = self[company] = await get_db_object _from_database()

But that is not allowed, because __missing__() is not a coroutine.

I fixed it by replacing the cache with a function -

# cache of database objects for each company
db_objects = {}
async def get_db_object(company):
   if company not in db_objects:

db_object = db_objects[company] = await get_db_object_from_database()

   return db_objects[company]

Now the calling functions have to call 'awaitdb_cache.get_db_object(company)'


Ok, once I had made the change it did not feel so bad.

Now I have another problem. I have some classes which retrieve some datafrom the database during their __init__() method. I find that it is notallowed to call a coroutine from __init__(), and it is not allowed to turn__init__() into a coroutine.

I imagine that I will have to split __init__() into two parts, put thedatabase functionality into a separately-callable method, and then gothrough my app to find all occurrences of instantiating the object andfollow it with an explicit call to the new method.

Again, I can handle that without too much difficulty. But at this stage I donot know what other problems I am going to face, and how easy they will beto fix.

So I thought I would ask here if anyone has been through a similar exercise,and if what I am going through sounds normal, or if I am doing somethingfundamentally wrong.


Thanks for any input

Frank Millman


--
https://mail.python.org/mailman/listinfo/python-list

Question about asyncio and blocking operations

Reply via email to