New submission from Mark Shannon <m...@hotpy.org>:
Currently the compiler operates in three main passes: Code-gen Optimize Assemble The problem is that these passes use the same basic-block based CFG, leading to unnecessary coupling and inefficiencies. A basic block CFG is awkward and error-prone for the code-gen, but not very efficient for the optimizer and assembler. A better design would be for the code-gen to create a single linear sequence of instructions. The optimizer would take this and produce a list of extended-blocks for the assembler to consume. code-gen -> (list of instructions) -> optimizer optimizer -> (list of extended blocks) -> assembler (Extended blocks have a single entry and multiple exits, unlike basic blocks which have a single entry and single exit) This would: 1. Reduce memory use considerably (the size of instruction and block data structures would be about 60% of current) 2. Be faster (Less CFG management). 3. Produce better code (extended blocks are a better unit for optimization that basic blocks). 4. Be easier to maintain: a) Code-gen wouldn't have to worry about creating a correct CFG. b) The optimizer wouldn't need to handle empty blocks and track which basic blocks form an extended block. Apart from the changes to the compiler, it would help if we made all branch instructions absolute (or have a backward dual) to accommodate free reordering of blocks in the optimizer. ---------- messages: 385033 nosy: Mark.Shannon priority: normal severity: normal status: open title: Split compiler into code-gen, optimizer and assembler. _______________________________________ Python tracker <rep...@bugs.python.org> <https://bugs.python.org/issue42926> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com