New submission from Mark Shannon <m...@hotpy.org>:

Currently the compiler operates in three main passes:

Code-gen
Optimize
Assemble

The problem is that these passes use the same basic-block based CFG, leading to 
unnecessary coupling and inefficiencies.
A basic block CFG is awkward and error-prone for the code-gen, but not very 
efficient for the optimizer and assembler.

A better design would be for the code-gen to create a single linear sequence of 
instructions. The optimizer would take this and produce a list of 
extended-blocks for the assembler to consume.

code-gen -> (list of instructions) -> optimizer
optimizer -> (list of extended blocks) -> assembler

(Extended blocks have a single entry and multiple exits, unlike basic blocks 
which have a single entry and single exit)

This would:
1. Reduce memory use considerably (the size of instruction and block data 
structures would be about 60% of current)
2. Be faster (Less CFG management).
3. Produce better code (extended blocks are a better unit for optimization that 
basic blocks).
4. Be easier to maintain:
  a) Code-gen wouldn't have to worry about creating a correct CFG.
  b) The optimizer wouldn't need to handle empty blocks and track which basic 
blocks form an extended block.


Apart from the changes to the compiler, it would help if we made all branch 
instructions absolute (or have a backward dual) to accommodate free reordering 
of blocks in the optimizer.

----------
messages: 385033
nosy: Mark.Shannon
priority: normal
severity: normal
status: open
title: Split compiler into code-gen, optimizer and assembler.

_______________________________________
Python tracker <rep...@bugs.python.org>
<https://bugs.python.org/issue42926>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to