On Fri, Mar 7, 2008 at 11:02 AM, Philipp Marek <[EMAIL PROTECTED]> wrote:
> Hello everybody,
>
>  I have a feature request.
>
>
>  I'd like to (manually) define some byte blocks, eg. as functions with an
>  identifier.  Then, if GCC would emit exactly these bytes, it puts a JMP
>  identifier there instead.
>
>  This would help by sharing many identical (text/code) byte sequences, to
>  make a smaller image - like -fmerge-constants does for data - and therefor
>  helps for cache locality, too.
>
>
>  I wrote some perl scripts to test this.  I took a "alldefconfig" i686
>  kernel, let objdump disassemble it, and on "iret", "ret", "ljmp" or "jmp"
>  with a 4 byte address I store the last few bytes.  Another script goes
>  through these block-ends, and estimates the number of bytes saved, if
>  identical sequences get changed into a 5 byte opcode (jump with 32bit
>  address).
>
>
>  The original kernel has
>   $ size vmlinux
>      text    data     bss     dec     hex  filename
>    4802166  446554  606208 5854928  5956d0 vmlinux
>
>  and my scripts emit space savings of about 30kB.  Now that's not
>  that much, but for embedded systems it means another userspace
>  binary more.
>   [ If anyone's interested, I can post my scripts here ... they're not
>     that large. ]
>
>  And, of course, the same can be used for other binaries too - which could
>  save other space as well.  (blender-bin, text size 8664025, gives 97KB
>  savings)
>
>
>  I'd imagine the use that such:
>  - Built the binary
>  - Run my scripts, to find some common code sequences
>  - Give them to gcc
>  - Rebuild, using the optimizations
>
>  I don't know whether some new optimize switch could be used, to do all
>  that internally in the compiler and linker.
>
>
>  What do you think about that?
>  [ Please keep at least *me* cc'ed - don't know whether that's that
>   interesting for linux-tiny. ]

Sounds like what -frtl-abstract-sequences is trying to do.

Richard.

Reply via email to