spec2k comparison of gcc 4.1 and 4.2 on AMD K8

2007-02-24 Thread Serge Belyshev
I have compared 4.1.2 release (r121943) with three revisions of 4.2 on spec2k on an 2GHz AMD Athlon64 box (in 64bit mode), detailed results are below. In short, current 4.2 performs just as good as 4.1 on this target with the exception of huge 80% win on 178.galgel. All other difference lies almos

ICE While Building 4.2 Branch

2007-02-24 Thread tbrowder
I checked out the gcc 4.2 branch because I want to start using the OpenMP (gomp) capability. When I tried to build it, I got an ICE somewhere in stage 3 and the message said to submit a bug report. I searched for a similar bug in bugzilla and decided to submit the bug report, but when I had to ch

gcc 4.1 and 4.2 comparison on SPEC2000 on PPC64

2007-02-24 Thread Vladimir N. Makarov
Here is the comparison of 4.1 and 4.2 branches (as of day before yesterday) on SPEC2000 for ppc64. In brief, gcc4.2 generates 3% faster code for SPECFP2000 and the same code for SPECInt2000. In average the generated SPECInt2000 code size is 0.5% smaller for gcc4.2. The SPECFp2000 code is 1.7% b

Re: Fold and integer types with sub-ranges

2007-02-24 Thread Richard Kenner
> Sure - I wonder if there is a reliable way of testing whether we face > a non-base type in the middle-end. I suppose TREE_TYPE (type) != NULL > won't work in all cases... (?) That's the right way as far as I know. > I agree. But appearantly fold does not care about base vs. non-base > types a

Re: Fold and integer types with sub-ranges

2007-02-24 Thread Richard Guenther
On Fri, 23 Feb 2007, Richard Kenner wrote: > > That said, this whole thing is a can of worms. Suppose the compiler wants > > to > > calculate t+1. Of course you do something like this: > > > > int_const_binop (PLUS_EXPR, t, build_int_cst (TREE_TYPE (t), 1), 0); > > > > But if 1 is not in the

Re: Auto Vectorizing help needed

2007-02-24 Thread [EMAIL PROTECTED]
You'll want to have a look at something like the SSE SIMD optimisation within the back end for the x86. I'd also download the GCC internals document for 4.3.x as it has some information on CPU SIMD optimisations. I've been reviewing this area heavily as I'm combining GCC and GPGPU (the same id