Re: 16 byte alignment hint for sse vectorization

2006-11-05 Thread Michael James
arrays are passed as pointer-to-struct with the array element in the struct specified with an alignment attribute. I have not tested method 2; it seems like a transformation which may work despite being unaesthetic. Regards, Michael James On 11/5/06, Dorit Nuzman <[EMAIL PROTECTED]>

16 byte alignment hint for sse vectorization

2006-11-04 Thread Michael James
Hello, I have been playing with gcc's new (to me) auto vectorization optimizations. I have a particular loop for which I have made external provisions to ensure that the data is 16-byte aligned. I have tried everything I can think of to give gcc the hint that it is operating on aligned data, but

Re: Mapping NAN to ZERO / When does gcc generate MOVcc and FCMOVcc instructions?

2006-11-03 Thread Michael James
for(i=i0; i, . I get the same results either way. Again, help is appreciated. -- Thanks. Regards, Michael James

Mapping NAN to ZERO / When does gcc generate MOVcc and FCMOVcc instructions?

2006-11-02 Thread Michael James
Hello, I am trying to get gcc to optimize an inner math loop. The first part of the loop computes a single precision float expression (which may or may not be NAN), and the second part sums all of these results into a double precision total: Conceptually, the code is: double sum = 0; for(i=0;