http://gcc.gnu.org/bugzilla/show_bug.cgi?id=47059
--- Comment #2 from Rahul Kharche 2011-01-15
12:43:27 UTC ---
This issue also exists on the trunk. I am in the process of bootstrap testing
this for i686-pc-linux-gnu. I will send out this patch once it checks out.
The attached patch is Vs 4.5.2
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=47059
--- Comment #1 from Rahul Kharche 2011-01-15
12:32:01 UTC ---
Created attachment 22974
--> http://gcc.gnu.org/bugzilla/attachment.cgi?id=22974
Patch Vs 4.5.2 Rev 167088
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=47059
Summary: compiler fails to coalesce loads/stores
Product: gcc
Version: 4.5.1
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
AssignedTo
--- Comment #2 from rahul at icerasemi dot com 2010-08-06 08:01 ---
Confirmed, fix for PR41317 avoids forwarding ARRAY_REFs to their use and fixes
this issue. Does this fix hinder any optimizations?
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45195
Product: gcc
Version: 4.4.1
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: rahul at icerasemi dot com
GCC build triplet: i686-pc-linux
GCC host triplet: i6
--- Comment #3 from rahul at icerasemi dot com 2010-03-26 12:25 ---
The following test in 'rest_of_handle_reorder_blocks'
if ((flag_reorder_blocks || flag_reorder_blocks_and_partition)
&& optimize_function_for_speed_p (cfun))
{ ... }
suggests when optimize_size
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: rahul at icerasemi dot com
GCC build triplet: i686-pc-linux-gnu
GCC host triplet: i686-pc-linux-gnu
GCC target triplet: i686-pc-linux-gnu
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43515
--- Comment #32 from rahul at icerasemi dot com 2010-01-11 12:34 ---
I will re-test on our port and report my findings, cheers!
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=20070
--- Comment #1 from rahul at icerasemi dot com 2010-01-05 11:30 ---
Accidentally added due to browser refresh. Bug is duplicate of PR42614.
*** This bug has been marked as a duplicate of 42614 ***
--
rahul at icerasemi dot com changed:
What|Removed
--- Comment #3 from rahul at icerasemi dot com 2010-01-05 11:30 ---
*** Bug 42620 has been marked as a duplicate of this bug. ***
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=42614
inlining works okay?
--
Summary: FRE optimizes away valid code after IPA inlining
Product: gcc
Version: 4.4.1
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc do
inlining works okay?
--
Summary: FRE optimizes away valid code after IPA inlining
Product: gcc
Version: 4.4.1
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc do
: gcc
Version: 4.4.1
Status: UNCONFIRMED
Severity: minor
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: rahul at icerasemi dot com
GCC build triplet: i686-pc-linux-gnu
--- Comment #1 from rahul at icerasemi dot com 2009-09-28 12:45 ---
See http://gcc.gnu.org/ml/gcc/2009-09/msg00432.html for some followup.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=41488
y: IVOpts cannot coalesce multiple induction variables
Product: gcc
Version: 4.4.1
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: rahul
--- Comment #28 from rahul at icerasemi dot com 2009-09-25 17:10 ---
Sorry, I also had changes to move loop header copying before FRE from
http://gcc.gnu.org/ml/gcc/2009-09/msg00434.html.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=23821
--- Comment #25 from rahul at icerasemi dot com 2009-09-25 14:26 ---
Do the fixes in comment #11 and #24 alone solve the missed induction variable
problem?
I'm using the 4.4.1 release branch and it doesn't seem to work for me.
After DOM i get
# i_10 = PHI
i_5 = i_10 + 1
--- Comment #6 from rahul at icerasemi dot com 2009-09-11 10:03 ---
An interesting regression results as a side effect of loop header copying (this
occurs even in vanilla O2). If I modify my original test case to
struct struct_t {
int* data;
};
void testAddr (struct struct_t* sp
--- Comment #29 from rahul at icerasemi dot com 2009-09-04 14:51 ---
I am testing Steven's Crossjumping patch attached here. With CoreMark we see a
1% increase in performance when using Os. Other proprietary tests show ~0.5%
decrease in code size.
The path however does not fix PR
--- Comment #4 from rahul at icerasemi dot com 2009-08-13 15:46 ---
Confirmed. Introducing loop header copy for Os, resolves the problem.
On our port, this not only helps move the invariant load outside the loop, but
also correctly uses an auto-increment address mode via the AutoInc
Version: 4.4.1
Status: UNCONFIRMED
Severity: enhancement
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: rahul at icerasemi dot com
GCC build triplet: i686-pc-linux
GCC host triplet: i686-pc-l
--- Comment #15 from rahul at icerasemi dot com 2009-06-11 17:38 ---
GCC4.4 is still missing this fix. GCC-4.4.1 (20090507) on x86_64 produces the
following with O2/O3
kernel:
pushl %ebp
movl%esp, %ebp
subl$24, %esp
movl$1, (%esp
--- Comment #11 from rahul at icerasemi dot com 2009-05-07 15:57 ---
Confirmed issue resolved.
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40057
--- Comment #1 from rahul at icerasemi dot com 2009-05-07 11:11 ---
Suspect tree-ter optimisation pass. Compiling with -O1 -fno-tree-ter produces
the right result. Using -fdump-tree-optimized shows SSA-Gimple to change from
shiftTest (const ulonglong var)
{
int D.1842;
:
if (var
correct right shift by 31 with long long
Product: gcc
Version: 4.4.0
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: rahul at icerasemi dot
25 matches
Mail list logo