https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80874
--- Comment #1 from denis.campredon at gmail dot com ---
Sorry, minmax3 should not produce the same asm, since minmax returns a pair of
const references.
But still the code is less than optimal.
Part of the reason might be that gcc is not
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80876
Markus Trippelsdorf changed:
What|Removed |Added
Status|UNCONFIRMED |NEW
Last reconfirmed|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80877
Bug ID: 80877
Summary: Derived template class can access base class's private
constexpr/const static fields
Product: gcc
Version: 6.2.0
Status: UNCONFIRMED
Seve
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80876
Bug ID: 80876
Summary: [8 Regression] ICE in verify_loop_structure, at
cfgloop.c:1644 (error: loop 1's latch does not have an
edge to its header)
Product: gcc
Ver
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80822
--- Comment #3 from Nathan Weeks ---
Setting OMP_DISPLAY_ENV=verbose results in the following output with Intel
17.0.2:
OPENMP DISPLAY ENVIRONMENT BEGIN
_OPENMP=
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79801
Alan Modra changed:
What|Removed |Added
Status|UNCONFIRMED |RESOLVED
Resolution|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80544
--- Comment #6 from Jonathan Wakely ---
GCC now accepts the original testcase, and with -Wignored-qualifiers (which is
included in -Wextra) prints:
q.cc: In function ‘int main()’:
q.cc:8:30: warning: type qualifiers ignored on cast result type
[
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80868
--- Comment #3 from George Burgess IV ---
Thanks for the response!
From the standpoint of consistency, I agree.
My point is more that GCC isn't bound by the standard to be as strict with
`typeof`, and making an exception for `typeof` here would
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80544
Jonathan Wakely changed:
What|Removed |Added
Status|NEW |RESOLVED
Resolution|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80544
--- Comment #4 from Jonathan Wakely ---
Author: redi
Date: Wed May 24 22:16:59 2017
New Revision: 248432
URL: https://gcc.gnu.org/viewcvs?rev=248432&root=gcc&view=rev
Log:
PR c++/80544 strip cv-quals from cast results
gcc/cp:
PR c++/80
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80867
Matthias Klose changed:
What|Removed |Added
Status|WAITING |UNCONFIRMED
CC|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80731
Martin Sebor changed:
What|Removed |Added
Status|ASSIGNED|RESOLVED
Resolution|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80731
--- Comment #3 from Martin Sebor ---
Author: msebor
Date: Wed May 24 22:07:21 2017
New Revision: 248431
URL: https://gcc.gnu.org/viewcvs?rev=248431&root=gcc&view=rev
Log:
PR c/80731 - poor -Woverflow warnings
gcc/c-family/ChangeLog:
PR
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80846
--- Comment #2 from Peter Cordes ---
(In reply to Richard Biener from comment #1)
> That is, it was supposed to end up using pslldq
I think you mean PSRLDQ. Byte zero is the right-most when drawn in a way that
makes bit/byte shift directions al
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80803
Bill Schmidt changed:
What|Removed |Added
Summary|libgo appears to be |libgo appears to be
|mi
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80803
--- Comment #12 from Ian Lance Taylor ---
A global variable that cannot be statically initialized would be initialized
by a function named "net..import", invoked before the Go main function starts.
Since the net.IPv4 function is trivial, it is
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80803
--- Comment #11 from boger at us dot ibm.com ---
The first failure happens in TestParseIP from ip_test.go because the "out"
entries in the var parseIPTests are not initialized correctly. This causes the
failures because the actual value (which is
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=37131
Bug 37131 depends on bug 66094, which changed state.
Bug 66094 Summary: Handle transpose(A) in inline matmul
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66094
What|Removed |Added
---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66094
Thomas Koenig changed:
What|Removed |Added
Status|NEW |RESOLVED
Resolution|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66094
--- Comment #10 from Thomas Koenig ---
Author: tkoenig
Date: Wed May 24 18:44:35 2017
New Revision: 248425
URL: https://gcc.gnu.org/viewcvs?rev=248425&root=gcc&view=rev
Log:
2017-05-24 Thomas Koenig
PR fortran/66094
* fronten
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80875
Marek Polacek changed:
What|Removed |Added
Status|NEW |ASSIGNED
Assignee|unassigned a
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80875
Marek Polacek changed:
What|Removed |Added
Keywords||ice-on-valid-code
Target Milestone|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80875
--- Comment #2 from Marek Polacek ---
commit 0123775a88c6cf1035e4633fde7823a3e9889809
Author: rguenth
Date: Wed Oct 28 13:41:25 2015 +
2015-10-28 Richard Biener
* fold-const.c (negate_expr_p): Adjust the division case
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80875
Marek Polacek changed:
What|Removed |Added
Status|UNCONFIRMED |NEW
Last reconfirmed|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80875
Bug ID: 80875
Summary: UBSAN: compile time crash in fold_binary_loc at
fold-const.c:9817
Product: gcc
Version: 8.0
Status: UNCONFIRMED
Severity: normal
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78591
--- Comment #1 from Paolo Carlini ---
The released 7.1.0 doesn't ICE.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80754
--- Comment #5 from Wilco ---
Author: wilco
Date: Wed May 24 17:06:55 2017
New Revision: 248424
URL: https://gcc.gnu.org/viewcvs?rev=248424&root=gcc&view=rev
Log:
When lra-remat rematerializes an instruction with a clobber, it checks
that the cl
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71451
Paolo Carlini changed:
What|Removed |Added
CC||paolo.carlini at oracle dot com
--- Comm
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #19 from Thorsten Kurth ---
Thank you very much. I am sorry that I do not have a simpler test case. The
kernel which is executed is in the same directory as ABecLaplacian and called
MG_3D_cpp.cpp.
We have seen similar problems with
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #18 from Jakub Jelinek ---
Ok, I'll grab your git code and will have a look tomorrow at what's going on.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #17 from Thorsten Kurth ---
The result is correct, though: I verified that both codes generate the correct
output.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #16 from Thorsten Kurth ---
FYI, the code is:
https://github.com/zronaghi/BoxLib.git
in branch
cpp_kernels_openmp4dot5
and then in Src/LinearSolvers/C_CellMG
the file ABecLaplacian.cpp. For example, lines 542 and 543 can be comme
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #15 from Thorsten Kurth ---
The code I care about definitely has optimization enabled. For the Fortran
stuff it does (for example):
ftn -g -O3 -ffree-line-length-none -fno-range-check -fno-second-underscore
-Jo/3d.gnu.MPI.OMP.EXE -
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #14 from Jakub Jelinek ---
(In reply to Thorsten Kurth from comment #13)
> the compiler options are just -fopenmp. I am sure it does not have to do
> anything with vectorization as I compare the code runtime with and without
> the tar
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=28004
Thomas Koenig changed:
What|Removed |Added
Last reconfirmed|2007-07-03 21:06:36 |2017-5-24
--- Comment #12 from Thomas Ko
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #13 from Thorsten Kurth ---
Hello Jakub,
the compiler options are just -fopenmp. I am sure it does not have to do
anything with vectorization as I compare the code runtime with and without the
target directives and thus vectorization
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #12 from Jakub Jelinek ---
(In reply to Thorsten Kurth from comment #11)
> yes, you are right. I thought that map(tofrom:) is the default mapping
> but I might be wrong. In any case, teams is always 1. So this code is
Variables t
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #11 from Thorsten Kurth ---
Hello Jakub,
yes, you are right. I thought that map(tofrom:) is the default mapping but
I might be wrong. In any case, teams is always 1. So this code is basically
just data streaming so there is no n
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #10 from Jakub Jelinek ---
(In reply to Thorsten Kurth from comment #7)
> Hello Jakub,
>
> thanks for your comment but I think the parallel for is not racy. Every
> thread is working on a block of i-indices so that is fine. The dotprod
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #9 from Thorsten Kurth ---
Sorry, in the second run I set the number of threads to 12. I think the code
works as expected.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #8 from Thorsten Kurth ---
Here is the output of the get_num_threads section:
[tkurth@cori02 omp_3_vs_45_test]$ export OMP_NUM_THREADS=32
[tkurth@cori02 omp_3_vs_45_test]$ ./nested_test_omp_4dot5.x
We got 1 teams and 32 threads.
and
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #7 from Thorsten Kurth ---
Hello Jakub,
thanks for your comment but I think the parallel for is not racy. Every thread
is working on a block of i-indices so that is fine. The dotprod kernel is actually
a kernel from the OpenMP standard
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #6 from Jakub Jelinek ---
movq/pushq etc. aren't that expensive; if it affects performance it must be
something in the inner loops. A compiler switch that ignores omp target, teams
and distribute would basically create a new OpenMP v
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #5 from Thorsten Kurth ---
To clarify the problem:
I think that the additional movq, pushq and other instructions generated when
using the target directive can cause a big hit on the performance. I understand
that these instructions a
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #4 from Thorsten Kurth ---
Created attachment 41415
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=41415&action=edit
Testcase
This is the test case. The files ending in .as contain the assembly code with
and without target regi
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78379
--- Comment #36 from Jerry DeLisle ---
Results look very good.
Gfortran 7, no patch gives:
$ gfc7 -static -Ofast -ftree-vectorize compare.f90
$ ./a.out
=
ME
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #3 from Thorsten Kurth ---
Created attachment 41414
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=41414&action=edit
OpenMP 4.5 Testcase
This is the source code
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80843
Segher Boessenkool changed:
What|Removed |Added
CC||segher at gcc dot gnu.org
--- Comme
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80860
Segher Boessenkool changed:
What|Removed |Added
Status|ASSIGNED|RESOLVED
Resolution|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80843
--- Comment #1 from Segher Boessenkool ---
Author: segher
Date: Wed May 24 14:33:11 2017
New Revision: 248421
URL: https://gcc.gnu.org/viewcvs?rev=248421&root=gcc&view=rev
Log:
rs6000: Fix for separate shrink-wrapping for fp (PR80860, PR80843)
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80860
--- Comment #5 from Segher Boessenkool ---
Author: segher
Date: Wed May 24 14:33:11 2017
New Revision: 248421
URL: https://gcc.gnu.org/viewcvs?rev=248421&root=gcc&view=rev
Log:
rs6000: Fix for separate shrink-wrapping for fp (PR80860, PR80843)
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78379
--- Comment #35 from Jerry DeLisle ---
(In reply to Thomas Koenig from comment #34)
> Created attachment 41410 [details]
> Patch which has all the files
>
> Well, I suspect my way of splitting the previous patch into
> one real patch and one *.t
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80874
Bug ID: 80874
Summary: gcc does not emit cmov for minmax
Product: gcc
Version: 8.0
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
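For readers unfamiliar with the distinction raised in the follow-up comment on this report, here is a minimal sketch of the two shapes of "minmax" being discussed. It is not the reporter's testcase: minmax_by_value and check_minmax_forms are hypothetical names introduced only for illustration. The sketch assumes nothing about which instructions GCC emits for either form; it only shows that std::minmax on two arguments returns a pair of const references, while a hand-written helper can return the values themselves.

```cpp
#include <algorithm>
#include <cassert>
#include <type_traits>
#include <utility>

// Hypothetical helper (not from the bug report): returns {min, max} by value.
inline std::pair<int, int> minmax_by_value(int a, int b) {
    return (b < a) ? std::pair<int, int>(b, a) : std::pair<int, int>(a, b);
}

// The two-argument std::minmax returns references to its arguments, not copies.
using MinmaxResult =
    decltype(std::minmax(std::declval<const int&>(), std::declval<const int&>()));
static_assert(std::is_same<MinmaxResult, std::pair<const int&, const int&>>::value,
              "std::minmax(a, b) yields a pair of const references");

// Small self-check (hypothetical) comparing the two forms on lvalues.
inline void check_minmax_forms() {
    int x = 7, y = 3;
    const auto by_ref = std::minmax(x, y);      // pair<const int&, const int&>
    const auto by_val = minmax_by_value(x, y);  // pair<int, int>
    assert(by_ref.first == by_val.first && by_ref.second == by_val.second);
    assert(&by_ref.first == &y);                // refers to y itself, not a copy
}
```

Because the reference-returning form hands back addresses of its inputs, it is reasonable for it to compile differently from the by-value form; whether either one uses cmov is exactly what the report is about and is not claimed here.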
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80833
--- Comment #12 from Uroš Bizjak ---
(In reply to Peter Cordes from comment #4)
> MMX is also a saving in code-size: one fewer prefix byte vs. SSE2 integer
> instructions. It's also another set of 8 registers for 32-bit mode.
After touching a M
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80833
--- Comment #11 from Uroš Bizjak ---
(In reply to Peter Cordes from comment #0)
> A lower-latency xmm->int strategy would be:
>
> movd %xmm0, %eax
> pextrd $1, %xmm0, %edx
Proposed patch implements the above for generic move
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80833
--- Comment #10 from Uroš Bizjak ---
(In reply to Peter Cordes from comment #0)
> Scalar 64-bit integer ops in vector regs may be useful in general in 32-bit
> code in some cases, especially if it helps with register pressure.
We have scalar-to
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80833
--- Comment #9 from Uroš Bizjak ---
(In reply to Uroš Bizjak from comment #8)
> movq %xmm0, (%esp) <<-- unneeded store due to RA problem
For some reason, reload "fixes" direct DImode register moves and passes the
value via memory.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80873
--- Comment #2 from Morris Hafner ---
Created attachment 41413
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=41413&action=edit
Minimal example code (valid)
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80873
--- Comment #1 from Morris Hafner ---
I managed to create an example that is a valid program:
struct Buffer {};
auto parse(Buffer b);
template void parse(T target);
template
auto field(T target) {
return [&] {
parse(t
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80803
boger at us dot ibm.com changed:
What|Removed |Added
CC||boger at us dot ibm.com
--- Com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80833
--- Comment #8 from Uroš Bizjak ---
The patch from comment #7 generates:
a) DImode move for 32 bit targets:
--cut here--
long long test (long long a)
{
asm ("" : "+x" (a));
return a;
}
--cut here--
gcc -O2 -msse4.1 -mtune=intel -mregparm=2
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=46186
Raphael C changed:
What|Removed |Added
CC||drraph at gmail dot com
--- Comment #26 from
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80833
--- Comment #7 from Uroš Bizjak ---
Created attachment 41412
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=41412&action=edit
Prototype patch
Patch that emits mov/pinsr or mov/pextr pairs for DImode (x86_32) and TImode
(x86_64) moves.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80844
--- Comment #5 from Richard Biener ---
(In reply to Jakub Jelinek from comment #2)
> (In reply to Richard Biener from comment #1)
> > If OMP SIMD always zeros the vector then it could also emit the maybe easier
> > to optimize
> >
> > WITH_SIZ
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80826
Jan Hubicka changed:
What|Removed |Added
Status|NEW |ASSIGNED
Assignee|unassigned at
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80864
Richard Biener changed:
What|Removed |Added
Keywords||ice-on-valid-code
Status|UN
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80867
Richard Biener changed:
What|Removed |Added
Target Milestone|--- |7.2
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
Richard Biener changed:
What|Removed |Added
Keywords||missed-optimization, openmp
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80856
Richard Biener changed:
What|Removed |Added
Priority|P3 |P2
Target Milestone|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80853
Richard Biener changed:
What|Removed |Added
Priority|P3 |P2
Target Milestone|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80823
Peter Bergner changed:
What|Removed |Added
Status|RESOLVED|CLOSED
--- Comment #7 from Peter Bergner
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80823
Peter Bergner changed:
What|Removed |Added
Status|NEW |RESOLVED
URL|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80823
--- Comment #5 from Peter Bergner ---
Author: bergner
Date: Wed May 24 12:10:54 2017
New Revision: 248408
URL: https://gcc.gnu.org/viewcvs?rev=248408&root=gcc&view=rev
Log:
gcc/
PR middle-end/80823
* tree-cfg.c (group_case_labels
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80725
--- Comment #5 from Andreas Krebbel ---
Author: krebbel
Date: Wed May 24 11:36:54 2017
New Revision: 248407
URL: https://gcc.gnu.org/viewcvs?rev=248407&root=gcc&view=rev
Log:
S/390: Fix PR80725.
gcc/ChangeLog:
2017-05-24 Andreas Krebbel
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80851
Richard Biener changed:
What|Removed |Added
Keywords||rejects-valid
Status|UNCONF
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80846
Richard Biener changed:
What|Removed |Added
Status|UNCONFIRMED |ASSIGNED
Last reconfirmed|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71579
--- Comment #6 from Antony Polukhin ---
C++ LWG related issue: http://cplusplus.github.io/LWG/lwg-active.html#2797
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80873
Bug ID: 80873
Summary: ICE in tsubst_copy when trying to use an overloaded
function without a definition in a lambda
Product: gcc
Version: 7.1.1
Status: UNCONFIRMED
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79583
--- Comment #2 from Paolo Carlini ---
The released 7.1.0, current gcc-7-branch and trunk are fine. I'm adding the
testcase and closing the bug.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80868
Marek Polacek changed:
What|Removed |Added
CC||mpolacek at gcc dot gnu.org
--- Comment
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68578
Paolo Carlini changed:
What|Removed |Added
CC||paolo.carlini at oracle dot com
--- Comm
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80872
Bug ID: 80872
Summary: There is no warning on accidental infinite loops
Product: gcc
Version: unknown
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80396
Christophe Lyon changed:
What|Removed |Added
CC||clyon at gcc dot gnu.org
--- Comment #
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80812
Ville Voutilainen changed:
What|Removed |Added
Status|NEW |ASSIGNED
CC|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
--- Comment #2 from Jakub Jelinek ---
Also, even for host fallback there is a separate set of ICVs and many other
properties, the target region can't be just ignored for many reasons even if
there is no data sharing.
Of course, if you provide sma
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80844
--- Comment #4 from Jakub Jelinek ---
What we should do is first vectorize the main simd loop and then, once we've
determined the vectorization factor thereof etc., see if there is any related
preparation and finalization loop around it and try t
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80867
Eric Botcazou changed:
What|Removed |Added
Status|UNCONFIRMED |WAITING
Last reconfirmed|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80859
Jakub Jelinek changed:
What|Removed |Added
CC||jakub at gcc dot gnu.org
--- Comment #1
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78969
--- Comment #8 from Jakub Jelinek ---
idx_10 addition is a consequence of TODO_update_ssa in vrp1's todo_flags,
triggered by jump threading creating the bb6.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78969
Jakub Jelinek changed:
What|Removed |Added
CC||jakub at gcc dot gnu.org
--- Comment #7