Bug#574990: nscd crashes after moderate use

2010-07-29 Thread Lukas Kolbe
Am Mittwoch, den 28.07.2010, 23:20 -0400 schrieb Moritz Muehlenhoff:
> On Mon, Mar 22, 2010 at 06:55:00PM +0100, Lukas Kolbe wrote:
> > Package: nscd
> > Version: 2.7-18lenny2
> > Severity: grave
> > 
> > Hi,
> > 
> > on all (?) our servers we see repeated crashes of nscd:
> > 
> > [ 3687.306397] nscd[3340]: segfault at 10 ip 7f6b22ce3685 sp 
> > 7fffc977bb60 error 6 in nscd[7f6b22cdd000+1c000]
> > 
> > This happens mostly between 3000 and 6000 seconds after starting. We
> > have 3003 users in our LDAP, so we increased the 'suggested-size' to a
> > high prime number 1, removed the dbs and restarted nscd. The crashes
> > happen nevertheless, rendering nscd completely unusable on our systems.
> > I'd like to provide more debugging information, but I don't really know
> > how at the moment ...
> 
> I'm currently setting up a similar test environment with Lenny
> and Squeeze to see if I can reproduce it.
> 
> nscd has configuration options to force a continuous restart, see
> the options "paranoia" and "restart-interval" in nscd.conf. Does
> nscd keep up if you set it to something like 600?
> 
> You should probably use this in conjunction with the "persistent"
> settings.

Sorry, I forgot to mention that - of course we set paranoia mode on, but
that doesn't help at all. I'll set the restart-interval down to 600 to
see if it helps. Here's our default configuration for reference:

debug-level 0
paranoiayes
enable-cachepasswd  yes
positive-time-to-live   passwd  600
negative-time-to-live   passwd  20
suggested-size  passwd  1
check-files passwd  yes
persistent  passwd  yes
shared  passwd  yes
max-db-size passwd  134217728
auto-propagate  passwd  yes
enable-cachegroup   yes
positive-time-to-live   group   3600
negative-time-to-live   group   60
suggested-size  group   1
check-files group   yes
persistent  group   yes
shared  group   yes
max-db-size group   134217728
auto-propagate  group   yes
enable-cachehosts   no
positive-time-to-live   hosts   3600
negative-time-to-live   hosts   20
suggested-size  hosts   1
check-files hosts   yes
persistent  hosts   yes
shared  hosts   yes
max-db-size hosts   134217728
enable-cacheservicesyes
positive-time-to-live   services28800
negative-time-to-live   services20
suggested-size  services1
check-files servicesyes
persistent  servicesyes
shared  servicesyes
max-db-size services134217728

As soon as we test our first fileserver with Squeeze (which will take a
few more months probably), I hope to report this bug is non-existant
there ;)

Thanks for your help!

-- 
Lukas




-- 
To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/1280390063.5114.4.ca...@larosa.fritz.box



Bug#574990: nscd crashes after moderate use

2010-07-29 Thread Lukas Kolbe
Hi again,

> I'm currently setting up a similar test environment with Lenny
> and Squeeze to see if I can reproduce it.
> 
> nscd has configuration options to force a continuous restart, see
> the options "paranoia" and "restart-interval" in nscd.conf. Does
> nscd keep up if you set it to something like 600?
> 
> You should probably use this in conjunction with the "persistent"
> settings.

Hm, even after setting restart-interval to 600 nscd keeps crashing:

Jul 29 10:06:42 niska kernel: [29035521.984129] nscd[24114]: segfault at 10 ip 
7fa46659b685 sp 7fffec52f6b0 error 6 in nscd[7fa466595000+1c000]

-- 
Lukas





-- 
To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/1280392251.5114.6.ca...@larosa.fritz.box



Bug#574990: nscd crashes after restart after sshd hangs

2010-07-29 Thread Hermann Lauer
Hello,

with libnss-ldap and nscd on lenny latest from time to time we have
unresolved bug #552431, which makes sshd hanging due to suddenly unresolvable
"sshd" system user.

Today nscd crashed after sshd hanging and login was possible.
I set "paranoia yes" in nscd.conf without setting "restart-interval" so it
should be default 3600.

Now nscd crashed, probably while restarting itself:

Jul 29 11:00:43 quadopt3 kernel: [3616361.904197] nscd[6891]: segfault at 10 ip 
7f587be30685 sp 7fffd960 error 6 in nscd[7f587be2a000+1c000]

Set "persistent no" now and restarted nscd. Current config is below.

Any help for solving this and #552431 is most welcome !

Thanks,
  Hermann

nscd.conf:
debug-level 0
#   reload-count5
paranoiayes
#   restart-interval3600

enable-cachepasswd  yes
positive-time-to-live   passwd  600
negative-time-to-live   passwd  20
suggested-size  passwd  211
check-files passwd  yes
persistent  passwd  no
shared  passwd  yes
max-db-size passwd  33554432
auto-propagate  passwd  yes

enable-cachegroup   yes
positive-time-to-live   group   3600
negative-time-to-live   group   60
suggested-size  group   211
check-files group   yes
persistent  group   no
shared  group   yes
max-db-size group   33554432
auto-propagate  group   yes

# hosts caching is broken with gethostby* calls, hence is now disabled
# per default.  See /usr/share/doc/nscd/NEWS.Debian.
enable-cachehosts   no
positive-time-to-live   hosts   3600
negative-time-to-live   hosts   20
suggested-size  hosts   211
check-files hosts   yes
persistent  hosts   no
shared  hosts   yes
max-db-size hosts   33554432

enable-cacheservicesyes
positive-time-to-live   services28800
negative-time-to-live   services20
suggested-size  services211
check-files servicesyes
persistent  servicesno
shared  servicesyes
max-db-size services33554432



-- 
To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/20100729100446.gb8...@lemon.iwr.uni-heidelberg.de



Bug#413934: Bug appears to be fixed in squeeze glibc

2010-07-29 Thread Mike Crowe
I've tested minicom in a squeeze chroot using the byte sequence I
provided earlier. I could not reproduce the problem. Minicom displays
a strange character (as would be expected) but does not assert.

I tested with:

ii  libc6  2.11.2-2   
Embedded GNU C Library: Shared libraries
ii  minicom2.4-2  
friendly menu driven serial communication program

So, I think we can call this fixed in squeeze.

Thanks.

Mike.



-- 
To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20100729113028.ga32...@mcrowe.com



Bug#574990: nscd crashes after moderate use

2010-07-29 Thread Moritz Muehlenhoff
On Thu, Jul 29, 2010 at 10:30:51AM +0200, Lukas Kolbe wrote:
> Hi again,
> 
> > I'm currently setting up a similar test environment with Lenny
> > and Squeeze to see if I can reproduce it.
> > 
> > nscd has configuration options to force a continuous restart, see
> > the options "paranoia" and "restart-interval" in nscd.conf. Does
> > nscd keep up if you set it to something like 600?
> > 
> > You should probably use this in conjunction with the "persistent"
> > settings.
> 
> Hm, even after setting restart-interval to 600 nscd keeps crashing:
> 
> Jul 29 10:06:42 niska kernel: [29035521.984129] nscd[24114]: segfault at 10 
> ip 7fa46659b685 sp 7fffec52f6b0 error 6 in nscd[7fa466595000+1c000]

Could you please run an additional test: If you set the cache-size
to a really small value, does is crash faster?

I have the suspicion that the crash might occur the first time a
cache slot is replaced.

Cheers,
Moritz



-- 
To UNSUBSCRIBE, email to debian-glibc-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20100730010027.gb8...@galadriel.inutil.org