Package: ppp
Version: 2.4.2+20040428-6
Severity: critical
Justification: breaks the whole system

hello pppeople,

i've encountered serious problems with pppd and days of monitoring the
issues lead me to the assumption that pppd is to blame. feel free to
tell me otherwise and i'll happily forward the problem to the right
people - because i am not guru enough to fix it.

the problem in short: i am using pppd with pppoe (ADSL dialup) i get 
disconnected by my ISP once a day and pppd receives "LCP terminated by
peer" then. normally pppd will reconnect and the box would be online
again. this does not happen anymore, instead 30min after disconnect all
RAM is chewed up by *something* (still don't know by what, please read
on) and OOM killer kicks in, killing almost every application.

i have reported the issue to the linux-kernel mailinglist first, because
i thought it was a kernel issue:
  http://lkml.org/lkml/2005/3/8/173
  http://lkml.org/lkml/2005/3/10/109
maybe the replies are noteworthy too. 

yesterday i hit the issue again, with more (hopefully helpful) details:
  http://www.ussg.iu.edu/hypermail/linux/kernel/0503.2/0316.html
again, the time between "LCP terminated by peer" to the first OOM
messages are (alsmost exactly) 30min. when this happens, i am usually
not around, so when i get to the machine, OOM has already happened. i
can recover the machine (no reboot, but SYSRQ-E) and i have to kill -9
pppd, anything else does not work:

http://nerdbynature.de/bits/sheep/2.6.11/oom/pppd.log
("Terminating on signal 15" does not work, "+++ killed by SIGKILL +++"
is the only choice.)

i first noticed this behaviour with a new kernel, namely 2.6.11, but
downgrading to a former (stable for me) kernel (2.6.11-rc5-bk2) did not
help. instead, i downgraded from pppd-2.4.3-20041231+2 to
ppp-2.4.2+20040428-6 and got rid of the problem! 2.4.3 was uploaded on
26.02.2005, and a couple of days later i switched to a new kernel and
did a "apt-get upgrade". that could be the cause, i suspected kernel
issues first. 

have a look at http://nerdbynature.de/bits/sheep/2.6.11/oom/   for
further details, feel free to ask. i still did not manage to reproduce
the problem "by hand", e.g. by setting up a pppoe server on the same
machine,  connecting with "pppd call..." and killing the pppoe server
(in the hope that it might send "LCP terminated by peer" to the pppd").


thank you for your time,
Christian.

-- System Information:
Debian Release: 3.1
  APT prefers unstable
  APT policy: (990, 'unstable'), (1, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.11.3
Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)

Versions of packages ppp depends on:
ii  libc6                       2.3.2.ds1-20 GNU C Library: Shared libraries an
ii  libpam-modules              0.76-22      Pluggable Authentication Modules f
ii  libpam-runtime              0.76-22      Runtime support for the PAM librar
ii  libpam0g                    0.76-22      Pluggable Authentication Modules l
ii  libpcap0.7                  0.7.2-7      System interface for user-level pa
ii  libssl0.9.7                 0.9.7e-3     SSL shared libraries
ii  makedev                     2.3.1-76     creates device files in /dev
ii  netbase                     4.20         Basic TCP/IP networking system
ii  procps                      1:3.2.5-1    /proc file system utilities

-- no debconf information


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to