Hello Jason, An earlier post showed auth_md5_validate() in the logs, which is what does APOP validation - this time, no mention. Can you forward the logs at trace level 5 (when the problem "should" happen). Are you using APOP or not? (Maybe after running through APOP command a variable is reused for USER/PASS without initialization or something.) Also, I assume the user(s) this happens for does have the password saved correctly, as this doesn't happen 100% of the time? Or do they manually enter it (ie. sometime it'll be right, sometimes not)?
Jesse ---- Original Message ---- From: Jason Burfield <dbmail@dbmail.org> To: dbmail@dbmail.org Subject: Re: [Dbmail] High Load Average and dbmail-pop3d ? Sent: 19 May 2003 17:24:21 -0400 > Well...I have found one thing that is interesting... > > With the trace_level set to 5, I do NOT have any problems at all. > However, at 0,1,2,3 or 4 I have problems. > > Does this make any sense to anyone? > > The most I can get in a log file is at trace_level=4 and it is just > this: > > --- > May 19 17:18:51 vault dbmail/pop3d[24695]: PerformChildTask(): incoming > connection from [xx.xxx.xxx.xx] > May 29 17:18:51 vault dbmail/pop3d[24695]: user [EMAIL PROTECTED] could > not be validated > --- > > At this point it seems to go into a loop or something, using up all > processor ticks and NOT returning an error to the user. > > When I go in and issue 'killall dbmail-pop3d' is when the user gets a > 'password incorrect' message on his/her screen. > > So far it will not freak out when it's at trace_level 5, but then the > logs are huge. > > I'm still baffled. > > Thanks... > > -- Jason > > > On Sun, 2003-05-18 at 13:34, Micah Stevens wrote: > > go through the logs and find where he's getting an invalid PW error, if you > > have the trace level set to 5 it should provide some usefull information > > there. Post that bit if you can maybe others can help. > > > > I would restart the pop server and purposely log in with an incorrect > > password > > too to see what happens. > > > > I know I have at least 1 OSX user and this isn't a problem, I'm using a CVS > > from February though. > > > > -Micah > > > > > > On Saturday 17 May 2003 3:30 pm, Jason Burfield wrote: > > > Another quick follow-up... > > > > > > My server just freaked and ran up to a load average over 32.00. That is > > > out > > > of control! Wow! > > > > > > There were 30 dbmail-pop3d processes all carrying a load. I killed the > > > server and then with a copy of the log file, traced each of the pids back. > > > > > > Every single one of them ended with: > > > > > > -- > > > May 17 17:31:21 vault dbmail/pop3d[24001]: auth_md5_validate(): user > > > [EMAIL PROTECTED] could not be validated > > > -- > > > > > > Obviously, the address has been changed. > > > > > > I found that this user is using the Apple Mail program on OS X to check > > > mail. And, he is checking mail every one minute. (yeah, I know...but what > > > can I do?) > > > > > > It seems that most of his connections work fine, but then, for whatever > > > reason, he gets an error about his password being incorrect. > > > > > > This seems to be when the server takes off and creates a huge load. > > > > > > I'm really getting concerned now because when the server is left alone for > > > a few hours, it gets to be out of control. > > > > > > If there is any additional info anyone would like to see, please let me > > > know...I'd like to help figure this one out! > > > > > > Thanks again... > > > > > > -- Jason > > > > > > on 5/17/03 2:23 AM, Eelco van Beek - IC&S at [EMAIL PROTECTED] wrote: > > > > The most important thing pop & imap share is the connection handling > > > > code. Roel is checking that out for bugs. > > > > > > > > Best regards, > > > > > > > > eelco > > > > > > > > On zaterdag, mei 17, 2003, at 04:22 Europe/Amsterdam, Chris Nolan wrote: > > > >> Hi all, > > > >> > > > >> I am having an extremely similar problem with dbmail-imapd (running > > > >> dbmail 1.1). And, to simplify things somewhat, I can tell you a bit > > > >> about the problem. > > > >> > > > >> 1. MySQL is *not* the cause of this problem. I am using MySQL 4.0.12 > > > >> and killing dbmail-imapd very quickly brings the load average down. > > > >> 2. The load average grows linearly, so one day after having > > > >> dbmail-imapd running, the load average will read "1.00 1.00 1.00", the > > > >> second day "2.00 2.00 2.00" the third day "3.00 3.00 3.00" etc. > > > >> 3. I am running Mandrake 9.0 (one of the last glibc 2.2-based distros). > > > >> 4. DBMail very happily maxes out MySQL's connection pool. I'm betting > > > >> this has something to do with it (trying to connect from any other > > > >> machine with any MySQL client or another DBMail instance will always > > > >> fail after dbmail-imapd has been running for a few days). > > > >> 5. I have many indexes and foreign key constraints in place as have > > > >> been suggested on this list. > > > >> 6. I only have 8 people using this server - 6 with IMAP clients and 2 > > > >> with POP3 clients. > > > >> 7. SquirrelMail's performance degrades as the load average goes up, > > > >> and massively. It takes 10 seconds to render each frame due to the > > > >> IMAP side of things being so heavily choked. > > > >> > > > >> If anyone has any other questions or any suggestion as to where the > > > >> problem is in the source (I'm a 4th year Software Engineering student > > > >> at a uni where C is the primary language and Unix is the primary > > > >> platform) I'll have a go at fixing it. > > > >> > > > >> Interestingly, Roel said that they only get this problem every few > > > >> months with some of their customer sites.... > > > >> > > > >> Chris > > > >> > > > >> lou wrote: > > > >>> In some email I received from Jason Burfield <[EMAIL PROTECTED]> on > > > >>> 16 May 2003 14:54:16 > > > >>> -0400, wrote: > > > >>> > > > >>> Hi jason, > > > >>> did you try to attach a debugger to the process? > > > >>> in any way if you can get us dump image off the process that would be > > > >>> great. > > > >>> > > > >>> kill -ABRT <pid> > > > >>> should make the process exit with a core dump. > > > >>> > > > >>> anyway, might be an infinite loop or something? > > > >>> > > > >>> cheers. > > > >>> > > > >>>> It's dbmail-pop3d. > > > >>>> > > > >>>> Currently TOP is showing this: > > > >>>> > > > >>>> PID USER PRI NI SIZE RSS SHARE STAT %CPU %MEM TIME COMMAND > > > >>>> 21088 root 15 0 564 564 500 S 99.9 0.0 181:42 > > > >>>> dbmail-pop3d > > > >>>> > > > >>>> The database currently has all proper indexes...and it does not seem > > > >>>> to > > > >>>> be a MySQL issue at all. > > > >>>> > > > >>>> And, the load average continues to grow the longer that dbmail-pop3d > > > >>>> continues to run. > > > >>>> > > > >>>> It is currently at 11.57 > > > >>>> > > > >>>> This is the highest I personally have EVER seen one of my machines > > > >>>> go. > > > >>>> > > > >>>> Also, when I check the maillog it does not seem to be doing anything > > > >>>> out > > > >>>> of the ordinary. It almost looks like one child process has just run > > > >>>> off > > > >>>> and refuses to stop...but the log is not showing tons of stuff going > > > >>>> on. > > > >>>> > > > >>>> I'm totally freaked and confused now! > > > >>>> > > > >>>> Thanks! > > > >>>> > > > >>>> -- Jason > > > >>>> > > > >>>> On Fri, 2003-05-16 at 14:39, Aaron Stone wrote: > > > >>>>> When you run top, do you find that most of the CPU time is being > > > >>>>> taken by > > > >>>>> dbmail-pop3d itself, or by mysqld? In the latter case, you may do > > > >>>>> well to > > > >>>>> tune the database a bit in my.cnf, and to add the indices posted to > > > >>>>> the > > > >>>>> list a couple of times. If it's pop3d.. umm... sounds bad, post > > > >>>>> more info! > > > >>>>> > > > >>>>> Aaron > > > >>>>> > > > >>>>> On 16 May 2003, Jason Burfield wrote: > > > >>>>>> I have searched through back messages and found one thread with > > > >>>>>> someone > > > >>>>>> having huge load averages caused by dbmail-smtpd. > > > >>>>>> > > > >>>>>> Well...I'm getting HUGE load averages with dbmail-pop3d. > > > >>>>>> > > > >>>>>> For whatever reason the load average on my box is jumping up to > > > >>>>>> well > > > >>>>>> over 7.00 after dbmail-pop3d has been running for a few minutes. > > > >>>>>> > > > >>>>>> This is on a dual 2.4ghz Xeon with 4 gigs of RAM...so I can't for > > > >>>>>> the > > > >>>>>> life of me, imagine what would cause that type of load. > > > >>>>>> > > > >>>>>> This mail server went up last night...so in the past 24 hours it > > > >>>>>> has > > > >>>>>> accepted something around 2000 messages. We do have lots of clients > > > >>>>>> checking mail frequently, around every 5 minutes, however, I > > > >>>>>> wouldn't > > > >>>>>> think that would cause this type of load. > > > >>>>>> > > > >>>>>> I'm running dmail-1.1, MySQL and Postfix on Linux. (RH 8) > > > >>>>>> > > > >>>>>> As an example, at this very moment the box has a load average of > > > >>>>>> 5.62. > > > >>>>>> There are currently 6 clients connected to the pop server. > > > >>>>>> > > > >>>>>> Does anyone have any idea what may be causing this? We don't have > > > >>>>>> that > > > >>>>>> many clients, only about 300 email clients...what type of load > > > >>>>>> should be > > > >>>>>> expected from that type of base? > > > >>>>>> > > > >>>>>> Any help is greatly appreciated. > > > >>>>>> > > > >>>>>> Thanks! > > > >>>>>> > > > >>>>>> -- Jason > > > >>>>>> > > > >>>>>> -- Jason Burfield > > > >>>>>> -- [EMAIL PROTECTED] > > _______________________________________________ > Dbmail mailing list > Dbmail@dbmail.org > https://mailman.fastxs.nl/mailman/listinfo/dbmail > -- End Original Message -- -- Jesse Norell jesse (at) kci.net