Hi Kern! I did what you advised me to do:
When it works: -------------------------------------------------------------------------------------------------------- maculele:~# bconsole -d100 Connecting to Director maculele:9101 bconsole: bnet.c:698 Current host[ipv4:192.168.0.1:9101] All host[ipv4:192.168.0.1:9101] bconsole: bnet.c:952 who=Director daemon host=maculele port=9101 bconsole: btimers.c:169 Start bsock timer 0x8091e78 tid=0x402a2a60 for 300 secs at 1131473741 bconsole: cram-md5.c:100 cram-get: auth cram-md5 <[EMAIL PROTECTED]> ssl=0 bconsole: cram-md5.c:120 sending resp to challenge: 3R+tF8ogF18Qig+Ij8/kOB bconsole: cram-md5.c:52 send: auth cram-md5 <[EMAIL PROTECTED]> ssl=0 bconsole: cram-md5.c:70 Authenticate OK +8tU88pH1RE0K4+LU+hXVA bconsole: authenticate.c:82 >dird: 1000 OK auth bconsole: authenticate.c:89 <dird: 1000 OK: maculele-dir Version: 1.36.2 (28 February 2005) bconsole: btimers.c:183 Stop bsock timer 0x8091e78 tid=0x402a2a60 at 1131473741. 1000 OK: maculele-dir Version: 1.36.2 (28 February 2005) bconsole: console.c:452 Opened connection with Director daemon Enter a period to cancel a command. * --------------------------------------------------------------------------------------------------------- when it doesn't work: --------------------------------------------------------------------------------------------------------- [EMAIL PROTECTED]:~# bconsole -d100 Connecting to Director newton:9101 bconsole: bnet.c:698 Current host[ipv4:192.168.0.8:9101] All host[ipv4:192.168.0.8:9101] bconsole: bnet.c:952 who=Director daemon host=newton port=9101 bconsole: btimers.c:169 Start bsock timer 0x8091f10 tid=0x402a7a60 for 300 secs at 1131477646 bconsole: btimers.c:183 Stop bsock timer 0x8091f10 tid=0x402a7a60 at 1131477651. Director authorization problem. Most likely the passwords do not agree. Please see http://www.bacula.org/html-manual/faq.html#AuthorizationErrors for help. ERR= --------------------------------------------------------------------------------------------------------------- I still couldn't figure out what is going on... Tássia. Em 7/11/2005, "Kern Sibbald" <[EMAIL PROTECTED]> escreveu: >Set debug to 100 (-d100) on both ends. The debug messages will give more >details about what is going on ... > >On Monday 07 November 2005 21:20, TássiaCamõesAraújo wrote: >> Hi all, >> >> About 10 days ago I wrote a message saying that I had some >> "Authorization Errors" going on, but I didn't have any ideia of what >> could be causing the problem. >> At that time I didn't give information enough for someone to help me, >> thanks Arno Lehmann for trying :) >> >> The scenario is that: I have a Bacula-director running at server >> "maculele" and Bacula-fd runing at almost 10 different servers. The >> problem only happens on server "newton". The File Daemon is refusing >> the Bacula-Director connection, logging the following error: >> >> ------------------------------------------------------------------------- >> 05-Nov 21:00 maculele-dir: Start Backup JobId 603, >> Job=Newton.2005-11-05_21.00.00 >> 05-Nov 21:00 maculele-dir: Newton.2005-11-05_21.00.00 Fatal error: Unable >> to authenticate with File daemon. Possible causes: >> Passwords or names not the same or >> Maximum Concurrent Jobs exceeded on the FD or >> FD networking messed up (restart daemon). >> Please see http://www.bacula.org/html-manual/faq.html#AuthorizationErrors >> for help. >> ------------------------------------------------------------------------ >> >> Well, this time I tried to collect some more information. >> >> 1) I compared the versions of almost all packages running on my servers >> and I didn't find anything different. >> >> 2) I installed the director on the server the problem was occurring, and >> then I realized that the problem was related with any procedure of >> connecting and authenticating with bacula services. I couldn't even run >> the console, cause it couldn't connect to the director. >> >> 3) I tried to connect through telnet but it didn't work. In fact, all >> bacula ports are open but as soon as the connection is established it is >> reset by the remote host: >> >> ----------------------------------------------------------------- >> [EMAIL PROTECTED]:~$ telnet newton 9101 >> Trying 192.168.0.8... >> Connected to newton.dcc.ufba.br. >> Escape character is '^]'. >> Connection closed by foreign host. >> [EMAIL PROTECTED]:~$ >> ------------------------------------------------------------------ >> >> But it is different from trying to connect on some closed port: >> >> --------------------------------------------------------------------------- >>---- [EMAIL PROTECTED]:~$ telnet newton 9100 >> Trying 192.168.0.8... >> telnet: Unable to connect to remote host: Connection refused >> [EMAIL PROTECTED]:~$ >> --------------------------------------------------------------------------- >>------ >> >> 4) Last attempt: I tried to run strace to monitor the system calls at >> both systems and try to figure out the differences. >> >> I run the bconsole program on maculele (this server works fine), just to >> see the console trying to connect to the director, and got the following >> result (I'm showing only the final lines): >> >> --------------------------------------------------------------------------- >>--------------------------------------- clone(child_stack=0x40ab1b48, >> flags=CLONE_VM|CLONE_FS|CLONE_FILES|CLONE_SIGHAND|CLONE_THREAD|CLONE_SYSVSE >>M|CLONE_SETTLS|CLONE_PARENT_SETTID|CLONE_CHILD_CLEARTID|CLONE_DETACHED, >> parent_tidptr=0x40ab1bf8, {entry_number:6, base_addr:0x40ab1bb0, >> limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, >> limit_in_pages:1, seg_not_present:0, useable:1}, >> child_tidptr=0x40ab1bf8) = 12726 >> futex(0x806bb50, FUTEX_WAKE, 1) = 1 >> time(NULL) = 1131378562 >> write(3, "\0\0\0\32", 4) = 4 >> write(3, "Hello *UserAgent* calling\n", 26) = 26 >> read(3, "\0\0\0009", 4) = 4 >> read(3, "auth cram-md5 <1703362652.113137"..., 57) = 57 >> write(3, "\0\0\0\27", 4) = 4 >> write(3, "P4+EI119mW+PsS85V//BuC\0", 23) = 23 >> select(4, [3], NULL, NULL, {180, 0}) = 1 (in [3], left {179, 960000}) >> read(3, "\0\0\0\r", 4) = 4 >> read(3, "1000 OK auth\n", 13) = 13 >> gettimeofday({1131378562, 978543}, {120, 0}) = 0 >> gettimeofday({1131378562, 978705}, {120, 0}) = 0 >> gettimeofday({1131378562, 978863}, {120, 0}) = 0 >> gettimeofday({1131378562, 979020}, {120, 0}) = 0 >> gettimeofday({1131378562, 979156}, {120, 0}) = 0 >> uname({sys="Linux", node="maculele", ...}) = 0 >> time(NULL) = 1131378562 >> write(3, "\0\0\0005", 4) = 4 >> write(3, "auth cram-md5 <1032677570.113137"..., 53) = 53 >> select(4, [3], NULL, NULL, {180, 0}) = 1 (in [3], left {179, 960000}) >> read(3, "\0\0\0\27", 4) = 4 >> read(3, "/61y86MgOH474FsNak+t2D\0", 23) = 23 >> write(3, "\0\0\0\r", 4) = 4 >> write(3, "1000 OK auth\n", 13) = 13 >> read(3, "\0\0\0009", 4) = 4 >> read(3, "1000 OK: maculele-dir Version: 1"..., 57) = 57 >> time(NULL) = 1131378563 >> futex(0x806bb50, FUTEX_WAKE, 1) = 1 >> futex(0x806bb40, FUTEX_WAKE, 1) = 1 >> futex(0x808ec84, FUTEX_WAKE, 1) = 1 >> write(1, "1000 OK: maculele-dir Version: 1"..., 571000 OK: maculele-dir >> Version: 1.36.2 (28 February 2005) >> ) = 57 >> write(1, "Enter a period to cancel a comma"..., 36Enter a period to >> cancel a command. >> ) = 36 >> open("/root/.bconsolerc", O_RDONLY|O_LARGEFILE) = -1 ENOENT (No such >> file or directory) >> ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig -icanon -echo >> ...}) = 0 >> write(1, "*", 1*) = 1 >> select(1, [0], NULL, NULL, {30, 0} >> --------------------------------------------------------------------------- >>------------------------------------- >> >> Then I did the same with newton (the server with the problem): >> >> --------------------------------------------------------------------------- >>------------------------------------- clone(child_stack=0x40abbb48, >> flags=CLONE_VM|CLONE_FS|CLONE_FILES|CLONE_SIGHAND|CLONE_THREAD|CLONE_SYSVSE >>M|CLONE_SETTLS|CLONE_PARENT_SETTID|CLONE_CHILD_CLEARTID|CLONE_DETACHED, >> parent_tidptr=0x40abbbf8, {entry_number:6, base_addr:0x40abbbb0, >> limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, >> limit_in_pages:1, seg_not_present:0, useable:1}, >> child_tidptr=0x40abbbf8) = 12967 >> futex(0x806bb50, FUTEX_WAKE, 1) = 1 >> time(NULL) = 1131383077 >> write(3, "\0\0\0\32", 4) = 4 >> write(3, "Hello *UserAgent* calling\n", 26) = -1 EPIPE (Broken pipe) >> --- SIGPIPE (Broken pipe) @ 0 (0) --- >> time(NULL) = 1131383077 >> open("/etc/localtime", O_RDONLY) = 4 >> fstat64(4, {st_mode=S_IFREG|0644, st_size=286, ...}) = 0 >> mmap2(NULL, 131072, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, >> 0) = 0x40abc000 >> read(4, >> "TZif\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\3\0\0\0\3\0"..., >> 131072) = 286 >> close(4) = 0 >> munmap(0x40abc000, 131072) = 0 >> time([1131383077]) = 1131383077 >> rt_sigaction(SIGPIPE, {0x40232a70, [], 0}, {SIG_IGN}, 8) = 0 >> socket(PF_FILE, SOCK_DGRAM, 0) = 4 >> fcntl64(4, F_SETFD, FD_CLOEXEC) = 0 >> connect(4, {sa_family=AF_FILE, path="/dev/log"}, 16) = 0 >> send(4, "<27>Nov 7 14:04:37 bacula-conso"..., 142, 0) = 142 >> rt_sigaction(SIGPIPE, {SIG_IGN}, NULL, 8) = 0 >> write(1, "07-Nov 14:04 bconsole: Error: b"..., 11907-Nov 14:04 >> bconsole: Error: bnet.c:406 Write error sending 26 bytes to Director >> daemon:newton:9101: ERR=Broken pipe >> ) = 119 >> nanosleep({5, 0}, NULL) = 0 >> time(NULL) = 1131383082 >> futex(0x806bb50, FUTEX_WAKE, 1) = 1 >> futex(0x806bb40, FUTEX_WAKE, 1) = 1 >> futex(0x808ec84, FUTEX_WAKE, 1) = 1 >> write(1, "Director authorization problem.\n"..., 156Director >> authorization problem. >> Most likely the passwords do not agree. >> Please see http://www.bacula.org/html-manual/faq.html#AuthorizationErrors >> for help. >> ) = 156 >> write(2, "ERR=", 4ERR=) = 4 >> ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig -icanon -echo >> ...}) = 0 ioctl(0, SNDCTL_TMR_START or TCSETS, {B38400 opost isig icanon >> echo ...}) = 0 >> ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig icanon echo >> ...}) = 0 >> munmap(0x402a8000, 4096) = 0 >> exit_group(1) >> --------------------------------------------------------------------------- >>------------------------------------- >> >> I think the problem is here: >> When works: "write(3, "Hello *UserAgent* calling\n", 26) = 26" >> When doen't work: "write(3, "Hello *UserAgent* calling\n", 26) = -1 >> EPIPE (Broken pipe)" >> >> And when the problem occurs, the password information isn't even sent! >> I think the authentication happens at this line (in the first example): >> "read(3, "auth cram-md5 <1703362652.113137"..., 57) = 57" >> I can't see any thing like that on the second example. >> >> There is any log that would be useful to look after the cause of this >> problem? >> Is it any problem with libraries that I don't even know the names? >> Please, help me.... >> >> Tássia. >> >> >> ------------------------------------------------------- >> SF.Net email is sponsored by: >> Tame your development challenges with Apache's Geronimo App Server. >> Download it for free - -and be entered to win a 42" plasma tv or your very >> own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php >> _______________________________________________ >> Bacula-users mailing list >> Bacula-users@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/bacula-users > >-- >Best regards, > >Kern > > ("> > /\ > V_V > > >------------------------------------------------------- >SF.Net email is sponsored by: >Tame your development challenges with Apache's Geronimo App Server. Download >it for free - -and be entered to win a 42" plasma tv or your very own >Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php >_______________________________________________ >Bacula-users mailing list >Bacula-users@lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/bacula-users ------------------------------------------------------- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42" plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users