>>>>> On Tue,  8 Nov 2005 21:41:12 -0300 (BRT), "=?ISO-8859-1?Q?T=E1ssia?= 
>>>>> =?ISO-8859-1?Q?Cam=F5es?= =?ISO-8859-1?Q?Ara=FAjo?=" <[EMAIL PROTECTED]> 
>>>>> said:

  Tássia> well, when I scan the ports of both servers I get the same answer:

  Tássia> The server where everything is ok:
  Tássia> 
-----------------------------------------------------------------------------------------------------------
  Tássia> maculele:~# nmap localhost -p 9101,9102,9103

  Tássia> Starting nmap 3.81 ( http://www.insecure.org/nmap/ ) at 2005-11-08 
21:38
  Tássia> BRST
  Tássia> Interesting ports on localhost (127.0.0.1):
  Tássia> PORT     STATE SERVICE
  Tássia> 9101/tcp open  unknown
  Tássia> 9102/tcp open  unknown
  Tássia> 9103/tcp open  unknown

  Tássia> Nmap finished: 1 IP address (1 host up) scanned in 0.122 seconds
  Tássia> maculele:~#
  Tássia> 
-----------------------------------------------------------------------------------------------------------

  Tássia> Where the problem is happening:

  Tássia> 
-----------------------------------------------------------------------------------------------------------
  Tássia> newton:~# nmap localhost -p 9101,9102,9103

  Tássia> Starting nmap 3.81 ( http://www.insecure.org/nmap/ ) at 2005-11-08 
21:43
  Tássia> BRT
  Tássia> Interesting ports on localhost (127.0.0.1):
  Tássia> PORT     STATE SERVICE
  Tássia> 9101/tcp open  unknown
  Tássia> 9102/tcp open  unknown
  Tássia> 9103/tcp open  unknown

  Tássia> Nmap finished: 1 IP address (1 host up) scanned in 0.116 seconds
  Tássia> newton:~#
  Tássia> 
-------------------------------------------------------------------------------------------------------------

  Tássia> Is there any other way to see if the server is listening for 
connection?

You can use

netstat -tanp | grep LISTEN

but since you already managed to connect with telnet, I think the port must be
open.

Try running the Director with -d100 or under strace -f to see what it is
doing.

__Martin



  Tássia> Em 8/11/2005, "Kern Sibbald" <[EMAIL PROTECTED]> escreveu:

  >> 
  >> My best guess is that nothing is listening on 192.168.0.8:9101
  >> 
  >> I leave it to the list to tell you how to debug that if you do not know ...
  >> 

  >> On Tuesday 08 November 2005 20:16, TássiaCamõesAraújo wrote:
  >>> Hi Kern!
  >>> 
  >>> I did what you advised me to do:
  >>> 
  >>> When it works:
  >>> 
---------------------------------------------------------------------------
  >>> ----------------------------- maculele:~# bconsole -d100
  >>> Connecting to Director maculele:9101
  >>> bconsole: bnet.c:698 Current host[ipv4:192.168.0.1:9101] All
  >>> host[ipv4:192.168.0.1:9101]
  >>> bconsole: bnet.c:952 who=Director daemon host=maculele port=9101
  >>> bconsole: btimers.c:169 Start bsock timer 0x8091e78 tid=0x402a2a60 for
  >>> 300 secs at 1131473741
  >>> bconsole: cram-md5.c:100 cram-get: auth cram-md5
  >>> <[EMAIL PROTECTED]> ssl=0
  >>> bconsole: cram-md5.c:120 sending resp to challenge: 3R+tF8ogF18Qig+Ij8/kOB
  >>> bconsole: cram-md5.c:52 send: auth cram-md5
  >>> <[EMAIL PROTECTED]> ssl=0
  >>> bconsole: cram-md5.c:70 Authenticate OK +8tU88pH1RE0K4+LU+hXVA
  >>> bconsole: authenticate.c:82 >dird: 1000 OK auth
  >>> bconsole: authenticate.c:89 <dird: 1000 OK: maculele-dir Version: 1.36.2
  >>> (28 February 2005)
  >>> bconsole: btimers.c:183 Stop bsock timer 0x8091e78 tid=0x402a2a60 at
  >>> 1131473741.
  >>> 1000 OK: maculele-dir Version: 1.36.2 (28 February 2005)
  >>> bconsole: console.c:452 Opened connection with Director daemon
  >>> Enter a period to cancel a command.
  >>> *
  >>> 
---------------------------------------------------------------------------
  >>> ------------------------------
  >>> 
  >>> when it doesn't work:
  >>> 
  >>> 
---------------------------------------------------------------------------
  >>> ------------------------------ [EMAIL PROTECTED]:~# bconsole -d100
  >>> Connecting to Director newton:9101
  >>> bconsole: bnet.c:698 Current host[ipv4:192.168.0.8:9101] All
  >>> host[ipv4:192.168.0.8:9101]
  >>> bconsole: bnet.c:952 who=Director daemon host=newton port=9101
  >>> bconsole: btimers.c:169 Start bsock timer 0x8091f10 tid=0x402a7a60 for
  >>> 300 secs at 1131477646
  >>> bconsole: btimers.c:183 Stop bsock timer 0x8091f10 tid=0x402a7a60 at
  >>> 1131477651.
  >>> Director authorization problem.
  >>> Most likely the passwords do not agree.
  >>> Please see http://www.bacula.org/html-manual/faq.html#AuthorizationErrors
  >>> for help.
  >>> ERR=
  >>> 
---------------------------------------------------------------------------
  >>> ------------------------------------
  >>> 
  >>> I still couldn't figure out what is going on...
  >>> 
  >>> Tássia.
  >>> 
  >>> Em 7/11/2005, "Kern Sibbald" <[EMAIL PROTECTED]> escreveu:
  >>> >Set debug to 100 (-d100) on both ends.  The debug messages will give more
  >>> >details about what is going on ...
  >>> >
  >>> >On Monday 07 November 2005 21:20, TássiaCamõesAraújo wrote:
  >>> >> Hi all,
  >>> >>
  >>> >> About 10 days ago I wrote a message saying that I had some
  >>> >> "Authorization Errors" going on, but I didn't have any ideia of what
  >>> >> could be causing the problem.
  >>> >> At that time I didn't give information enough for someone to help me,
  >>> >> thanks Arno Lehmann for trying :)
  >>> >>
  >>> >> The scenario is that: I have a Bacula-director running at server
  >>> >> "maculele" and Bacula-fd runing at almost 10 different servers. The
  >>> >> problem only happens on server "newton". The File Daemon is refusing
  >>> >> the Bacula-Director connection, logging the following error:
  >>> >>
  >>> >> 
------------------------------------------------------------------------
  >>> >>- 05-Nov 21:00 maculele-dir: Start Backup JobId 603,
  >>> >> Job=Newton.2005-11-05_21.00.00
  >>> >> 05-Nov 21:00 maculele-dir: Newton.2005-11-05_21.00.00 Fatal error:
  >>> >> Unable to authenticate with File daemon. Possible causes:
  >>> >> Passwords or names not the same or
  >>> >> Maximum Concurrent Jobs exceeded on the FD or
  >>> >> FD networking messed up (restart daemon).
  >>> >> Please see
  >>> >> http://www.bacula.org/html-manual/faq.html#AuthorizationErrors for 
help.
  >>> >> 
------------------------------------------------------------------------
  >>> >>
  >>> >> Well, this time I tried to collect some more information.
  >>> >>
  >>> >> 1) I compared the versions of almost all packages running on my servers
  >>> >> and I didn't find anything different.
  >>> >>
  >>> >> 2) I installed the director on the server the problem was occurring, 
and
  >>> >> then I realized that the problem was related with any procedure of
  >>> >> connecting and authenticating with bacula services. I couldn't even run
  >>> >> the console, cause it couldn't connect to the director.
  >>> >>
  >>> >> 3) I tried to connect through telnet but it didn't work. In fact, all
  >>> >> bacula ports are open but as soon as the connection is established it 
is
  >>> >> reset by the remote host:
  >>> >>
  >>> >> -----------------------------------------------------------------
  >>> >> [EMAIL PROTECTED]:~$ telnet newton 9101
  >>> >> Trying 192.168.0.8...
  >>> >> Connected to newton.dcc.ufba.br.
  >>> >> Escape character is '^]'.
  >>> >> Connection closed by foreign host.
  >>> >> [EMAIL PROTECTED]:~$
  >>> >> ------------------------------------------------------------------
  >>> >>
  >>> >> But it is different from trying to connect on some closed port:
  >>> >>
  >>> >> 
------------------------------------------------------------------------
  >>> >>--- ---- [EMAIL PROTECTED]:~$ telnet newton 9100
  >>> >> Trying 192.168.0.8...
  >>> >> telnet: Unable to connect to remote host: Connection refused
  >>> >> [EMAIL PROTECTED]:~$
  >>> >> 
------------------------------------------------------------------------
  >>> >>--- ------
  >>> >>
  >>> >> 4) Last attempt: I tried to run strace to monitor the system calls at
  >>> >> both systems and try to  figure out the differences.
  >>> >>
  >>> >> I run the bconsole program on maculele (this server works fine), just 
to
  >>> >> see the console trying to connect to the director, and got the 
following
  >>> >> result (I'm showing only the final lines):
  >>> >>
  >>> >> 
------------------------------------------------------------------------
  >>> >>--- --------------------------------------- 
clone(child_stack=0x40ab1b48,
  >>> >> 
flags=CLONE_VM|CLONE_FS|CLONE_FILES|CLONE_SIGHAND|CLONE_THREAD|CLONE_SYS
  >>> >>VSE
  >>> >> M|CLONE_SETTLS|CLONE_PARENT_SETTID|CLONE_CHILD_CLEARTID|CLONE_DETACHED,
  >>> >> parent_tidptr=0x40ab1bf8, {entry_number:6, base_addr:0x40ab1bb0,
  >>> >> limit:1048575, seg_32bit:1, contents:0, read_exec_only:0,
  >>> >> limit_in_pages:1, seg_not_present:0, useable:1},
  >>> >> child_tidptr=0x40ab1bf8) = 12726
  >>> >> futex(0x806bb50, FUTEX_WAKE, 1)         = 1
  >>> >> time(NULL)                              = 1131378562
  >>> >> write(3, "\0\0\0\32", 4)                = 4
  >>> >> write(3, "Hello *UserAgent* calling\n", 26) = 26
  >>> >> read(3, "\0\0\0009", 4)                 = 4
  >>> >> read(3, "auth cram-md5 <1703362652.113137"..., 57) = 57
  >>> >> write(3, "\0\0\0\27", 4)                = 4
  >>> >> write(3, "P4+EI119mW+PsS85V//BuC\0", 23) = 23
  >>> >> select(4, [3], NULL, NULL, {180, 0})    = 1 (in [3], left {179, 
960000})
  >>> >> read(3, "\0\0\0\r", 4)                  = 4
  >>> >> read(3, "1000 OK auth\n", 13)           = 13
  >>> >> gettimeofday({1131378562, 978543}, {120, 0}) = 0
  >>> >> gettimeofday({1131378562, 978705}, {120, 0}) = 0
  >>> >> gettimeofday({1131378562, 978863}, {120, 0}) = 0
  >>> >> gettimeofday({1131378562, 979020}, {120, 0}) = 0
  >>> >> gettimeofday({1131378562, 979156}, {120, 0}) = 0
  >>> >> uname({sys="Linux", node="maculele", ...}) = 0
  >>> >> time(NULL)                              = 1131378562
  >>> >> write(3, "\0\0\0005", 4)                = 4
  >>> >> write(3, "auth cram-md5 <1032677570.113137"..., 53) = 53
  >>> >> select(4, [3], NULL, NULL, {180, 0})    = 1 (in [3], left {179, 
960000})
  >>> >> read(3, "\0\0\0\27", 4)                 = 4
  >>> >> read(3, "/61y86MgOH474FsNak+t2D\0", 23) = 23
  >>> >> write(3, "\0\0\0\r", 4)                 = 4
  >>> >> write(3, "1000 OK auth\n", 13)          = 13
  >>> >> read(3, "\0\0\0009", 4)                 = 4
  >>> >> read(3, "1000 OK: maculele-dir Version: 1"..., 57) = 57
  >>> >> time(NULL)                              = 1131378563
  >>> >> futex(0x806bb50, FUTEX_WAKE, 1)         = 1
  >>> >> futex(0x806bb40, FUTEX_WAKE, 1)         = 1
  >>> >> futex(0x808ec84, FUTEX_WAKE, 1)         = 1
  >>> >> write(1, "1000 OK: maculele-dir Version: 1"..., 571000 OK: maculele-dir
  >>> >> Version: 1.36.2 (28 February 2005)
  >>> >> ) = 57
  >>> >> write(1, "Enter a period to cancel a comma"..., 36Enter a period to
  >>> >> cancel a command.
  >>> >> ) = 36
  >>> >> open("/root/.bconsolerc", O_RDONLY|O_LARGEFILE) = -1 ENOENT (No such
  >>> >> file or directory)
  >>> >> ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig -icanon 
-echo
  >>> >> ...}) = 0
  >>> >> write(1, "*", 1*)                        = 1
  >>> >> select(1, [0], NULL, NULL, {30, 0}
  >>> >> 
------------------------------------------------------------------------
  >>> >>--- -------------------------------------
  >>> >>
  >>> >> Then I did the same with newton (the server with the problem):
  >>> >>
  >>> >> 
------------------------------------------------------------------------
  >>> >>--- ------------------------------------- clone(child_stack=0x40abbb48,
  >>> >> 
flags=CLONE_VM|CLONE_FS|CLONE_FILES|CLONE_SIGHAND|CLONE_THREAD|CLONE_SYS
  >>> >>VSE
  >>> >> M|CLONE_SETTLS|CLONE_PARENT_SETTID|CLONE_CHILD_CLEARTID|CLONE_DETACHED,
  >>> >> parent_tidptr=0x40abbbf8, {entry_number:6, base_addr:0x40abbbb0,
  >>> >> limit:1048575, seg_32bit:1, contents:0, read_exec_only:0,
  >>> >> limit_in_pages:1, seg_not_present:0, useable:1},
  >>> >> child_tidptr=0x40abbbf8) = 12967
  >>> >> futex(0x806bb50, FUTEX_WAKE, 1)         = 1
  >>> >> time(NULL)                              = 1131383077
  >>> >> write(3, "\0\0\0\32", 4)                = 4
  >>> >> write(3, "Hello *UserAgent* calling\n", 26) = -1 EPIPE (Broken pipe)
  >>> >> --- SIGPIPE (Broken pipe) @ 0 (0) ---
  >>> >> time(NULL)                              = 1131383077
  >>> >> open("/etc/localtime", O_RDONLY)        = 4
  >>> >> fstat64(4, {st_mode=S_IFREG|0644, st_size=286, ...}) = 0
  >>> >> mmap2(NULL, 131072, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, 
-1,
  >>> >> 0) = 0x40abc000
  >>> >> read(4,
  >>> >> "TZif\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\3\0\0\0\3\0"...,
  >>> >> 131072) = 286
  >>> >> close(4)                                = 0
  >>> >> munmap(0x40abc000, 131072)              = 0
  >>> >> time([1131383077])                      = 1131383077
  >>> >> rt_sigaction(SIGPIPE, {0x40232a70, [], 0}, {SIG_IGN}, 8) = 0
  >>> >> socket(PF_FILE, SOCK_DGRAM, 0)          = 4
  >>> >> fcntl64(4, F_SETFD, FD_CLOEXEC)         = 0
  >>> >> connect(4, {sa_family=AF_FILE, path="/dev/log"}, 16) = 0
  >>> >> send(4, "<27>Nov  7 14:04:37 bacula-conso"..., 142, 0) = 142
  >>> >> rt_sigaction(SIGPIPE, {SIG_IGN}, NULL, 8) = 0
  >>> >> write(1, "07-Nov 14:04 bconsole:  Error: b"..., 11907-Nov 14:04
  >>> >> bconsole:  Error: bnet.c:406 Write error sending 26 bytes to Director
  >>> >> daemon:newton:9101: ERR=Broken pipe
  >>> >> ) = 119
  >>> >> nanosleep({5, 0}, NULL)                 = 0
  >>> >> time(NULL)                              = 1131383082
  >>> >> futex(0x806bb50, FUTEX_WAKE, 1)         = 1
  >>> >> futex(0x806bb40, FUTEX_WAKE, 1)         = 1
  >>> >> futex(0x808ec84, FUTEX_WAKE, 1)         = 1
  >>> >> write(1, "Director authorization problem.\n"..., 156Director
  >>> >> authorization problem.
  >>> >> Most likely the passwords do not agree.
  >>> >> Please see
  >>> >> http://www.bacula.org/html-manual/faq.html#AuthorizationErrors for 
help.
  >>> >> ) = 156
  >>> >> write(2, "ERR=", 4ERR=)                     = 4
  >>> >> ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig -icanon 
-echo
  >>> >> ...}) = 0 ioctl(0, SNDCTL_TMR_START or TCSETS, {B38400 opost isig 
icanon
  >>> >> echo ...}) = 0
  >>> >> ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig icanon echo
  >>> >> ...}) = 0
  >>> >> munmap(0x402a8000, 4096)                = 0
  >>> >> exit_group(1)
  >>> >> 
------------------------------------------------------------------------
  >>> >>--- -------------------------------------
  >>> >>
  >>> >> I think the problem is here:
  >>> >> When works: "write(3, "Hello *UserAgent* calling\n", 26) = 26"
  >>> >> When doen't work: "write(3, "Hello *UserAgent* calling\n", 26) = -1
  >>> >> EPIPE (Broken pipe)"
  >>> >>
  >>> >> And when the problem occurs, the password information isn't even sent!
  >>> >> I think the authentication happens at this line (in the first example):
  >>> >> "read(3, "auth cram-md5 <1703362652.113137"..., 57) = 57"
  >>> >> I can't see any thing like that on the second example.
  >>> >>
  >>> >> There is any log that would be useful to look after the cause of this
  >>> >> problem?
  >>> >> Is it any problem with libraries that I don't even know the names?
  >>> >> Please, help me....
  >>> >>
  >>> >> Tássia.
  >>> >>
  >>> >>
  >>> >> -------------------------------------------------------
  >>> >> SF.Net email is sponsored by:
  >>> >> Tame your development challenges with Apache's Geronimo App Server.
  >>> >> Download it for free - -and be entered to win a 42" plasma tv or your
  >>> >> very own Sony(tm)PSP.  Click here to play:
  >>> >> http://sourceforge.net/geronimo.php
  >>> >> _______________________________________________
  >>> >> Bacula-users mailing list
  >>> >> Bacula-users@lists.sourceforge.net
  >>> >> https://lists.sourceforge.net/lists/listinfo/bacula-users
  >>> >
  >>> >--
  >>> >Best regards,
  >>> >
  >>> >Kern
  >>> >
  >>> >  (">
  >>> >  /\
  >>> >  V_V
  >>> >
  >>> >
  >>> >-------------------------------------------------------
  >>> >SF.Net email is sponsored by:
  >>> >Tame your development challenges with Apache's Geronimo App Server.
  >>> > Download it for free - -and be entered to win a 42" plasma tv or your
  >>> > very own Sony(tm)PSP.  Click here to play:
  >>> > http://sourceforge.net/geronimo.php
  >>> > _______________________________________________
  >>> >Bacula-users mailing list
  >>> >Bacula-users@lists.sourceforge.net
  >>> >https://lists.sourceforge.net/lists/listinfo/bacula-users
  >>> 
  >>> -------------------------------------------------------
  >>> SF.Net email is sponsored by:
  >>> Tame your development challenges with Apache's Geronimo App Server.
  >>> Download it for free - -and be entered to win a 42" plasma tv or your very
  >>> own Sony(tm)PSP.  Click here to play: http://sourceforge.net/geronimo.php
  >>> _______________________________________________
  >>> Bacula-users mailing list
  >>> Bacula-users@lists.sourceforge.net
  >>> https://lists.sourceforge.net/lists/listinfo/bacula-users
  >> 
  >> -- 
  >> Best regards,
  >> 
  >> Kern
  >> 
  >> (">
  >> /\
  >> V_V
  >> 
  >> 
  >> -------------------------------------------------------
  >> SF.Net email is sponsored by:
  >> Tame your development challenges with Apache's Geronimo App Server. 
Download
  >> it for free - -and be entered to win a 42" plasma tv or your very own
  >> Sony(tm)PSP.  Click here to play: http://sourceforge.net/geronimo.php
  >> _______________________________________________
  >> Bacula-users mailing list
  >> Bacula-users@lists.sourceforge.net
  >> https://lists.sourceforge.net/lists/listinfo/bacula-users



-------------------------------------------------------
SF.Net email is sponsored by:
Tame your development challenges with Apache's Geronimo App Server. Download
it for free - -and be entered to win a 42" plasma tv or your very own
Sony(tm)PSP.  Click here to play: http://sourceforge.net/geronimo.php
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to