date:20080812

Re: sysinstall compilation issue

2008-08-12 Thread Unga

--- On Fri, 8/8/08, Oliver Fromme <[EMAIL PROTECTED]> wrote:

> From: Oliver Fromme <[EMAIL PROTECTED]>
> Subject: Re: sysinstall compilation issue
> To: freebsd-stable@FreeBSD.ORG, [EMAIL PROTECTED]
> Date: Friday, August 8, 2008, 9:36 PM
> Unga <[EMAIL PROTECTED]> wrote:
>  > This is i386 RELENG_7.
>  > 
>  > Following section of the
> /usr/src/usr.sbin/sysinstall/Makefile does not generate C
> code properly:
>  > 
>  > makedevs.c: Makefile rtermcap
>  > echo '#include '
> > makedevs.c
>  > ${RTERMCAP} ansi | \
>  > file2c 'const char termcap_ansi[]
> = {' ',0};' \
>  > > > makedevs.c
>  > 
>  > At compile time, above expands to:
>  >
> TERMCAP=/usr/src/usr.sbin/sysinstall/../../share/termcap/termcap.src
> ./rtermcap ansi |  file2c 'const char termcap_ansi[] =
> {' ',0};'  >> makedevs.c
>  > 
>  > Which generates C code as follows:
>  > const char termcap_ansi[] = {
>  > 
>  > ,0};
>  > 
>  > That is, it generates 3 lines, which results a
> compilation error.
>  > 
>  > I presume, intended generated code should be:
>  > const char termcap_ansi[] = {' ',0};
>  > 
>  > Am I right?
> 
> No, it should generate an array containung a dump of
> the "ansi" termcap entry.  Please see the
> file2c(1)
> manpage.
> 
>  > What is wrong at my end?
>  > 
>  > Note, I linked the rtermcap with ncursesw libraries,
> can that be the cause? Any ideas?
> 
> Yes, that's the cause.  Why did you do that?
> 
> FreeBSD's ncurses port contains a patch so the
> tgetent()
> function (which is used by rtermcap) returns the actual
> termcap entry in the buffer.  The original ncurses code
> (which is terminfo based) doesn't do that.
> 
> When you linked rtermcap with the wrong library, you
> restored the original behaviour of tgetent(), so the
> output of rtermcap was empty, so file2c produced invalid
> source.
> 

Sorry for my late reply on this. I was not well during weekend, an eye ache due 
to over work :(

Here is the situation at my end, no matter whether I link with ncurses widec 
libs or non-widec libs, its still return the same code as above I mentioned. 

Possible causes can be:
1. The way I compile and install ncurses is not correct.
2. OR some essential files required are missing
3. OR there can be an other option

I used truss as follows:
TERMCAP=/usr/src/usr.sbin/sysinstall/../../share/termcap/termcap.src truss -o 
truss.log -f ./rtermcap ansi |  file2c 'const char termcap_ansi[] = {' ',0};'  
>> makedevs.c

The last few lines of truss.log shows:
9310: stat("/usr/share/misc/terminfo.db",{mode=-rw-r--r-- ,inode=22613331,size=
3170304,blksize=4096}) = 0 (0x0)
 9310: open("/usr/share/misc/terminfo.db",O_RDONLY,0644) = 4 (0x4)
 9310: fcntl(4,F_SETFD,FD_CLOEXEC)   = 0 (0x0)
 9310: read(4,"\0\^F\^Ua\0\0\0\^B\0\0\^D\M-R\0"...,260) = 260 (0x104)
 9310: __sysctl(0xbfbfd574,0x2,0xbfbfd570,0xbfbfd57c,0x0,0x0) = 0 (0x0)
 9310: lseek(4,0x132000,SEEK_SET)= 1253376 (0x132000)
 9310: read(4,"\^^\0\M-|\^O\M-T\^O\M-K\^O\M-*"...,4096) = 4096 (0x1000)
 9310: lseek(4,0x26b000,SEEK_SET)= 2535424 (0x26b000)
 9310: read(4,"\n\0\M-Y\^O\^O\n\0\n\r\^D\^E\^D"...,4096) = 4096 (0x1000)
 9310: close(4)  = 0 (0x0)
 9310: ioctl(2,TIOCGETA,0xbfbfdba4)  = 0 (0x0)
 9310: ioctl(2,TIOCGETA,0x81020d8)   = 0 (0x0)
 9310: ioctl(2,TIOCGETA,0xbfbfdb64)  = 0 (0x0)
 9310: ioctl(2,TIOCGWINSZ,0xbfbfdbc4)= 0 (0x0)
 9310: fstat(1,{mode=p- ,inode=0,size=0,blksize=4096}) = 0 (0x0)
 9310: process exit, rval = 0

Note, above log has no entry to open 
/usr/src/usr.sbin/sysinstall/../../share/termcap/termcap.src.

So what can be the issue, ncurses has not been patched correctly or some 
essential files missing? If you guys think that ncurses has not been patched 
correctly, then I'll open another thread to discuss that.

Best Regards
Unga




  
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

Hi,

Since I added IPv6 to my network, and started really using it, I'm seeing
some strange things happening.

For instance, I'm on machine 2a01:678:1:443::443, and I do :

$ traceroute6 -n 2a01:678:100:2::
traceroute6 to 2a01:678:100:2:: (2a01:678:100:2::) from
2a01:678:1:443::443, 64 hops max, 12 byte packets
 1  2a01:678:1:443::  0.636 ms  0.602 ms  0.525 ms
 2  2a01:678:1:443::  2999.665 ms !A  2999.636 ms !A  2999.680 ms !A

2a01:678:1:443:: is it's default gateway, and is also directly connected to
2a01:678:100:2::, but it does not seem to be able to contact it.

If I log onto the gateway, and I :

$ ping6 -c 1 2a01:678:100:2::
PING6(56=40+8+8 bytes) 2a01:678:100:: --> 2a01:678:100:2::
16 bytes from 2a01:678:100:2::, icmp_seq=0 hlim=64 time=1.146 ms

--- 2a01:678:100:2:: ping6 statistics ---
1 packets transmitted, 1 packets received, 0.0% packet loss
round-trip min/avg/max/std-dev = 1.146/1.146/1.146/0.000 ms

It works, and now, I can :
$ traceroute6 -n 2a01:678:100:2::
traceroute6 to 2a01:678:100:2:: (2a01:678:100:2::) from
2a01:678:1:443::443, 64 hops max, 12 byte packets
 1  2a01:678:1:443::  0.647 ms  0.671 ms  0.417 ms
 2  2a01:678:100:2::  0.852 ms  0.790 ms  0.669 ms

Maybe I'm doing something wrong, but, well, I can't seem to find ou what.

2a01:678:1:443::443 is a 7.0
2a01:678:1:443::is a 6.2
2a01:678:100:2::is a 6.0

Those are not up to date to the latest thing you can get, but they're
production machines, and I'm not really willing to upgrade them unless I
really need to :-)

-- 
Mathieu Arnold

pgp4DzemEQu4t.pgp
Description: PGP signature

Re: neighbor discovery problem

2008-08-12 Thread Jeremy Chadwick

On Tue, Aug 12, 2008 at 09:45:48AM +0200, Mathieu Arnold wrote:
> Since I added IPv6 to my network, and started really using it, I'm seeing
> some strange things happening.
> 
> For instance, I'm on machine 2a01:678:1:443::443, and I do :
> 
> $ traceroute6 -n 2a01:678:100:2::
> traceroute6 to 2a01:678:100:2:: (2a01:678:100:2::) from
> 2a01:678:1:443::443, 64 hops max, 12 byte packets
>  1  2a01:678:1:443::  0.636 ms  0.602 ms  0.525 ms
>  2  2a01:678:1:443::  2999.665 ms !A  2999.636 ms !A  2999.680 ms !A
> 
> 2a01:678:1:443:: is it's default gateway, and is also directly connected to
> 2a01:678:100:2::, but it does not seem to be able to contact it.
> 
> If I log onto the gateway, and I :
> 
> $ ping6 -c 1 2a01:678:100:2::
> PING6(56=40+8+8 bytes) 2a01:678:100:: --> 2a01:678:100:2::
> 16 bytes from 2a01:678:100:2::, icmp_seq=0 hlim=64 time=1.146 ms
> 
> --- 2a01:678:100:2:: ping6 statistics ---
> 1 packets transmitted, 1 packets received, 0.0% packet loss
> round-trip min/avg/max/std-dev = 1.146/1.146/1.146/0.000 ms
> 
> It works, and now, I can :
> $ traceroute6 -n 2a01:678:100:2::
> traceroute6 to 2a01:678:100:2:: (2a01:678:100:2::) from
> 2a01:678:1:443::443, 64 hops max, 12 byte packets
>  1  2a01:678:1:443::  0.647 ms  0.671 ms  0.417 ms
>  2  2a01:678:100:2::  0.852 ms  0.790 ms  0.669 ms
> 
> Maybe I'm doing something wrong, but, well, I can't seem to find ou what.
> 
> 2a01:678:1:443::443 is a 7.0
> 2a01:678:1:443::is a 6.2
> 2a01:678:100:2::is a 6.0
> 
> Those are not up to date to the latest thing you can get, but they're
> production machines, and I'm not really willing to upgrade them unless I
> really need to :-)

Important note: I know absolutely nothing about IPv6.

Do you have ACLs on any of these machines?  !A in traceroute commonly
means there's an ACL blocking said packets:

!A  (communication with destination network administratively prohibited)

A ping from the other host might cause a stateful firewall to begin
allowing said traffic to/from the machine which previously wasn't
working.

If you use a firewall on these machines (ipfw, pf, etc.), I'd recommend
posting your problem to the freebsd-pf list instead.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: cpufreq(4) panic on RELENG_7 (was: Re: Call for bfe(4) testers.)

2008-08-12 Thread pluknet

2008/8/11 John Baldwin <[EMAIL PROTECTED]>:
> On Monday 11 August 2008 12:35:17 pm pluknet wrote:
>> 2008/8/11 John Baldwin <[EMAIL PROTECTED]>:
>> > On Saturday 09 August 2008 07:16:37 am Ulrich Spoerlein wrote:
>> >> Hi John,
>> >>
>> >> I now figured out the "who", the "why" still eludes me.
>> >>
>> >> So, after your MFC of ichss.c on June 27th the device now attaches at my
>> >> laptop. It didn't before, so it could cause no trouble.
>> >>
>> >> With ichss loaded, the kernel will panic 1-3 minutes after powerd has
>> >> been started (if I kill powerd early enough, it seems pretty stable).
>> >>
>> >> I'm now running a kernel from 2008-08-08 with
>> >> hint.ichss.0.disabled="1"
>> >
>> > Ok.  Can you get a crashdump from a crash?
>> >
>>
>> ehm,. I am not Ulrich Spoerlein, but I can help with this issue.
>>
>> my crashdump from kgdb and some debug info.
>> (ouch, I forgot to include it in my prev. mail
>> http://lists.freebsd.org/pipermail/freebsd-stable/2008-August/044182.html )
>>
>> wbr,
>> pluknet
>>
>> Unread portion of the kernel message buffer:
>>
>>
>> Fatal trap 12: page fault while in kernel mode
>> fault virtual address   = 0x38
>> fault code  = supervisor read, page not present
>> instruction pointer = 0x20:0xc056cf46
>> stack pointer   = 0x28:0xe6592ac8
>> frame pointer   = 0x28:0xe6592ac8
>> code segment= base 0x0, limit 0xf, type 0x1b
>> = DPL 0, pres 1, def32 1, gran 1
>> processor eflags= interrupt enabled, resume, IOPL = 0
>> current process = 2507 (powerd)
>> Physical memory: 1014 MB
>> Dumping 120 MB: 105 89 73 57 41 25 9
>>
>> #0  doadump () at pcpu.h:195
>> 195 pcpu.h: No such file or directory.
>> in pcpu.h
>> (kgdb) bt
>> #0  doadump () at pcpu.h:195
>> #1  0xc0458f89 in db_fncall (dummy1=-1010027648, dummy2=0, dummy3=0,
>> dummy4=0xe6592860 "0╛йц") at /media/src-7/sys/ddb/db_command.c:516
>> #2  0xc045953a in db_command (last_cmdp=0xc07dcf14, cmd_table=0x0,
> dopager=1)
>> at /media/src-7/sys/ddb/db_command.c:413
>> #3  0xc0459655 in db_command_loop ()
> at /media/src-7/sys/ddb/db_command.c:466
>> #4  0xc045b17c in db_trap (type=12, code=0)
>> at /media/src-7/sys/ddb/db_main.c:228
>> #5  0xc0575023 in kdb_trap (type=12, code=0, tf=0xe6592a88)
>> at /media/src-7/sys/kern/subr_kdb.c:524
>> #6  0xc07460bf in trap_fatal (frame=0xe6592a88, eva=56)
>> at /media/src-7/sys/i386/i386/trap.c:890
>> #7  0xc074636b in trap_pfault (frame=0xe6592a88, usermode=0, eva=56)
>> at /media/src-7/sys/i386/i386/trap.c:812
>> #8  0xc0746d36 in trap (frame=0xe6592a88)
>> at /media/src-7/sys/i386/i386/trap.c:490
>> #9  0xc072fd4b in calltrap () at /media/src-7/sys/i386/i386/exception.s:139
>> #10 0xc056cf46 in device_is_attached (dev=0x0)
>> at /media/src-7/sys/kern/subr_bus.c:2228
>> #11 0xc0512de7 in cf_set_method (dev=0xc3c9c880, level=0xc4525ef4,
>> priority=100) at /media/src-7/sys/kern/kern_cpu.c:332
>> #12 0xc0511452 in cpufreq_curr_sysctl (oidp=0xc3c8bac0, arg1=0xc3bc7c00,
>> arg2=0, req=0xe6592ba4) at cpufreq_if.h:32
>> ---Type  to continue, or q  to quit---
>> #13 0xc0554b67 in sysctl_root (oidp=Variable "oidp" is not available.
>> )
>> at /media/src-7/sys/kern/kern_sysctl.c:1306
>> #14 0xc0554cd1 in userland_sysctl (td=0xc4245440, name=0xe6592c14,
> namelen=4,
>> old=0x0, oldlenp=0x0, inkernel=0, new=0xbfbfe7c4, newlen=4,
>> retval=0xe6592c10, flags=0) at /media/src-7/sys/kern/kern_sysctl.c:1401
>> #15 0xc0555a7c in __sysctl (td=0xc4245440, uap=0xe6592cfc)
>> at /media/src-7/sys/kern/kern_sysctl.c:1336
>> #16 0xc07466d5 in syscall (frame=0xe6592d38)
>> at /media/src-7/sys/i386/i386/trap.c:1035
>> #17 0xc072fdb0 in Xint0x80_syscall ()
>> at /media/src-7/sys/i386/i386/exception.s:196
>> #18 0x0033 in ?? ()
>> Previous frame inner to this frame (corrupt stack?)
>> (kgdb) f 11
>> #11 0xc0512de7 in cf_set_method (dev=0xc3c9c880, level=0xc4525ef4,
>> priority=100) at /media/src-7/sys/kern/kern_cpu.c:332
>> 332 if (!device_is_attached(set->dev)) {
>> (kgdb) list
>> 327 }
>> 328
>> 329 /* Next, set any/all relative frequencies via their drivers.
> */
>> 330 for (i = 0; i < level->rel_count; i++) {
>> 331 set = &level->rel_set[i];
>> 332 if (!device_is_attached(set->dev)) {
>> 333 error = ENXIO;
>> 334 goto out;
>> 335 }
>> 336
>> (kgdb) p level.rel_count
>> $1 = 1986356271
>> (kgdb) p i
>> $2 = 0
>> (kgdb) p level.rel_set
>> $3 = {{freq = 0, volts = 0, power = 0, lat = 0, dev = 0x0, spec = {0, 0, 0,
>>   0}}, {freq = 0, volts = 0, power = 0, lat = 0, dev = 0x0, spec = {0,
> 0,
>>   0, 0}}, {freq = 0, volts = 0, power = 0, lat = 0, dev = 0x0, spec =
> {0,
>>   0, 0, 0}}, {freq = 0, volts = 0, power = 0, lat = 0, dev = 0x0, spec =
> {
>>   0, 0, 0,

Re: umtxn and Apache 2.2

2008-08-12 Thread Borja Marcos



On Aug 12, 2008, at 12:12 AM, Ivan Voras wrote:


Borja Marcos wrote:

Hello,
I'm running a server with FreeBSD 7-STABLE as of August 8, Apache  
2.2 with mpm/worker and threads support, and PHP 5.2.6.
Everything works like a charm, but I see that Apache is leaking  
processes that get stuck in umtxn state.


I run Apache 2.2 with worker MPM but without mod_php (I use PHP as  
FastCGI) on many servers and I don't have such problems. Maybe it's  
PHP's fault? (I agree you need a trace with debugging symbols).


May me my fault. It's a system that I didn't use to administrate. I  
upgraded it from 6.2-REL to 7-STABLE in place, and recompiled  
packages. But there was a lot of litter. I'm just wondering if that  
could be a problem.


Just in case I've done a thorough cleanup, recompiled needed ports,  
and included debugging support in Apache. Let's see what happens.


And please accept my apologies if this  has been a silly false alarm :)

Thank you very much,




Borja.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: HEADS UP: inpcb/inpcbinfo rwlocking: coming to a 7-STABLE branch near you

2008-08-12 Thread Robert Watson



On Mon, 11 Aug 2008, Mike Tancsa wrote:


At 05:21 PM 8/8/2008, Robert Watson wrote:


http://www.watson.org/~robert/freebsd/netperf/20080808-7stable-rwlock-inpcb.diff

These incude the inpcb/inpcbinfo read/write locking changes (although not 
yet for raw/divert sockets).  Any testing, especially with heavy UDP loads, 
would be much appreciated -- this are fairly complex changes, and also 
quite a complex MFC.


   So far so good with the patches. I am running them on a busy sendmail 
server that also does a lot of DNS locally for itself and a number of other 
boxes.


Excellent news.  I have a couple of other reviews and hopefully some more 
testing coming in, and will commit in a few days if all continues to go well. 
An updated version of the patch is here:


  
http://www.watson.org/~robert/freebsd/netperf/20080811-7stable-rwlock-inpcb.diff

There are no changes from previous versions, but I was asked to regenerate the 
patch with function names, so have done so.


Anyone out there running name servers, NFS over UDP, and other UDP workloads: 
your testing of this patch prior to commit would be much appreciated.


Thanks,

Robert N M Watson
Computer Laboratory
University of Cambridge
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: umtxn and Apache 2.2

2008-08-12 Thread Jeremy Chadwick

On Tue, Aug 12, 2008 at 12:17:05PM +0200, Borja Marcos wrote:
>
> On Aug 12, 2008, at 12:12 AM, Ivan Voras wrote:
>
>> Borja Marcos wrote:
>>> Hello,
>>> I'm running a server with FreeBSD 7-STABLE as of August 8, Apache  
>>> 2.2 with mpm/worker and threads support, and PHP 5.2.6.
>>> Everything works like a charm, but I see that Apache is leaking  
>>> processes that get stuck in umtxn state.
>>
>> I run Apache 2.2 with worker MPM but without mod_php (I use PHP as  
>> FastCGI) on many servers and I don't have such problems. Maybe it's  
>> PHP's fault? (I agree you need a trace with debugging symbols).
>
> May me my fault. It's a system that I didn't use to administrate. I  
> upgraded it from 6.2-REL to 7-STABLE in place, and recompiled packages. 
> But there was a lot of litter. I'm just wondering if that could be a 
> problem.
>
> Just in case I've done a thorough cleanup, recompiled needed ports, and 
> included debugging support in Apache. Let's see what happens.
>
> And please accept my apologies if this  has been a silly false alarm :)

Please be sure to report back with the outcome (in a few days, or
whenever suits you) -- I've seen a report of similar oddities (threads
locking up) on the suPHP mailing list, when using Apache with the worker
MPM.  No one stated what state the process is in (re: umtxn), so I'm not
sure if it's the same issue as what you've seen.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

lagg(4) and failover

2008-08-12 Thread Marian Hettwer

Hi Folks,

I'm using lagg(4) on some of our servers and I'm just wondering how the
failover is implemented.
The manpage isn't quite clear:

 failover Sends and receives traffic only through the master port. 
If
  the master port becomes unavailable, the next active port
is
  used.  The first interface added is the master port; any
  interfaces added after that are used as failover devices.

What is meant by "becomes unavailable"? Is it just the physical link which
needs to become unavailable to trigger a failover?

I do wonder, because there might be other faults where the link is still
active, but the port is unusable. Think of a wrong vlan on the switch
itself.

When using bonding under Linux (yeah, I know, the configuration sucks ;) ),
I can configure the device to check for arp respones of it's default
gateway. If arp to the default gw becomes unavailable, bonding fails over
to the next interface and tries it luck over there.
With that kind of configuration, I could cover a misconfigured switch port
and still have failover.

Long Story short: How is failover in lagg(4) implemented?

Thanks for any hints :)

Or should I ask the OpenBSD boys, since lagg(4) seems to be a port of
trunk(4)?? :)

best regards,
Marian

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: lagg(4) and failover

2008-08-12 Thread Eugene Grosbein

On Tue, Aug 12, 2008 at 12:37:15PM +0200, Marian Hettwer wrote:

> I'm using lagg(4) on some of our servers and I'm just wondering how the
> failover is implemented.
> The manpage isn't quite clear:
> 
>  failover Sends and receives traffic only through the master port. 
> If
>   the master port becomes unavailable, the next active port
> is
>   used.  The first interface added is the master port; any
>   interfaces added after that are used as failover devices.
> 
> What is meant by "becomes unavailable"? Is it just the physical link which
> needs to become unavailable to trigger a failover?

Yes. It seems you need lacp protocol described later in the manual.

Eugene Grosbein
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: Ath driver causes kernel panic (page fault) on 7.0-STABLE during use

2008-08-12 Thread Thomas Zander

Hi,

On Wed, Jul 23, 2008 at 21:37, Edward Ruggeri <[EMAIL PROTECTED]> wrote:

> I recently purchased a Lenovo ThinkPad with a ThinkPad 11a/b/g
> Wireless LAN Mini Express Adapter (based on the AR5212 chipset).  I
> got kernel panics while using the wireless card under 7-STABLE

Do you still have this problem or did you find a workaround?
Does it look something like this? :
http://www.freebsd.org/cgi/query-pr.cgi?pr=126475

I upgraded my notebook from 6.3 to 7-STABLE on the weekend and get
reproducible kernel panics ~2 sec of using the ath network card.

Riggs
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Pete French

> Since I added IPv6 to my network, and started really using it, I'm seeing
> some strange things happening.
>
> For instance, I'm on machine 2a01:678:1:443::443, and I do :
>
> $ traceroute6 -n 2a01:678:100:2::
> traceroute6 to 2a01:678:100:2:: (2a01:678:100:2::) from
> 2a01:678:1:443::443, 64 hops max, 12 byte packets
>  1  2a01:678:1:443::  0.636 ms  0.602 ms  0.525 ms
>  2  2a01:678:1:443::  2999.665 ms !A  2999.636 ms !A  2999.680 ms !A
>
> 2a01:678:1:443:: is it's default gateway, and is also directly connected to
> 2a01:678:100:2::, but it does not seem to be able to contact it.

What are your prefix lengths on the various interfaces, and does
2a01:678:100:2:: have a route back to 2a01:678:1:443::443 ? If you
can show us the config on the interfaces of the three machines then
we might be able to get a better idea. I am imagining how you have these
three boxes connected in my head, but nothing beats an explanation plus
the config :-)

> Maybe I'm doing something wrong, but, well, I can't seem to find ou what.
>
> 2a01:678:1:443::443 is a 7.0
> 2a01:678:1:443::is a 6.2
> 2a01:678:100:2::is a 6.0

I've used all of those with IPv6 and they work fine, it's most likely
a small config problem. I had a lot of frustrations with IPv6 when I
started using it - though now it is working I wouldn't be without it.

-pete.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: lagg(4) and failover

2008-08-12 Thread Marian Hettwer



On Tue, 12 Aug 2008 18:55:52 +0800, Eugene Grosbein <[EMAIL PROTECTED]>
wrote:
> On Tue, Aug 12, 2008 at 12:37:15PM +0200, Marian Hettwer wrote:
> 
>> I'm using lagg(4) on some of our servers and I'm just wondering how the
>> failover is implemented.
>> The manpage isn't quite clear:
>>
>>  failover Sends and receives traffic only through the master
> port.
>> If
>>   the master port becomes unavailable, the next active
> port
>> is
>>   used.  The first interface added is the master port;
> any
>>   interfaces added after that are used as failover
> devices.
>>
>> What is meant by "becomes unavailable"? Is it just the physical link
> which
>> needs to become unavailable to trigger a failover?
> 
> Yes. It seems you need lacp protocol described later in the manual.
> 
Thanks for your answer.
However, IMO lacp doesn't solve that problem. lacp is used for link
aggregation, not failover.
If I'm wrong over there, I should have a read about lacp... should do that
anyway, I guess.

The manpage states "In the event of changes in physical connectivity...".
Again, does that mean, the link needs to be physically unavailable? If so,
it'll be the same behaviour as in failover mode and doesn't solve my
problem of a misconfigured switch...

Cheers,
Marian

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 01:34:03 -0700, Jeremy Chadwick a dit :
| Important note: I know absolutely nothing about IPv6.
| 
| Do you have ACLs on any of these machines?  !A in traceroute commonly
| means there's an ACL blocking said packets:
| 
| !A  (communication with destination network administratively prohibited)
| 
| A ping from the other host might cause a stateful firewall to begin
| allowing said traffic to/from the machine which previously wasn't
| working.
| 
| If you use a firewall on these machines (ipfw, pf, etc.), I'd recommend
| posting your problem to the freebsd-pf list instead.

Hum, no, I've verified it already, there is pf enabled on the gateway, which
is also a firewall, but only on the external interface which does not come in
play here.

-- 
Mathieu Arnold

pgpRJlh29vC4Q.pgp
Description: PGP signature

Re: lagg(4) and failover

2008-08-12 Thread Peter Jeremy

On 2008-Aug-12 18:55:52 +0800, Eugene Grosbein <[EMAIL PROTECTED]> wrote:
>On Tue, Aug 12, 2008 at 12:37:15PM +0200, Marian Hettwer wrote:
>
>> I'm using lagg(4) on some of our servers and I'm just wondering how the
>> failover is implemented.

As far as I can tell, not especially well :-(.  It doesn't seem to detect
much short of layer 1 failure.  In particular, shutting down the switch
port will not trigger a failover.

>> The manpage isn't quite clear:
>> 
>>  failover Sends and receives traffic only through the master port. 
>> If
>>   the master port becomes unavailable, the next active port
>> is
>>   used.  The first interface added is the master port; any
>>   interfaces added after that are used as failover devices.
>> 
>> What is meant by "becomes unavailable"? Is it just the physical link which
>> needs to become unavailable to trigger a failover?

It seems to be,

>Yes. It seems you need lacp protocol described later in the manual.

Actually, lacp and failover are used differently: lacp is primarily
used to increase the bandwidth between the host and the switch whilst
failover is used for redundancy.

With lacp, all the physical interfaces must be connected to a single
switch.  With failover, the physical interfaces will normally be
connected to different switches (so a failure in one switch will not
cause the loss of all connectivity.

-- 
Peter Jeremy
Please excuse any delays as the result of my ISP's inability to implement
an MTA that is either RFC2821-compliant or matches their claimed behaviour.

pgpGk05yQX4JR.pgp
Description: PGP signature

Re: lagg(4) and failover

2008-08-12 Thread Pete French

> However, IMO lacp doesn't solve that problem. lacp is used for link
> aggregation, not failover.

It does both - if one of the links becomes unavailable then it will
stop using it. We use this for failover and it works fine, the only
caveat being that your LACP device at the far end needs to look like
a single phsyical device (the nicer Cisco switches do this quite happily)

> The manpage states "In the event of changes in physical connectivity...".
> Again, does that mean, the link needs to be physically unavailable? If so,
> it'll be the same behaviour as in failover mode and doesn't solve my
> problem of a misconfigured switch...

lagg is to handle failover at the physical layer for when one of your
ether ports fails, or someone unplugs a cable. If I understand you
correctly you are looking for something at the next layer up, to handle
a problem where the ports work fine, but are not going to their expected
destinations. lagg won't do this.

-pete.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Jeremy Chadwick

On Tue, Aug 12, 2008 at 01:17:27PM +0200, Mathieu Arnold wrote:
> +-le 12.08.2008 01:34:03 -0700, Jeremy Chadwick a dit :
> | Important note: I know absolutely nothing about IPv6.
> | 
> | Do you have ACLs on any of these machines?  !A in traceroute commonly
> | means there's an ACL blocking said packets:
> | 
> | !A  (communication with destination network administratively prohibited)
> | 
> | A ping from the other host might cause a stateful firewall to begin
> | allowing said traffic to/from the machine which previously wasn't
> | working.
> | 
> | If you use a firewall on these machines (ipfw, pf, etc.), I'd recommend
> | posting your problem to the freebsd-pf list instead.
> 
> Hum, no, I've verified it already, there is pf enabled on the gateway, which
> is also a firewall, but only on the external interface which does not come in
> play here.

That depends.  Are you using "set skip" on non-external interfaces, or
are you using pass rules to explicitly pass all traffic?

Sorry if it sounds like I'm doubting you, but !A really looks like an
ACL thing.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold



+-le 12.08.2008 13:17:27 +0200, Mathieu Arnold a dit :
| +-le 12.08.2008 01:34:03 -0700, Jeremy Chadwick a dit :
|| Important note: I know absolutely nothing about IPv6.
|| 
|| Do you have ACLs on any of these machines?  !A in traceroute commonly
|| means there's an ACL blocking said packets:
|| 
|| !A  (communication with destination network administratively prohibited)
|| 
|| A ping from the other host might cause a stateful firewall to begin
|| allowing said traffic to/from the machine which previously wasn't
|| working.
|| 
|| If you use a firewall on these machines (ipfw, pf, etc.), I'd recommend
|| posting your problem to the freebsd-pf list instead.
| 
| Hum, no, I've verified it already, there is pf enabled on the gateway, which
| is also a firewall, but only on the external interface which does not come
| in play here.

There's a pass and not a skip, but all my block rules have log, and no
packets show up in pflog, which tends to make me believe that, well, it's not
a firewall problem.

-- 
Mathieu Arnold

pgpzbCYquQM0m.pgp
Description: PGP signature

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 12:02:42 +0100, Pete French a dit :
|> Since I added IPv6 to my network, and started really using it, I'm seeing
|> some strange things happening.
|> 
|> For instance, I'm on machine 2a01:678:1:443::443, and I do :
|> 
|> $ traceroute6 -n 2a01:678:100:2::
|> traceroute6 to 2a01:678:100:2:: (2a01:678:100:2::) from
|> 2a01:678:1:443::443, 64 hops max, 12 byte packets
|>  1  2a01:678:1:443::  0.636 ms  0.602 ms  0.525 ms
|>  2  2a01:678:1:443::  2999.665 ms !A  2999.636 ms !A  2999.680 ms !A
|> 
|> 2a01:678:1:443:: is it's default gateway, and is also directly connected to
|> 2a01:678:100:2::, but it does not seem to be able to contact it.
| 
| What are your prefix lengths on the various interfaces, and does
| 2a01:678:100:2:: have a route back to 2a01:678:1:443::443 ? If you
| can show us the config on the interfaces of the three machines then
| we might be able to get a better idea. I am imagining how you have these
| three boxes connected in my head, but nothing beats an explanation plus
| the config :-)

Hum, 2a01:678:1:443::443 is a /64, and 2a01:678:100:2:: is on a /48, both
have the "same" gateway, that is, the same box, which has :
inet6 2a01:678:1:443:: prefixlen 64
inet6 2a01:678:100:: prefixlen 48

The problem I'm seeing is that before I ping the machine from the gateway,
all ndp -a says about this machine is :

2a01:678:100:2:: (incomplete) em0 1sI  3

when I ping it from the first host, I see :

2a01:678:1:443::443 > 2a01:678:100:2::: ICMP6, echo request, seq 0, length 16
fe80::207:e9ff:fe0e:dead > ff02::1:ff00:0: ICMP6, neighbor solicitation, who
has 2a01:678:100:2::, length 32
fe80::207:e9ff:fe0e:dead > 2a01:678:1:443::443: ICMP6, redirect,
2a01:678:100:2:: to 2a01:678:100:2::, length 104
fe80::2b0:d0ff:fee1:c05f > fe80::207:e9ff:fe0e:dead: ICMP6, neighbor
advertisement, tgt is 2a01:678:100:2::, length 32
fe80::207:e9ff:fe0e:dead > ff02::1:ffe1:c05f: ICMP6, neighbor solicitation,
who has fe80::2b0:d0ff:fee1:c05f, length 32
fe80::2b0:d0ff:fee1:c05f > fe80::207:e9ff:fe0e:dead: ICMP6, neighbor
advertisement, tgt is fe80::2b0:d0ff:fee1:c05f, length 32

and when I ping it from the gateway, I see :

2a01:678:100:25:: > 2a01:678:100::: ICMP6, neighbor solicitation, who has
2a01:678:100::, length 32
2a01:678:100:: > 2a01:678:100:25::: ICMP6, neighbor advertisement, tgt is
2a01:678:100::, length 24
2a01:678:100:: > ff02::1:ff00:0: ICMP6, neighbor solicitation, who has
2a01:678:100:2::, length 32
2a01:678:100:2:: > 2a01:678:100::: ICMP6, neighbor advertisement, tgt is
2a01:678:100:2::, length 32

And I don't understand why there's a difference, and why when the packets
don't come from the gateway, the neighbor solicitation goes up wrong and does
not work.

|> Maybe I'm doing something wrong, but, well, I can't seem to find ou what.
|> 
|> 2a01:678:1:443::443 is a 7.0
|> 2a01:678:1:443::is a 6.2
|> 2a01:678:100:2::is a 6.0
| 
| I've used all of those with IPv6 and they work fine, it's most likely
| a small config problem. I had a lot of frustrations with IPv6 when I
| started using it - though now it is working I wouldn't be without it.

Well, I'm certain it must be some stupid configuration problem, but, well, I
can't seem to find it :-)

-- 
Mathieu Arnold

pgprcESzoGv2f.pgp
Description: PGP signature

Re: neighbor discovery problem

2008-08-12 Thread Jeremy Chadwick

On Tue, Aug 12, 2008 at 01:34:35PM +0200, Mathieu Arnold wrote:
> 
> 
> +-le 12.08.2008 13:17:27 +0200, Mathieu Arnold a dit :
> | +-le 12.08.2008 01:34:03 -0700, Jeremy Chadwick a dit :
> || Important note: I know absolutely nothing about IPv6.
> || 
> || Do you have ACLs on any of these machines?  !A in traceroute commonly
> || means there's an ACL blocking said packets:
> || 
> || !A  (communication with destination network administratively prohibited)
> || 
> || A ping from the other host might cause a stateful firewall to begin
> || allowing said traffic to/from the machine which previously wasn't
> || working.
> || 
> || If you use a firewall on these machines (ipfw, pf, etc.), I'd recommend
> || posting your problem to the freebsd-pf list instead.
> | 
> | Hum, no, I've verified it already, there is pf enabled on the gateway, which
> | is also a firewall, but only on the external interface which does not come
> | in play here.
> 
> There's a pass and not a skip, but all my block rules have log, and no
> packets show up in pflog, which tends to make me believe that, well, it's not
> a firewall problem.

A pass on RELENG_7 will still cause state to be kept (keep state is
implicit on RELENG_7).

Do you see state mismatches?  pfctl -s info.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: lagg(4) and failover

2008-08-12 Thread Pete French

> As far as I can tell, not especially well :-(.  It doesn't seem to detect
> much short of layer 1 failure.  In particular, shutting down the switch
> port will not trigger a failover.

Are you using bce devices as your phsyical interfaces ? Take a look at
the thread from last week about ifconfig - with the patch posted a port
shutdown now *does* trigger a failover quite happily. If you are using
e devices then I suggest you try it.

> With lacp, all the physical interfaces must be connected to a single
> switch.  With failover, the physical interfaces will normally be
> connected to different switches (so a failure in one switch will not
> cause the loss of all connectivity.

This is true - with the caveat that certain pairs of switches can be made
to appear as a single phsyical device for the purposes of LACP, in which
case it works fine for failover.

We have two farms here - an old one using a pair of Cisco 3560s and
a new one using a pair of 3750-Es. The 3750s will act as a single
device and we use LACP on the machines connected to those, but the 3560s
appear as a pair of devices, so for those we use failover mode. LACP
failover always worked fine, and with the bce patch from last week
the normal failover now also works.

Nore that you can enable LACP on the 3560,s and it does appear to negotiate
and work, but the switches keep changing their idea of which port to use every
few seconds. So the connection works, but with high rates of packet loss
as a few go missing every time the switch pair flip-flops.

-pete.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 04:31:23 -0700, Jeremy Chadwick a dit :
| Sorry if it sounds like I'm doubting you, but !A really looks like an
| ACL thing.

Oh, and in traceroute(8), !A is "!A  (communication with destination network
administratively prohibited", which is just right from your point of view,
but, in traceroute6(8), !A is "Destination Unreachable - Address Unreachable"

Though, in traceroute6(8), !P is "Destination Unreachable - Administratively
Prohibited" whereas in traceroute(8), !P is "protocol unreachable"

Yes, I know, much much confusing :-)

-- 
Mathieu Arnold

pgpe5L4wgM5i1.pgp
Description: PGP signature

Re: lagg(4) and failover

2008-08-12 Thread Marian Hettwer

Hi Pete,

On Tue, 12 Aug 2008 12:30:12 +0100, Pete French
<[EMAIL PROTECTED]> wrote:
>> However, IMO lacp doesn't solve that problem. lacp is used for link
>> aggregation, not failover.
> 
> It does both - if one of the links becomes unavailable then it will
> stop using it. We use this for failover and it works fine, the only
> caveat being that your LACP device at the far end needs to look like
> a single phsyical device (the nicer Cisco switches do this quite happily)
> 
thanks for that info.

>> The manpage states "In the event of changes in physical
> connectivity...".
>> Again, does that mean, the link needs to be physically unavailable? If
> so,
>> it'll be the same behaviour as in failover mode and doesn't solve my
>> problem of a misconfigured switch...
> 
> lagg is to handle failover at the physical layer for when one of your
> ether ports fails, or someone unplugs a cable. If I understand you
> correctly you are looking for something at the next layer up, to handle
> a problem where the ports work fine, but are not going to their expected
> destinations. lagg won't do this.
>
Thats unfortunate...
bonding in Linux is capable of doing this and solaris too.
Well then. At least everythings clear now. And in the end, clarifing things
was the reason for that mail thread :)

Cheers,
Marian

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 04:36:57 -0700, Jeremy Chadwick a dit :
| A pass on RELENG_7 will still cause state to be kept (keep state is
| implicit on RELENG_7).

the gateway is a 6.2 ;-)

| Do you see state mismatches?  pfctl -s info.

There are some, but, hum
  searches40881638069911471.5/s
  state-mismatch   86554670.2/s

the box has been up more than 400 days, and the number has not been going up
while I was trying my things. I bet it goes up when something goes wrong in
my network, which happens :-)

-- 
Mathieu Arnold

pgpYYcQyQTvQX.pgp
Description: PGP signature

Re: em(4) on FreeBSD is sometimes annoying

2008-08-12 Thread Martin Sugioarto


> For what it's worth, I have a T60 that dual boots 6.3-R/amd64 and 7.0-R/i386 
> and neither install has this problem.  I can cold boot it with the NIC 
> unplugged, plug in a cable, I get a link light and ifconfig em0 goes to 
> active, dhclient em0 gets an IP successfully.
> 

Did you try to run /etc/rc.d/netif start after you've booted your laptop 
unplugged?

Try to do that, THEN connect the cable.

The problem appears ONLY in this situation. And it's quite common, because you 
often
use your laptop with wireless network and suddenly you decide to connect it to
wired network without having to switch the laptop off.

My NIC is in such a state that I am forced to switch it off, or else I don't get
link signal.

I don't think it's a BIOS firmware problem (I have tried every update).

I can remember that Linux had this issues, too a while ago. It works there now,
but FreeBSD is still the same. Please read the steps how I cause this situation.
It appears ONLY when you do it like I described it.

I've seen that people first boot with plugged in cable and start to play with 
dhclient.
Both is wrong.

Correct steps to reproduce is:
You have to start with unhooked interface AND the line ifconfig_re0="DHCP" in 
/etc/rc.conf.

Then wait until you can login and try to attach the cable.


--
Martin

___
EINE FÜR ALLE: die kostenlose WEB.DE-Plattform für Freunde und Deine
Homepage mit eigenem Namen. Jetzt starten! http://unddu.de/[EMAIL PROTECTED]

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Pete French

> Hum, 2a01:678:1:443::443 is a /64, and 2a01:678:100:2:: is on a /48, both
> have the "same" gateway, that is, the same box, which has :
> inet6 2a01:678:1:443:: prefixlen 64
> inet6 2a01:678:100:: prefixlen 48

O.K., that should work. My best advice here is to do what I did - which is
to disable 'pf' entirely and see if it works then. When it does you know
the IPv6 config is correct and can then work out which rule in 'pf' is
stopping it working when enabled. Sorry, but I van't think of anything else
right now, and I guess that may not be an option on a production machine.

-pete.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: lagg(4) and failover

2008-08-12 Thread Max Laier

On Tuesday 12 August 2008 13:43:29 Marian Hettwer wrote:
> Hi Pete,
>
> On Tue, 12 Aug 2008 12:30:12 +0100, Pete French
>
> <[EMAIL PROTECTED]> wrote:
> >> However, IMO lacp doesn't solve that problem. lacp is used for link
> >> aggregation, not failover.
> >
> > It does both - if one of the links becomes unavailable then it will
> > stop using it. We use this for failover and it works fine, the only
> > caveat being that your LACP device at the far end needs to look like
> > a single phsyical device (the nicer Cisco switches do this quite happily)
>
> thanks for that info.
>
> >> The manpage states "In the event of changes in physical
> >
> > connectivity...".
> >
> >> Again, does that mean, the link needs to be physically unavailable? If
> >
> > so,
> >
> >> it'll be the same behaviour as in failover mode and doesn't solve my
> >> problem of a misconfigured switch...
> >
> > lagg is to handle failover at the physical layer for when one of your
> > ether ports fails, or someone unplugs a cable. If I understand you
> > correctly you are looking for something at the next layer up, to handle
> > a problem where the ports work fine, but are not going to their expected
> > destinations. lagg won't do this.
>
> Thats unfortunate...
> bonding in Linux is capable of doing this and solaris too.
> Well then. At least everythings clear now. And in the end, clarifing things
> was the reason for that mail thread :)

You are looking for net/ifstated

-- 
/"\  Best regards,  | [EMAIL PROTECTED]
\ /  Max Laier  | ICQ #67774661
 X   http://pf4freebsd.love2party.net/  | [EMAIL PROTECTED]
/ \  ASCII Ribbon Campaign  | Against HTML Mail and News
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 04:36:57 -0700, Jeremy Chadwick a dit :
| A pass on RELENG_7 will still cause state to be kept (keep state is
| implicit on RELENG_7).

I've just tried doing pfctl -d, and not help, I *really* don't believe that
it's a pf problem ;-)

-- 
Mathieu Arnold

pgp0e94czU8F1.pgp
Description: PGP signature

Re: lagg(4) and failover

2008-08-12 Thread Peter Jeremy

On 2008-Aug-12 13:43:29 +0200, Marian Hettwer <[EMAIL PROTECTED]> wrote:
>> lagg is to handle failover at the physical layer for when one of your
>> ether ports fails, or someone unplugs a cable. If I understand you
>>
>Thats unfortunate...

I tend to agree.

>bonding in Linux is capable of doing this and solaris too.

It shouldn't be too difficult to create something that behaves
functionally similarly to Slowaris ipmpd (and with marginally more
effort, you could create something that could be configured to behave
sensibly).

-- 
Peter Jeremy
Please excuse any delays as the result of my ISP's inability to implement
an MTA that is either RFC2821-compliant or matches their claimed behaviour.

pgph5IlfzxY1r.pgp
Description: PGP signature

Re: neighbor discovery problem

2008-08-12 Thread Jeremy Chadwick

On Tue, Aug 12, 2008 at 02:01:45PM +0200, Mathieu Arnold wrote:
> +-le 12.08.2008 04:36:57 -0700, Jeremy Chadwick a dit :
> | A pass on RELENG_7 will still cause state to be kept (keep state is
> | implicit on RELENG_7).
> 
> I've just tried doing pfctl -d, and not help, I *really* don't believe that
> it's a pf problem ;-)

Cool -- just covering all the bases!  :-)

P.S. -- The high number of state-mismatches you have may be the result
of PR 125261, which has been fixed (in RELENG_7 and HEAD).  It's
unrelated to your issue, but I thought I'd mention it anyways.

http://www.freebsd.org/cgi/query-pr.cgi?pr=125261

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 12:50:35 +0100, Pete French a dit :
|> Hum, 2a01:678:1:443::443 is a /64, and 2a01:678:100:2:: is on a /48, both
|> have the "same" gateway, that is, the same box, which has :
|> inet6 2a01:678:1:443:: prefixlen 64
|> inet6 2a01:678:100:: prefixlen 48
| 
| O.K., that should work. My best advice here is to do what I did - which is
| to disable 'pf' entirely and see if it works then. When it does you know
| the IPv6 config is correct and can then work out which rule in 'pf' is
| stopping it working when enabled. Sorry, but I van't think of anything else
| right now, and I guess that may not be an option on a production machine.

Well, stopping the firewall a few seconds can't do really bad things, and,
well, pfctl -d, wait a bit, try again, still no luck...

The network is pretty simple,

gateway :
em0: flags=8843 mtu 1500
options=b
inet6 fe80::207:e9ff:fe0e:dead%em0 prefixlen 64 scopeid 0x1 
inet6 2a01:678:1:443:: prefixlen 64 
inet6 2a01:678:100:: prefixlen 48 

first machine :
bridge0: flags=8843 metric 0 mtu 1500
inet6 fe80::2d0:dead:beef:cafe%bridge0 prefixlen 64 scopeid 0x4 
inet6 2a01:678:1:443::443 prefixlen 64 

destination machine :
fxp0: flags=8843 mtu 1500
options=8
inet6 fe80::2b0:d0ff:fee1:c05f%fxp0 prefixlen 64 scopeid 0x1 
inet6 2a01:678:100:2:: prefixlen 48 


All three ports on a switch.

I don't believe it's a problem in if_bridge(4) because it's the same if I try
from outside my network.

-- 
Mathieu Arnold

pgpsPPJRM7sTR.pgp
Description: PGP signature

Re: lagg(4) and failover

2008-08-12 Thread Marian Hettwer

Hi Max,

On Tue, 12 Aug 2008 14:00:18 +0200, Max Laier <[EMAIL PROTECTED]> wrote:
>> Thats unfortunate...
>> bonding in Linux is capable of doing this and solaris too.
>> Well then. At least everythings clear now. And in the end, clarifing
> things
>> was the reason for that mail thread :)
> 
> You are looking for net/ifstated
>
at a first glance into pkg-descr. Yeah, seems like I'm looking for
ifstated.
Thanks for the heads up :-)

Cheers,
Marian


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold



+-le 12.08.2008 05:10:05 -0700, Jeremy Chadwick a dit :
| On Tue, Aug 12, 2008 at 02:01:45PM +0200, Mathieu Arnold wrote:
|> +-le 12.08.2008 04:36:57 -0700, Jeremy Chadwick a dit :
|> | A pass on RELENG_7 will still cause state to be kept (keep state is
|> | implicit on RELENG_7).
|> 
|> I've just tried doing pfctl -d, and not help, I *really* don't believe that
|> it's a pf problem ;-)
| 
| Cool -- just covering all the bases!  :-)

Well, could have been, I'm getting crazy with this problem.

| P.S. -- The high number of state-mismatches you have may be the result
| of PR 125261, which has been fixed (in RELENG_7 and HEAD).  It's
| unrelated to your issue, but I thought I'd mention it anyways.
|
| http://www.freebsd.org/cgi/query-pr.cgi?pr=125261

I have plans to upgrade the gateway to RELENG_7, but, I'm supposed to be on
vacation right now, so, it'll wait until I get back.

-- 
Mathieu Arnold

pgp1AgQWR8Dwo.pgp
Description: PGP signature

Re: lagg(4) and failover

2008-08-12 Thread Marian Hettwer

Hi Peter,

On Tue, 12 Aug 2008 22:03:07 +1000, Peter Jeremy
<[EMAIL PROTECTED]> wrote:
> On 2008-Aug-12 13:43:29 +0200, Marian Hettwer <[EMAIL PROTECTED]> wrote:
>>> lagg is to handle failover at the physical layer for when one of your
>>> ether ports fails, or someone unplugs a cable. If I understand you
>>>
>>Thats unfortunate...
> 
> I tend to agree.
> 
>>bonding in Linux is capable of doing this and solaris too.
> 
> It shouldn't be too difficult to create something that behaves
> functionally similarly to Slowaris ipmpd (and with marginally more
> effort, you could create something that could be configured to behave
> sensibly).
> 
har har. Yeah, you're right.
But as Max pointed out, theres net/ifstated. I'll have a look into that
tiny daemon :)

Cheers,
Marian

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Pete French

> The network is pretty simple,
>
> gateway :
> em0: flags=8843 mtu 1500
> options=b
> inet6 fe80::207:e9ff:fe0e:dead%em0 prefixlen 64 scopeid 0x1 
> inet6 2a01:678:1:443:: prefixlen 64 
> inet6 2a01:678:100:: prefixlen 48 

Hmmm, are machine numbers of all zeroes legal in IPv6 ? I would
configure those as '2a01:678:1:443::1' and '2a01:678:100::1' if
I were you. All zeroes as a machine number is certainly a no-no in
IPv4, and I wouldn't use it in IPv6 either. I have no idea if it's
actually a problem, but it gives me a bad feeling

-pete.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: Ath driver causes kernel panic (page fault) on 7.0-STABLE during use

2008-08-12 Thread Edward Ruggeri

On Tue, Aug 12, 2008 at 5:34 AM, Thomas Zander
<[EMAIL PROTECTED]> wrote:
> Hi,
>
> On Wed, Jul 23, 2008 at 21:37, Edward Ruggeri <[EMAIL PROTECTED]> wrote:
>
>> I recently purchased a Lenovo ThinkPad with a ThinkPad 11a/b/g
>> Wireless LAN Mini Express Adapter (based on the AR5212 chipset).  I
>> got kernel panics while using the wireless card under 7-STABLE
>
> Do you still have this problem or did you find a workaround?
> Does it look something like this? :
> http://www.freebsd.org/cgi/query-pr.cgi?pr=126475
>
> I upgraded my notebook from 6.3 to 7-STABLE on the weekend and get
> reproducible kernel panics ~2 sec of using the ath network card.

I never really got an answer about this bug.  I'm sorry someone else has it.

The bug went away when I updated world and rebuilt the kernel maybe
two weeks ago.  There didn't seem to be a recent change to the ath
driver at the time, so maybe it was something outside it that caused
the problem.  It's hard to say...

I am no longer on the freebsd-stable list, so if there's anything else
I can do, please feel free to write me directly.

Sincerely,

-- Ned Ruggeri
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Max Laier

On Tuesday 12 August 2008 13:35:33 Mathieu Arnold wrote:
> +-le 12.08.2008 12:02:42 +0100, Pete French a dit :
> |> Since I added IPv6 to my network, and started really using it, I'm
> |> seeing some strange things happening.
> |>
> |> For instance, I'm on machine 2a01:678:1:443::443, and I do :
> |>
> |> $ traceroute6 -n 2a01:678:100:2::
> |> traceroute6 to 2a01:678:100:2:: (2a01:678:100:2::) from
> |> 2a01:678:1:443::443, 64 hops max, 12 byte packets
> |>  1  2a01:678:1:443::  0.636 ms  0.602 ms  0.525 ms
> |>  2  2a01:678:1:443::  2999.665 ms !A  2999.636 ms !A  2999.680 ms !A
> |>
> |> 2a01:678:1:443:: is it's default gateway, and is also directly connected
> |> to 2a01:678:100:2::, but it does not seem to be able to contact it.
> |
> | What are your prefix lengths on the various interfaces, and does
> | 2a01:678:100:2:: have a route back to 2a01:678:1:443::443 ? If you
> | can show us the config on the interfaces of the three machines then
> | we might be able to get a better idea. I am imagining how you have these
> | three boxes connected in my head, but nothing beats an explanation plus
> | the config :-)
>
> Hum, 2a01:678:1:443::443 is a /64, and 2a01:678:100:2:: is on a /48, both
> have the "same" gateway, that is, the same box, which has :
> inet6 2a01:678:1:443:: prefixlen 64
> inet6 2a01:678:100:: prefixlen 48
>
> The problem I'm seeing is that before I ping the machine from the gateway,
> all ndp -a says about this machine is :
>
> 2a01:678:100:2:: (incomplete) em0 1sI 
> 3
>
> when I ping it from the first host, I see :
>
> 2a01:678:1:443::443 > 2a01:678:100:2::: ICMP6, echo request, seq 0, length
> 16 fe80::207:e9ff:fe0e:dead > ff02::1:ff00:0: ICMP6, neighbor solicitation,
> who has 2a01:678:100:2::, length 32
> fe80::207:e9ff:fe0e:dead > 2a01:678:1:443::443: ICMP6, redirect,
 ^--^

There is your problem.  You have all three hosts connected to the same 
broadcast domain?  Disable redirects or separate the domains using VLAN or 
similar - otherwise the gateway will get smart and try to avoid work.

> 2a01:678:100:2:: to 2a01:678:100:2::, length 104
> fe80::2b0:d0ff:fee1:c05f > fe80::207:e9ff:fe0e:dead: ICMP6, neighbor
> advertisement, tgt is 2a01:678:100:2::, length 32
> fe80::207:e9ff:fe0e:dead > ff02::1:ffe1:c05f: ICMP6, neighbor solicitation,
> who has fe80::2b0:d0ff:fee1:c05f, length 32
> fe80::2b0:d0ff:fee1:c05f > fe80::207:e9ff:fe0e:dead: ICMP6, neighbor
> advertisement, tgt is fe80::2b0:d0ff:fee1:c05f, length 32
>
> and when I ping it from the gateway, I see :
>
> 2a01:678:100:25:: > 2a01:678:100::: ICMP6, neighbor solicitation, who has
> 2a01:678:100::, length 32
> 2a01:678:100:: > 2a01:678:100:25::: ICMP6, neighbor advertisement, tgt is
> 2a01:678:100::, length 24
> 2a01:678:100:: > ff02::1:ff00:0: ICMP6, neighbor solicitation, who has
> 2a01:678:100:2::, length 32
> 2a01:678:100:2:: > 2a01:678:100::: ICMP6, neighbor advertisement, tgt is
> 2a01:678:100:2::, length 32
>
> And I don't understand why there's a difference, and why when the packets
> don't come from the gateway, the neighbor solicitation goes up wrong and
> does not work.
>
> |> Maybe I'm doing something wrong, but, well, I can't seem to find ou
> |> what.
> |>
> |> 2a01:678:1:443::443 is a 7.0
> |> 2a01:678:1:443::is a 6.2
> |> 2a01:678:100:2::is a 6.0
> |
> | I've used all of those with IPv6 and they work fine, it's most likely
> | a small config problem. I had a lot of frustrations with IPv6 when I
> | started using it - though now it is working I wouldn't be without it.
>
> Well, I'm certain it must be some stupid configuration problem, but, well,
> I can't seem to find it :-)

-- 
/"\  Best regards,  | [EMAIL PROTECTED]
\ /  Max Laier  | ICQ #67774661
 X   http://pf4freebsd.love2party.net/  | [EMAIL PROTECTED]
/ \  ASCII Ribbon Campaign  | Against HTML Mail and News
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 13:26:00 +0100, Pete French a dit :
|> The network is pretty simple,
|> 
|> gateway :
|> em0: flags=8843 mtu 1500
|> options=b
|> inet6 fe80::207:e9ff:fe0e:dead%em0 prefixlen 64 scopeid 0x1 
|> inet6 2a01:678:1:443:: prefixlen 64 
|> inet6 2a01:678:100:: prefixlen 48 
| 
| Hmmm, are machine numbers of all zeroes legal in IPv6 ? I would
| configure those as '2a01:678:1:443::1' and '2a01:678:100::1' if
| I were you. All zeroes as a machine number is certainly a no-no in
| IPv4, and I wouldn't use it in IPv6 either. I have no idea if it's
| actually a problem, but it gives me a bad feeling

Well, hum, I'm pretty sure there's no network and broadcast addresses in
IPv6. Routing works, ping works, only neighbor discovery does strange things,
from time to time. I'm a bit reluctant to change all that the eve of my going
on vacation.

I just discovered that I can't ping6 ff02::1%em0 on the gateway, well, I can,
there are answers comming back, but, hum, they don't seem to sink in, and
ping6 don't say anything.

I'll investigate.

-- 
Mathieu Arnold

pgpkJ4mjdQXIE.pgp
Description: PGP signature

Re: kernel panic

2008-08-12 Thread John Baldwin

On Tuesday 12 August 2008 02:42:52 am Johan Kuuse wrote:
> On Monday 11 August 2008 23:04:30 John Baldwin wrote:
> > On Sunday 10 August 2008 10:01:49 pm Johan Kuuse wrote:
> > > Hi,
> > >
> > > I am a kgdb newbie, so please be patient.
> > > I suspect (just based on the fact that this is the 4th time I edit text
> >
> > files on my NTFS partition through ntfs-3g, using Emacs, and getting
> > frequent I/O error messages inside Emacs, and then a kernel panic) that
> > this is a ntfs-3g related problem.
> >
> > > If you ask me exactly how to reproduce it, I sorry, I can tell you
> > > exactly
> >
> > (but see the kgdb output below).
> >
> > > Anyway, the kernel seems to panic at /usr/src/sys/kern/vfs_bio.c:1530
> > >
> > > Just a suggestion for a patch (without knowing the functionality
> >
> > of /usr/src/sys/kern/vfs_bio.c):
> > > The line where the kernel panics:
> > > /usr/src/sys/kern/vfs_bio.c:
> > > --
> > > VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
> > > ...
> > > --
> > >
> > > Comparing to another file, which does error checking before calling
> >
> > VM_OBJECT_LOCK:
> > > /usr/src/sys/kern/vfs_aio.c:
> > > --
> > > if (vp->v_object != NULL) {
> > > VM_OBJECT_LOCK(vp->v_object);
> > > ...
> > > --
> > >
> > > Perhaps the kernel panic could be avoided with the following patch?
> > > /usr/src/sys/kern/vfs_bio.c (suggested patch):
> > > --
> > > if ((bp->b_bufobj != NULL) && (bp->b_bufobj->bo_object != NULL)) {
> > > VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
> > > ...
> > > --
> > >
> > > Please let me know if you need more information.
> > >
> > > Regards,
> > > Johan Kuuse
> > >
> > > ---
> > > kgdb kernel.debug
> > > /var/crash/vmcore.1
> > > [GDB will not be able to debug user-mode threads:
> > > /usr/lib/libthread_db.so:
> >
> > Undefined symbol "ps_pglobal_lookup"]
> >
> > > GNU gdb 6.1.1 [FreeBSD]
> > > Copyright 2004 Free Software Foundation, Inc.
> > > GDB is free software, covered by the GNU General Public License, and
> > > you are welcome to change it and/or distribute copies of it under
> > > certain
> >
> > conditions.
> >
> > > Type "show copying" to see the conditions.
> > > There is absolutely no warranty for GDB.  Type "show warranty" for
> > > details. This GDB was configured as "i386-marcel-freebsd".
> > >
> > > Unread portion of the kernel message buffer:
> > >
> > >
> > > Fatal trap 12: page fault while in kernel mode
> > > cpuid = 0; apic id = 00
> > > fault virtual address   = 0x34
> > > fault code  = supervisor read, page not present
> > > instruction pointer = 0x20:0xc07b6de4
> > > stack pointer   = 0x28:0xe79de7c8
> > > frame pointer   = 0x28:0xe79de7e8
> > > code segment= base 0x0, limit 0xf, type 0x1b
> > > = DPL 0, pres 1, def32 1, gran 1
> > > processor eflags= interrupt enabled, resume, IOPL = 0
> > > current process = 1214 (opera)
> > > trap number = 12
> > > panic: page fault
> > > cpuid = 0
> > > Uptime: 5h20m30s
> > > Physical memory: 2035 MB
> > > Dumping 218 MB: 203 187 171 155 139 123 107 91 75 59 43 27 11
> > >
> > > #0  doadump () at pcpu.h:195
> > > 195 __asm __volatile("movl %%fs:0,%0" : "=r" (td));
> > > (kgdb) list *0xc07b6de4
> > > 0xc07b6de4 is in vfs_vmio_release (/usr/src/sys/kern/vfs_bio.c:1530).
> > > 1525vfs_vmio_release(struct buf *bp)
> > > 1526{
> > > 1527int i;
> > > 1528vm_page_t m;
> > > 1529
> > > 1530VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
> > > 1531vm_page_lock_queues();
> > > 1532for (i = 0; i < bp->b_npages; i++) {
> > > 1533m = bp->b_pages[i];
> > > 1534bp->b_pages[i] = NULL;
> > > (kgdb) bt
> > > #0  doadump () at pcpu.h:195
> > > #1  0xc0754457 in boot (howto=260) at
> > > /usr/src/sys/kern/kern_shutdown.c:409 #2  0xc0754719 in panic
> > > (fmt=Variable "fmt" is not available.
> > > ) at /usr/src/sys/kern/kern_shutdown.c:563
> > > #3  0xc0a4905c in trap_fatal (frame=0xe79de788, eva=52)
> >
> > at /usr/src/sys/i386/i386/trap.c:899
> >
> > > #4  0xc0a492e0 in trap_pfault (frame=0xe79de788, usermode=0, eva=52)
> >
> > at /usr/src/sys/i386/i386/trap.c:812
> >
> > > #5  0xc0a49c8c in trap (frame=0xe79de788)
> >
> > at /usr/src/sys/i386/i386/trap.c:490
> >
> > > #6  0xc0a2fc0b in calltrap () at /usr/src/sys/i386/i386/exception.s:139
> > > #7  0xc07b6de4 in vfs_vmio_release (bp=0xd927e33c)
> >
> > at /usr/src/sys/kern/vfs_bio.c:1530
> >
> > > #8  0xc07b8a81 in getnewbuf (slpflag=0, slptimeo=0, size=Variable
> > > "size" is
> >
> > not available.
> >
> > > ) at /usr/src/sys/kern/vfs_bio.c:1847
> > > #9  0xc07ba118 in getblk (vp=0xc

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 14:53:00 +0200, Mathieu Arnold a dit :
| I'll investigate.

Ok, I rebooted the gateway, and now, it works just as it should...

Maybe once upon a time, I did something strange, which borked everything...

Anyway, thanks all :-)

-- 
Mathieu Arnold

pgpFAKhbnBRan.pgp
Description: PGP signature

Re: neighbor discovery problem

2008-08-12 Thread Mathieu Arnold

+-le 12.08.2008 14:53:24 +0200, Max Laier a dit :
|> 2a01:678:1:443::443 > 2a01:678:100:2::: ICMP6, echo request, seq 0, length
|> 16 fe80::207:e9ff:fe0e:dead > ff02::1:ff00:0: ICMP6, neighbor solicitation,
|> who has 2a01:678:100:2::, length 32
|> fe80::207:e9ff:fe0e:dead > 2a01:678:1:443::443: ICMP6, redirect,
|  ^--^
| 
| There is your problem.  You have all three hosts connected to the same 
| broadcast domain?  Disable redirects or separate the domains using VLAN or 
| similar - otherwise the gateway will get smart and try to avoid work.

Hum, yes, the same domain, but it must not have been the problem, now, it
works, and I still have :

fe80::207:e9ff:fe0e:dead > ff02::1:ff00:0: ICMP6, neighbor solicitation, who
has 2a01:678:100:2::, length 32
fe80::207:e9ff:fe0e:dead > 2a01:678:1:443::443: ICMP6, redirect,
2a01:678:100:2:: to 2a01:678:100:2::, length 104
fe80::2b0:d0ff:fee1:c05f > fe80::207:e9ff:fe0e:dead: ICMP6, neighbor
advertisement, tgt is 2a01:678:100:2::, length 32

but it works.

-- 
Mathieu Arnold

pgpKeAREtE5Hl.pgp
Description: PGP signature

Re: cpufreq(4) panic on RELENG_7 (was: Re: Call for bfe(4) testers.)

2008-08-12 Thread John Baldwin

On Tuesday 12 August 2008 04:36:29 am pluknet wrote:
> 2008/8/11 John Baldwin <[EMAIL PROTECTED]>:
> > On Monday 11 August 2008 12:35:17 pm pluknet wrote:
> >> 2008/8/11 John Baldwin <[EMAIL PROTECTED]>:
> >> > On Saturday 09 August 2008 07:16:37 am Ulrich Spoerlein wrote:
> >> >> Hi John,
> >> >>
> >> >> I now figured out the "who", the "why" still eludes me.
> >> >>
> >> >> So, after your MFC of ichss.c on June 27th the device now attaches at 
my
> >> >> laptop. It didn't before, so it could cause no trouble.
> >> >>
> >> >> With ichss loaded, the kernel will panic 1-3 minutes after powerd has
> >> >> been started (if I kill powerd early enough, it seems pretty stable).
> >> >>
> >> >> I'm now running a kernel from 2008-08-08 with
> >> >> hint.ichss.0.disabled="1"
> >> >
> >> > Ok.  Can you get a crashdump from a crash?
> >> >
> >>
> >> ehm,. I am not Ulrich Spoerlein, but I can help with this issue.
> >>
> >> my crashdump from kgdb and some debug info.
> >> (ouch, I forgot to include it in my prev. mail
> >> 
http://lists.freebsd.org/pipermail/freebsd-stable/2008-August/044182.html )
> >>
> >> wbr,
> >> pluknet
> >>
> >> Unread portion of the kernel message buffer:
> >>
> >>
> >> Fatal trap 12: page fault while in kernel mode
> >> fault virtual address   = 0x38
> >> fault code  = supervisor read, page not present
> >> instruction pointer = 0x20:0xc056cf46
> >> stack pointer   = 0x28:0xe6592ac8
> >> frame pointer   = 0x28:0xe6592ac8
> >> code segment= base 0x0, limit 0xf, type 0x1b
> >> = DPL 0, pres 1, def32 1, gran 1
> >> processor eflags= interrupt enabled, resume, IOPL = 0
> >> current process = 2507 (powerd)
> >> Physical memory: 1014 MB
> >> Dumping 120 MB: 105 89 73 57 41 25 9
> >>
> >> #0  doadump () at pcpu.h:195
> >> 195 pcpu.h: No such file or directory.
> >> in pcpu.h
> >> (kgdb) bt
> >> #0  doadump () at pcpu.h:195
> >> #1  0xc0458f89 in db_fncall (dummy1=-1010027648, dummy2=0, dummy3=0,
> >> dummy4=0xe6592860 "0╛йц") at /media/src-7/sys/ddb/db_command.c:516
> >> #2  0xc045953a in db_command (last_cmdp=0xc07dcf14, cmd_table=0x0,
> > dopager=1)
> >> at /media/src-7/sys/ddb/db_command.c:413
> >> #3  0xc0459655 in db_command_loop ()
> > at /media/src-7/sys/ddb/db_command.c:466
> >> #4  0xc045b17c in db_trap (type=12, code=0)
> >> at /media/src-7/sys/ddb/db_main.c:228
> >> #5  0xc0575023 in kdb_trap (type=12, code=0, tf=0xe6592a88)
> >> at /media/src-7/sys/kern/subr_kdb.c:524
> >> #6  0xc07460bf in trap_fatal (frame=0xe6592a88, eva=56)
> >> at /media/src-7/sys/i386/i386/trap.c:890
> >> #7  0xc074636b in trap_pfault (frame=0xe6592a88, usermode=0, eva=56)
> >> at /media/src-7/sys/i386/i386/trap.c:812
> >> #8  0xc0746d36 in trap (frame=0xe6592a88)
> >> at /media/src-7/sys/i386/i386/trap.c:490
> >> #9  0xc072fd4b in calltrap () 
at /media/src-7/sys/i386/i386/exception.s:139
> >> #10 0xc056cf46 in device_is_attached (dev=0x0)
> >> at /media/src-7/sys/kern/subr_bus.c:2228
> >> #11 0xc0512de7 in cf_set_method (dev=0xc3c9c880, level=0xc4525ef4,
> >> priority=100) at /media/src-7/sys/kern/kern_cpu.c:332
> >> #12 0xc0511452 in cpufreq_curr_sysctl (oidp=0xc3c8bac0, arg1=0xc3bc7c00,
> >> arg2=0, req=0xe6592ba4) at cpufreq_if.h:32
> >> ---Type  to continue, or q  to quit---
> >> #13 0xc0554b67 in sysctl_root (oidp=Variable "oidp" is not available.
> >> )
> >> at /media/src-7/sys/kern/kern_sysctl.c:1306
> >> #14 0xc0554cd1 in userland_sysctl (td=0xc4245440, name=0xe6592c14,
> > namelen=4,
> >> old=0x0, oldlenp=0x0, inkernel=0, new=0xbfbfe7c4, newlen=4,
> >> retval=0xe6592c10, flags=0) 
at /media/src-7/sys/kern/kern_sysctl.c:1401
> >> #15 0xc0555a7c in __sysctl (td=0xc4245440, uap=0xe6592cfc)
> >> at /media/src-7/sys/kern/kern_sysctl.c:1336
> >> #16 0xc07466d5 in syscall (frame=0xe6592d38)
> >> at /media/src-7/sys/i386/i386/trap.c:1035
> >> #17 0xc072fdb0 in Xint0x80_syscall ()
> >> at /media/src-7/sys/i386/i386/exception.s:196
> >> #18 0x0033 in ?? ()
> >> Previous frame inner to this frame (corrupt stack?)
> >> (kgdb) f 11
> >> #11 0xc0512de7 in cf_set_method (dev=0xc3c9c880, level=0xc4525ef4,
> >> priority=100) at /media/src-7/sys/kern/kern_cpu.c:332
> >> 332 if (!device_is_attached(set->dev)) {
> >> (kgdb) list
> >> 327 }
> >> 328
> >> 329 /* Next, set any/all relative frequencies via their 
drivers.
> > */
> >> 330 for (i = 0; i < level->rel_count; i++) {
> >> 331 set = &level->rel_set[i];
> >> 332 if (!device_is_attached(set->dev)) {
> >> 333 error = ENXIO;
> >> 334 goto out;
> >> 335 }
> >> 336
> >> (kgdb) p level.rel_count
> >> $1 = 1986356271
> >> (kgdb) p i
> >> $2 = 0
> >> (kgdb) p level.rel_set
> >> $3 = {{freq = 0, volts = 0, power = 0, lat = 0, dev = 0x0, spe

Re: lagg(4) and failover

2008-08-12 Thread Andrew Thompson

On Tue, Aug 12, 2008 at 12:37:15PM +0200, Marian Hettwer wrote:
> Hi Folks,
> 
> I'm using lagg(4) on some of our servers and I'm just wondering how the
> failover is implemented.
> The manpage isn't quite clear:
> 
>  failover Sends and receives traffic only through the master port. 
> If
>   the master port becomes unavailable, the next active port
> is
>   used.  The first interface added is the master port; any
>   interfaces added after that are used as failover devices.
> 
> What is meant by "becomes unavailable"? Is it just the physical link which
> needs to become unavailable to trigger a failover?
> 
> I do wonder, because there might be other faults where the link is still
> active, but the port is unusable. Think of a wrong vlan on the switch
> itself.
> 
> When using bonding under Linux (yeah, I know, the configuration sucks ;) ),
> I can configure the device to check for arp respones of it's default
> gateway. If arp to the default gw becomes unavailable, bonding fails over
> to the next interface and tries it luck over there.
> With that kind of configuration, I could cover a misconfigured switch port
> and still have failover.
> 
> Long Story short: How is failover in lagg(4) implemented?

It is simply performed on the physical link state, nothing more.

Adding smarter methods of detecting the link such as what Linux does are
very welcome. You may want to also look at LACP mode where heatbeat
frames are exchanged with the peer.


Andrew
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: lagg(4) and failover

2008-08-12 Thread Andrew Thompson

On Tue, Aug 12, 2008 at 09:24:30PM +1000, Peter Jeremy wrote:
> On 2008-Aug-12 18:55:52 +0800, Eugene Grosbein <[EMAIL PROTECTED]> wrote:
> >On Tue, Aug 12, 2008 at 12:37:15PM +0200, Marian Hettwer wrote:
> >
> >> I'm using lagg(4) on some of our servers and I'm just wondering how the
> >> failover is implemented.
> 
> As far as I can tell, not especially well :-(.  It doesn't seem to detect
> much short of layer 1 failure.  In particular, shutting down the switch
> port will not trigger a failover.
> 
> >> The manpage isn't quite clear:
> >> 
> >>  failover Sends and receives traffic only through the master port. 
> >> If
> >>   the master port becomes unavailable, the next active port
> >> is
> >>   used.  The first interface added is the master port; any
> >>   interfaces added after that are used as failover devices.
> >> 
> >> What is meant by "becomes unavailable"? Is it just the physical link which
> >> needs to become unavailable to trigger a failover?
> 
> It seems to be,
> 
> >Yes. It seems you need lacp protocol described later in the manual.
> 
> Actually, lacp and failover are used differently: lacp is primarily
> used to increase the bandwidth between the host and the switch whilst
> failover is used for redundancy.
> 
> With lacp, all the physical interfaces must be connected to a single
> switch.  With failover, the physical interfaces will normally be
> connected to different switches (so a failure in one switch will not
> cause the loss of all connectivity.

Actually you can use lacp in failover mode by connecting interfaces to
different switches. It will only bundle an aggregation to one switch at
a time but if that becomes unavailable then it will automatically choose
the next switch.


Andrew
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

command not found: problem with dash in filenames

2008-08-12 Thread KES

NOTICE: 
when I run purepw it is located and runned, but 
when I run pure-pw (NOTICE: dash in name) I get: Command not found
kes# env |grep PATH
PATH=/sbin:/bin:/usr/sbin:/usr/bin:/usr/games:/usr/local/sbin:/usr/local/bin:/root/bin
kes# env | grep PATH
PATH=/sbin:/bin:/usr/sbin:/usr/bin:/usr/games:/usr/local/sbin:/usr/local/bin:/root/bin
kes# ls -l /usr/local/bin/pure*
-r-xr-xr-x  1 root  wheel  24052 12 авг 06:24 /usr/local/bin/pure-pw
-r-xr-xr-x  1 root  wheel   4432 12 авг 06:24 /usr/local/bin/pure-pwconvert
-r-xr-xr-x  1 root  wheel   6016 12 авг 06:24 /usr/local/bin/pure-statsdecode
kes# pure-pw
pure-pw: Command not found
kes# /usr/local/bin/pure-pw

Usage :

pure-pw useradd  [-f ] -u  [-g ]
-D/-d  [-c ]
[-t ] [-T ]
.
kes# cp pure-pw purepw
kes# purepw

Usage :

pure-pw useradd  [-f ] -u  [-g ]
-D/-d  [-c ]
[-t ] [-T ]

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: kernel panic

2008-08-12 Thread Johan Kuuse

> On Tuesday 12 August 2008 02:42:52 am Johan Kuuse wrote:
>> On Monday 11 August 2008 23:04:30 John Baldwin wrote:
>> > On Sunday 10 August 2008 10:01:49 pm Johan Kuuse wrote:
>> > > Hi,
>> > >
>> > > I am a kgdb newbie, so please be patient.
>> > > I suspect (just based on the fact that this is the 4th time I edit text
>> >
>> > files on my NTFS partition through ntfs-3g, using Emacs, and getting
>> > frequent I/O error messages inside Emacs, and then a kernel panic) that
>> > this is a ntfs-3g related problem.
>> >
>> > > If you ask me exactly how to reproduce it, I sorry, I can tell you
>> > > exactly
>> >
>> > (but see the kgdb output below).
>> >
>> > > Anyway, the kernel seems to panic at /usr/src/sys/kern/vfs_bio.c:1530
>> > >
>> > > Just a suggestion for a patch (without knowing the functionality
>> >
>> > of /usr/src/sys/kern/vfs_bio.c):
>> > > The line where the kernel panics:
>> > > /usr/src/sys/kern/vfs_bio.c:
>> > > --
>> > > VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
>> > > ...
>> > > --
>> > >
>> > > Comparing to another file, which does error checking before calling
>> >
>> > VM_OBJECT_LOCK:
>> > > /usr/src/sys/kern/vfs_aio.c:
>> > > --
>> > > if (vp->v_object != NULL) {
>> > > VM_OBJECT_LOCK(vp->v_object);
>> > > ...
>> > > --
>> > >
>> > > Perhaps the kernel panic could be avoided with the following patch?
>> > > /usr/src/sys/kern/vfs_bio.c (suggested patch):
>> > > --
>> > > if ((bp->b_bufobj != NULL) && (bp->b_bufobj->bo_object != NULL)) {
>> > > VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
>> > > ...
>> > > --
>> > >
>> > > Please let me know if you need more information.
>> > >
>> > > Regards,
>> > > Johan Kuuse
>> > >
>> > > ---
>> > > kgdb kernel.debug
>> > > /var/crash/vmcore.1
>> > > [GDB will not be able to debug user-mode threads:
>> > > /usr/lib/libthread_db.so:
>> >
>> > Undefined symbol "ps_pglobal_lookup"]
>> >
>> > > GNU gdb 6.1.1 [FreeBSD]
>> > > Copyright 2004 Free Software Foundation, Inc.
>> > > GDB is free software, covered by the GNU General Public License, and
>> > > you are welcome to change it and/or distribute copies of it under
>> > > certain
>> >
>> > conditions.
>> >
>> > > Type "show copying" to see the conditions.
>> > > There is absolutely no warranty for GDB.  Type "show warranty" for
>> > > details. This GDB was configured as "i386-marcel-freebsd".
>> > >
>> > > Unread portion of the kernel message buffer:
>> > >
>> > >
>> > > Fatal trap 12: page fault while in kernel mode
>> > > cpuid = 0; apic id = 00
>> > > fault virtual address   = 0x34
>> > > fault code  = supervisor read, page not present
>> > > instruction pointer = 0x20:0xc07b6de4
>> > > stack pointer   = 0x28:0xe79de7c8
>> > > frame pointer   = 0x28:0xe79de7e8
>> > > code segment= base 0x0, limit 0xf, type 0x1b
>> > > = DPL 0, pres 1, def32 1, gran 1
>> > > processor eflags= interrupt enabled, resume, IOPL = 0
>> > > current process = 1214 (opera)
>> > > trap number = 12
>> > > panic: page fault
>> > > cpuid = 0
>> > > Uptime: 5h20m30s
>> > > Physical memory: 2035 MB
>> > > Dumping 218 MB: 203 187 171 155 139 123 107 91 75 59 43 27 11
>> > >
>> > > #0  doadump () at pcpu.h:195
>> > > 195 __asm __volatile("movl %%fs:0,%0" : "=r" (td));
>> > > (kgdb) list *0xc07b6de4
>> > > 0xc07b6de4 is in vfs_vmio_release (/usr/src/sys/kern/vfs_bio.c:1530).
>> > > 1525vfs_vmio_release(struct buf *bp)
>> > > 1526{
>> > > 1527int i;
>> > > 1528vm_page_t m;
>> > > 1529
>> > > 1530VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
>> > > 1531vm_page_lock_queues();
>> > > 1532for (i = 0; i < bp->b_npages; i++) {
>> > > 1533m = bp->b_pages[i];
>> > > 1534bp->b_pages[i] = NULL;
>> > > (kgdb) bt
>> > > #0  doadump () at pcpu.h:195
>> > > #1  0xc0754457 in boot (howto=260) at
>> > > /usr/src/sys/kern/kern_shutdown.c:409 #2  0xc0754719 in panic
>> > > (fmt=Variable "fmt" is not available.
>> > > ) at /usr/src/sys/kern/kern_shutdown.c:563
>> > > #3  0xc0a4905c in trap_fatal (frame=0xe79de788, eva=52)
>> >
>> > at /usr/src/sys/i386/i386/trap.c:899
>> >
>> > > #4  0xc0a492e0 in trap_pfault (frame=0xe79de788, usermode=0, eva=52)
>> >
>> > at /usr/src/sys/i386/i386/trap.c:812
>> >
>> > > #5  0xc0a49c8c in trap (frame=0xe79de788)
>> >
>> > at /usr/src/sys/i386/i386/trap.c:490
>> >
>> > > #6  0xc0a2fc0b in calltrap () at /usr/src/sys/i386/i386/exception.s:139
>> > > #7  0xc07b6de4 in vfs_vmio_release (bp=0xd927e33c)
>> >
>> > at /usr/src/sys/kern/vfs_bio.c:1530
>> >
>> > > #8  0xc07b8a81 in getnewbuf (slpflag=0, slptimeo=0, size=Va

Re: kernel panic

2008-08-12 Thread John Baldwin

On Tuesday 12 August 2008 02:23:30 pm Johan Kuuse wrote:
> > On Tuesday 12 August 2008 02:42:52 am Johan Kuuse wrote:
> >> On Monday 11 August 2008 23:04:30 John Baldwin wrote:
> >> > On Sunday 10 August 2008 10:01:49 pm Johan Kuuse wrote:
> >> > > Hi,
> >> > >
> >> > > I am a kgdb newbie, so please be patient.
> >> > > I suspect (just based on the fact that this is the 4th time I edit 
text
> >> >
> >> > files on my NTFS partition through ntfs-3g, using Emacs, and getting
> >> > frequent I/O error messages inside Emacs, and then a kernel panic) that
> >> > this is a ntfs-3g related problem.
> >> >
> >> > > If you ask me exactly how to reproduce it, I sorry, I can tell you
> >> > > exactly
> >> >
> >> > (but see the kgdb output below).
> >> >
> >> > > Anyway, the kernel seems to panic at /usr/src/sys/kern/vfs_bio.c:1530
> >> > >
> >> > > Just a suggestion for a patch (without knowing the functionality
> >> >
> >> > of /usr/src/sys/kern/vfs_bio.c):
> >> > > The line where the kernel panics:
> >> > > /usr/src/sys/kern/vfs_bio.c:
> >> > > --
> >> > > VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
> >> > > ...
> >> > > --
> >> > >
> >> > > Comparing to another file, which does error checking before calling
> >> >
> >> > VM_OBJECT_LOCK:
> >> > > /usr/src/sys/kern/vfs_aio.c:
> >> > > --
> >> > > if (vp->v_object != NULL) {
> >> > > VM_OBJECT_LOCK(vp->v_object);
> >> > > ...
> >> > > --
> >> > >
> >> > > Perhaps the kernel panic could be avoided with the following patch?
> >> > > /usr/src/sys/kern/vfs_bio.c (suggested patch):
> >> > > --
> >> > > if ((bp->b_bufobj != NULL) && (bp->b_bufobj->bo_object != NULL)) {
> >> > > VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
> >> > > ...
> >> > > --
> >> > >
> >> > > Please let me know if you need more information.
> >> > >
> >> > > Regards,
> >> > > Johan Kuuse
> >> > >
> >> > 
> ---
> >> > > kgdb kernel.debug
> >> > > /var/crash/vmcore.1
> >> > > [GDB will not be able to debug user-mode threads:
> >> > > /usr/lib/libthread_db.so:
> >> >
> >> > Undefined symbol "ps_pglobal_lookup"]
> >> >
> >> > > GNU gdb 6.1.1 [FreeBSD]
> >> > > Copyright 2004 Free Software Foundation, Inc.
> >> > > GDB is free software, covered by the GNU General Public License, and
> >> > > you are welcome to change it and/or distribute copies of it under
> >> > > certain
> >> >
> >> > conditions.
> >> >
> >> > > Type "show copying" to see the conditions.
> >> > > There is absolutely no warranty for GDB.  Type "show warranty" for
> >> > > details. This GDB was configured as "i386-marcel-freebsd".
> >> > >
> >> > > Unread portion of the kernel message buffer:
> >> > >
> >> > >
> >> > > Fatal trap 12: page fault while in kernel mode
> >> > > cpuid = 0; apic id = 00
> >> > > fault virtual address   = 0x34
> >> > > fault code  = supervisor read, page not present
> >> > > instruction pointer = 0x20:0xc07b6de4
> >> > > stack pointer   = 0x28:0xe79de7c8
> >> > > frame pointer   = 0x28:0xe79de7e8
> >> > > code segment= base 0x0, limit 0xf, type 0x1b
> >> > > = DPL 0, pres 1, def32 1, gran 1
> >> > > processor eflags= interrupt enabled, resume, IOPL = 0
> >> > > current process = 1214 (opera)
> >> > > trap number = 12
> >> > > panic: page fault
> >> > > cpuid = 0
> >> > > Uptime: 5h20m30s
> >> > > Physical memory: 2035 MB
> >> > > Dumping 218 MB: 203 187 171 155 139 123 107 91 75 59 43 27 11
> >> > >
> >> > > #0  doadump () at pcpu.h:195
> >> > > 195 __asm __volatile("movl %%fs:0,%0" : "=r" (td));
> >> > > (kgdb) list *0xc07b6de4
> >> > > 0xc07b6de4 is in vfs_vmio_release (/usr/src/sys/kern/vfs_bio.c:1530).
> >> > > 1525vfs_vmio_release(struct buf *bp)
> >> > > 1526{
> >> > > 1527int i;
> >> > > 1528vm_page_t m;
> >> > > 1529
> >> > > 1530VM_OBJECT_LOCK(bp->b_bufobj->bo_object);
> >> > > 1531vm_page_lock_queues();
> >> > > 1532for (i = 0; i < bp->b_npages; i++) {
> >> > > 1533m = bp->b_pages[i];
> >> > > 1534bp->b_pages[i] = NULL;
> >> > > (kgdb) bt
> >> > > #0  doadump () at pcpu.h:195
> >> > > #1  0xc0754457 in boot (howto=260) at
> >> > > /usr/src/sys/kern/kern_shutdown.c:409 #2  0xc0754719 in panic
> >> > > (fmt=Variable "fmt" is not available.
> >> > > ) at /usr/src/sys/kern/kern_shutdown.c:563
> >> > > #3  0xc0a4905c in trap_fatal (frame=0xe79de788, eva=52)
> >> >
> >> > at /usr/src/sys/i386/i386/trap.c:899
> >> >
> >> > > #4  0xc0a492e0 in trap_pfault (frame=0xe79de788, usermode=0, eva=52)
> >> >
> >> > at /usr/src/sys/i386/i386/trap.c:812
> >> >
> >> > > #5  0xc0a49c8c in trap (frame=0xe79de788)
> >

Re: 6.3 (ish) hang on reboot w/ Supermicro C2SBA+

2008-08-12 Thread Manjunath Ranganathaiah

On Tue, Aug 12, 2008 at 1:04 AM, Daniel O'Connor <[EMAIL PROTECTED]> wrote:
> Hi,
> I have a 6.3 (RELENG_6 a bit past 6.3 actually) system which hangs when
> I try and reboot (or shutdown). It has a Supermicro C2SBA+, 2ware
> 9650SE and Core2Duo CPU.
>
> It shuts down as normal except after printing the uptime it hangs solid.
> Numlock does nothing (during the shutdown process & disk sync it
> toggles OK). The disks are synced OK (comes up clean when I press the
> reset switch)
>
> Waiting (max 60 seconds) for system process 'vnlru' to stop...done
> Waiting (max 60 seconds) for system process 'bufdaemon' to stop...done
> Waiting (max 60 seconds) for system process 'syncer' to stop...
> Syncing disks, vnodes remaining...6 5 2 3 0 1 0 0 0 done
> All buffers ynced.
> Swap device da0s1b removed.
> Uptime: 1m52s
> 
>
> I have tried setting hw.acpi.disable_on_reboot and hw.acpi.handle_reboot
> to no effect.
>
> I have attached a verbose dmesg.
>
> --
> Daniel O'Connor software and network engineer
> for Genesis Software - http://www.gsoft.com.au
> "The nice thing about standards is that there
> are so many of them to choose from."
>  -- Andrew Tanenbaum
> GPG Fingerprint - 5596 B766 97C0 0E94 4347 295E E593 DC20 7B3F CE8C

can you try with  debug.acpi.disabled="cpu timer" ? or booting with
acpi disabled

-Manjunath
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: command not found: problem with dash in filenames

2008-08-12 Thread Henrik Brix Andersen

On Tue, Aug 12, 2008 at 10:00:02PM +0400, KES wrote:
> NOTICE: 
> when I run purepw it is located and runned, but 
> when I run pure-pw (NOTICE: dash in name) I get: Command not found

Did you by any chance just install this binary? Have you tried running
'rehash' and trying again?

Brix
-- 
Henrik Brix Andersen <[EMAIL PROTECTED]>


pgpIZPuvaqK8t.pgp
Description: PGP signature

Re: Hardware monitoring for Intel Atom D945GCLF.

2008-08-12 Thread Eugene Butusov


Jeremy Chadwick pisze:

On Mon, Aug 11, 2008 at 04:45:47PM -0700, Jeremy Chadwick wrote:

I don't need an API, but this kind of statement makes Intel sound like
they're not willing to disclose the SMBus offsets for monitoring.  I
might have to look at lm-sensors from Linux, but that code is very
difficult to follow.  I'm not sure if Intel gives this sort of
information out publicly, but I sure hope so.


There's a web page mentioning this board (note: entirely Japanese) --

http://iktaka.dyndns.org/node/11

The bottom part of the page states that Linux's lm_sensors 3.0.2 can
successfully monitor temperatures, voltages, and fan RPMs on that board,
very likely via SMBus.

Ideally I should be able to track down technical details by looking at
that code.  I'd feel much more comfortable asking Intel and having them
provide necessary registers and offsets, though; I prefer to avoid
reverse-engineering things if possible (less mistakes).



Thanks for the reply. Looks like there is no tool for FreeBSD which 
supports ICH7 currently. mbmon has support for ICH6, but I'm not sure 
is it still alive. What about your project, are you planing to release 
it to the public in nearest future?


Best regards,
--
_/_/   .. Eugene Butusov
 _/_/  ... www.devilka.info
  _/_/  ebutusov(at)gmail(dot)com
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: Hardware monitoring for Intel Atom D945GCLF.

2008-08-12 Thread Jeremy Chadwick

On Tue, Aug 12, 2008 at 10:38:11PM +0200, Eugene Butusov wrote:
> Jeremy Chadwick pisze:
>> On Mon, Aug 11, 2008 at 04:45:47PM -0700, Jeremy Chadwick wrote:
>>> I don't need an API, but this kind of statement makes Intel sound like
>>> they're not willing to disclose the SMBus offsets for monitoring.  I
>>> might have to look at lm-sensors from Linux, but that code is very
>>> difficult to follow.  I'm not sure if Intel gives this sort of
>>> information out publicly, but I sure hope so.
>>
>> There's a web page mentioning this board (note: entirely Japanese) --
>>
>> http://iktaka.dyndns.org/node/11
>>
>> The bottom part of the page states that Linux's lm_sensors 3.0.2 can
>> successfully monitor temperatures, voltages, and fan RPMs on that board,
>> very likely via SMBus.
>>
>> Ideally I should be able to track down technical details by looking at
>> that code.  I'd feel much more comfortable asking Intel and having them
>> provide necessary registers and offsets, though; I prefer to avoid
>> reverse-engineering things if possible (less mistakes).
>>
>
> Thanks for the reply. Looks like there is no tool for FreeBSD which  
> supports ICH7 currently. mbmon has support for ICH6, but I'm not sure is 
> it still alive.

You have to understand, it's hard for a program to say it "supports a
chipset" unless that specific chipset is doing the H/W monitoring.  For
example, lots of Supermicro boards use Winbond H/W monitoring ICs, but
you can interface with the chip via SMBus.  The chipsets on those boards
are Intel ICHx (ICH7 through 9), but the ICHx doesn't do the monitoring.

I'm not sure the ICHx chips can actually do monitoring natively; I
haven't looked at the specifications.  If they do, that may be what
mbmon is talking about -- otherwise, it might just be a vague
description that more or less says "works with Intel ICHx chipsets that
support SMBus".  Hard to explain really...

> What about your project, are you planing to release it to the public
> in nearest future?

Once I get a couple more Supermicro boards added, yep.  The code is rock
solid at this point in time[1], but there's still a lot to be done before I
consider it worthy of public release.

Remaining items on my TODO list for bsdhwmon:

* Poke individuals testing software for further updates (many have
  been quiet since the last tarball)
* Write manpage
* Update README, and INSTALL (if necessary)
* Update Makefile to be FreeBSD ports-friendly, and general cleanup
* Make a FreeBSD port for the software

These are things I myself have to do.  I'm sorry I don't have an ETR for
you -- I really wish I did.

[1]: No crashes/segfaults, and compiles cleanly with -Werror -Wall
-Wunused -Waggregate-return -Wbad-function-cast -Wcast-align
-Wdeclaration-after-statement -Wdisabled-optimization -Wfloat-equal
-Winline -Wmissing-declarations -Wmissing-prototypes -Wnested-externs
-Wold-style-definition -Wpacked -Wpointer-arith -Wredundant-decls
-Wsign-compare -Wstrict-prototypes -Wunreachable-code -Wwrite-strings

I'm one of those "warnings should be taken seriously" individuals.
:-)

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Fwd: Re: command not found: problem with dash in filenames

2008-08-12 Thread Jeremy Chadwick

- Forwarded message from KES <[EMAIL PROTECTED]> -

> From: KES <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]
> Date:   Tue, 12 Aug 2008 23:40:08 +0400
> Subject: Re: command not found: problem with dash in filenames
> 
> 
> 
> 12.08.08, 22:25, "Jeremy Chadwick" <[EMAIL PROTECTED]>:
> 
> > On Tue, Aug 12, 2008 at 10:00:02PM +0400, KES wrote:
> > > NOTICE: 
> > > when I run purepw it is located and runned, but 
> > > when I run pure-pw (NOTICE: dash in name) I get: Command not found
> > > kes# env |grep PATH
> > > PATH=/sbin:/bin:/usr/sbin:/usr/bin:/usr/games:/usr/local/sbin:/usr/local/bin:/root/bin
> > > kes# env | grep PATH
> > > PATH=/sbin:/bin:/usr/sbin:/usr/bin:/usr/games:/usr/local/sbin:/usr/local/bin:/root/bin
> > > kes# ls -l /usr/local/bin/pure*
> > > -r-xr-xr-x  1 root  wheel  24052 12 ??? 06:24 /usr/local/bin/pure-pw
> > > -r-xr-xr-x  1 root  wheel   4432 12 ??? 06:24 
> > > /usr/local/bin/pure-pwconvert
> > > -r-xr-xr-x  1 root  wheel   6016 12 ??? 06:24 
> > > /usr/local/bin/pure-statsdecode
> > > kes# pure-pw
> > > pure-pw: Command not found
> > > kes# /usr/local/bin/pure-pw
> > > 
> > > Usage :
> > > 
> > > pure-pw useradd  [-f ] -u  [-g ]
> > > -D/-d  [-c ]
> > > [-t ] [-T ]
> > > .
> > > kes# cp pure-pw purepw
> > > kes# purepw
> > > 
> > > Usage :
> > > 
> > > pure-pw useradd  [-f ] -u  [-g ]
> > > -D/-d  [-c ]
> > > [-t ] [-T ]
> > > 
> > Looks like you're using csh/tcsh.  Try using "rehash" and see if it
> > fixes the problem for you.
> 
> thank you very mach
> problem was resolved
> 

- End forwarded message -

Individual replied to me off-list; "refresh" was indeed the answer.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: sysinstall compilation issue

2008-08-12 Thread Unga

--- On Tue, 8/12/08, Unga <[EMAIL PROTECTED]> wrote:

> From: Unga <[EMAIL PROTECTED]>
> Subject: Re: sysinstall compilation issue
> To: freebsd-stable@FreeBSD.ORG
> Cc: [EMAIL PROTECTED]
> Date: Tuesday, August 12, 2008, 3:28 PM
> --- On Fri, 8/8/08, Oliver Fromme
> <[EMAIL PROTECTED]> wrote:
> 
> > From: Oliver Fromme <[EMAIL PROTECTED]>
> > Subject: Re: sysinstall compilation issue
> > To: freebsd-stable@FreeBSD.ORG, [EMAIL PROTECTED]
> > Date: Friday, August 8, 2008, 9:36 PM
> > Unga <[EMAIL PROTECTED]> wrote:
> >  > This is i386 RELENG_7.
> >  > 
> >  > Following section of the
> > /usr/src/usr.sbin/sysinstall/Makefile does not
> generate C
> > code properly:
> >  > 
> >  > makedevs.c: Makefile rtermcap
> >  > echo '#include
> '
> > > makedevs.c
> >  > ${RTERMCAP} ansi | \
> >  > file2c 'const char
> termcap_ansi[]
> > = {' ',0};' \
> >  > > > makedevs.c
> >  > 
> >  > At compile time, above expands to:
> >  >
> >
> TERMCAP=/usr/src/usr.sbin/sysinstall/../../share/termcap/termcap.src
> > ./rtermcap ansi |  file2c 'const char
> termcap_ansi[] =
> > {' ',0};'  >> makedevs.c
> >  > 
> >  > Which generates C code as follows:
> >  > const char termcap_ansi[] = {
> >  > 
> >  > ,0};
> >  > 
> >  > That is, it generates 3 lines, which results a
> > compilation error.
> >  > 
> >  > I presume, intended generated code should be:
> >  > const char termcap_ansi[] = {' ',0};
> >  > 
> >  > Am I right?
> > 
> > No, it should generate an array containung a dump of
> > the "ansi" termcap entry.  Please see the
> > file2c(1)
> > manpage.
> > 
> >  > What is wrong at my end?
> >  > 
> >  > Note, I linked the rtermcap with ncursesw
> libraries,
> > can that be the cause? Any ideas?
> > 
> > Yes, that's the cause.  Why did you do that?
> > 
> > FreeBSD's ncurses port contains a patch so the
> > tgetent()
> > function (which is used by rtermcap) returns the
> actual
> > termcap entry in the buffer.  The original ncurses
> code
> > (which is terminfo based) doesn't do that.
> > 
> > When you linked rtermcap with the wrong library, you
> > restored the original behaviour of tgetent(), so the
> > output of rtermcap was empty, so file2c produced
> invalid
> > source.
> > 
> 
> Sorry for my late reply on this. I was not well during
> weekend, an eye ache due to over work :(
> 
> Here is the situation at my end, no matter whether I link
> with ncurses widec libs or non-widec libs, its still return
> the same code as above I mentioned. 
> 
> Possible causes can be:
> 1. The way I compile and install ncurses is not correct.
> 2. OR some essential files required are missing
> 3. OR there can be an other option
> 
> I used truss as follows:
> TERMCAP=/usr/src/usr.sbin/sysinstall/../../share/termcap/termcap.src
> truss -o truss.log -f ./rtermcap ansi |  file2c 'const
> char termcap_ansi[] = {' ',0};'  >>
> makedevs.c
> 
> The last few lines of truss.log shows:
> 9310:
> stat("/usr/share/misc/terminfo.db",{mode=-rw-r--r--
> ,inode=22613331,size=
> 3170304,blksize=4096}) = 0 (0x0)
>  9310:
> open("/usr/share/misc/terminfo.db",O_RDONLY,0644)
> = 4 (0x4)
>  9310: fcntl(4,F_SETFD,FD_CLOEXEC)   = 0 (0x0)
>  9310:
> read(4,"\0\^F\^Ua\0\0\0\^B\0\0\^D\M-R\0"...,260)
> = 260 (0x104)
>  9310:
> __sysctl(0xbfbfd574,0x2,0xbfbfd570,0xbfbfd57c,0x0,0x0) = 0
> (0x0)
>  9310: lseek(4,0x132000,SEEK_SET)= 1253376
> (0x132000)
>  9310:
> read(4,"\^^\0\M-|\^O\M-T\^O\M-K\^O\M-*"...,4096)
> = 4096 (0x1000)
>  9310: lseek(4,0x26b000,SEEK_SET)= 2535424
> (0x26b000)
>  9310:
> read(4,"\n\0\M-Y\^O\^O\n\0\n\r\^D\^E\^D"...,4096)
> = 4096 (0x1000)
>  9310: close(4)  = 0 (0x0)
>  9310: ioctl(2,TIOCGETA,0xbfbfdba4)  = 0 (0x0)
>  9310: ioctl(2,TIOCGETA,0x81020d8)   = 0 (0x0)
>  9310: ioctl(2,TIOCGETA,0xbfbfdb64)  = 0 (0x0)
>  9310: ioctl(2,TIOCGWINSZ,0xbfbfdbc4)= 0 (0x0)
>  9310: fstat(1,{mode=p-
> ,inode=0,size=0,blksize=4096}) = 0 (0x0)
>  9310: process exit, rval = 0
> 
> Note, above log has no entry to open
> /usr/src/usr.sbin/sysinstall/../../share/termcap/termcap.src.
> 
> So what can be the issue, ncurses has not been patched
> correctly or some essential files missing? If you guys think
> that ncurses has not been patched correctly, then I'll
> open another thread to discuss that.
> 

There is another update. I have tested it with ncurses and ncurses-devel ports. 
It seems they don't work as the ncurses in the base system.

The test is as follows:
cc -o rtermcap /usr/src/usr.sbin/sysinstall/rtermcap.c 
/usr/ports/devel/ncurses/work/ncurses-5.6/build.nowidec/lib/libncurses.so.5.6 
/usr/ports/devel/ncurses/work/ncurses-5.6/build.nowidec/lib/libtinfo.so.5.6

export 
LD_LIBRARY_PATH=/usr/ports/devel/ncurses/work/ncurses-5.6/build.nowidec/lib

cp -v mypath/terminfo.db /usr/local/share/misc/

TERMCAP=/usr/src/usr.sbin/sysinstall/../../share/termcap/termcap.src ./rtermcap 
ansi |  file2c 'const char

52 matches

Mail list logo