Re: ncplogin panic

2006-08-02 Thread m . ehinger


ejc <[EMAIL PROTECTED]> wrote on 01.08.2006 20:28:46:


>
> I found your backtrace by digging a bit through the freebsd-fs list
> and we appear to be reaching selrecord through different paths.  Mine
> is in sopoll() at ../../../kern/uipc_socket.c:2059

Different in what way? Mine is in sopoll() at ../../../kern/uipc_socket.c:2059
too.



> I don't know if it makes a difference, but I'm trying to use IP
> instead of IPX to access our server.

I also use IP only. Sorry if I didn't mention that.

Regards,

Maik

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: GEOM_BDE: where is my partition?

2006-08-02 Thread Iulian M
On Tuesday 01 August 2006 17:57, Stefan Bethke wrote:
> Maybe you got lucky, and only the first sector of the disk got lost
> in that crash. If you know how you had partitioned that disk
> *exactly*, or you have another disk of the same size that is
> partitioned *exactly* the same, you might try to re-create the slices
> using fdisk, or copying over the first sector with dd.  Otherwise,
> you need to restore from backup.

Not sure if it will help you, but there is a tool called gpart
(sysutils/gpart) in the ports which tries to recover your slice/partition
information so you can recreate them.
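
As a rough illustration of what "the first sector" holds, and of what a
recovered table should look like: a classic MBR keeps four 16-byte slice
entries starting at byte 446 and ends with the bytes 0x55 0xAA. The Python
sketch below decodes such a sector from a saved copy (for instance one taken
with dd before any repair attempt). The function name parse_mbr and the
dictionary keys are invented for this example; it only reads, never writes.

```python
import struct

SECTOR_SIZE = 512      # classic MBR sector size
TABLE_OFFSET = 446     # the four slice entries start here
ENTRY_SIZE = 16        # bytes per slice entry
SIGNATURE = b"\x55\xaa"  # boot signature in bytes 510-511


def parse_mbr(sector):
    """Return the non-empty slice entries found in a 512-byte MBR image.

    Hypothetical helper for illustration; it ignores the CHS fields and
    decodes only the flag, type (sysid), start LBA and sector count.
    """
    if len(sector) < SECTOR_SIZE:
        raise ValueError("need a full 512-byte sector")
    if sector[510:512] != SIGNATURE:
        raise ValueError("boot signature 0x55AA missing")

    slices = []
    for i in range(4):
        off = TABLE_OFFSET + i * ENTRY_SIZE
        # Layout: flag(1) CHS-start(3) sysid(1) CHS-end(3) start(4) size(4),
        # little-endian; the two CHS triples are skipped with pad bytes.
        flag, sysid, start, size = struct.unpack_from("<B3xB3xII", sector, off)
        if sysid == 0 and size == 0:
            continue  # unused entry
        slices.append({
            "index": i + 1,
            "active": flag == 0x80,
            "sysid": sysid,
            "start": start,
            "size": size,
        })
    return slices
```

Given the 512 bytes of a saved sector image, this returns one dictionary per
non-empty slice. Comparing the result for the damaged disk with that of an
identically partitioned good disk can help show whether only the table was
lost.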

-- 
Exceptions prove the rule, and wreck the budget.
-- Miller




Re: ncplogin panic [backtrace with module symbols loaded]

2006-08-02 Thread m . ehinger
Hi,

here is my backtrace with module symbols loaded.

Thanks,

Maik

Kernel Source:   /usr/obj/usr/src/sys/GENERIC
Getting KLD information and locations...
Traceback (most recent call last):
  File "kld_deb.py", line 139, in ?
    a = int(j[5], 16)
ValueError: invalid literal for int(): file
source /tmp/kgdb.asf.6179
[GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so: 
Undefined symbol "ps_pglobal_lookup"]
GNU gdb 6.1.1 [FreeBSD]
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-marcel-freebsd".
Ready to go.  Enter 'tr' to connect to the remote target
with /dev/cuad0, 'tr /dev/cuad1' to connect to a different port
or 'trf portno' to connect to the remote target with the firewire
interface.  portno defaults to 5556.

Type 'getsyms' after connection to load kld symbols.

If you're debugging a local system, you can use 'kldsyms' instead
to load the kld symbols.  That's a less obnoxious interface.

Unread portion of the kernel message buffer:


Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x0
fault code              = supervisor write, page not present
instruction pointer     = 0x20:0xc068a3fc
stack pointer           = 0x28:0xdcebc8dc
frame pointer           = 0x28:0xdcebc8e4
code segment            = base 0x0, limit 0xf, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 772 (ncplogin)
trap number             = 12
panic: page fault
Uptime: 16m46s
Dumping 494 MB (2 chunks)
  chunk 0: 1MB (159 pages) ... ok
  chunk 1: 495MB (126511 pages) 479 463 447 431 415 399 383 367 351 335 319 303 
287 271 255 239 223 207 191 175 159 143 127 111 95
79 63 47 31 15

#0  doadump () at pcpu.h:165
165  __asm __volatile("movl %%fs:0,%0" : "=r" (td));
(kgdb) getsyms
During symbol reading, Incomplete CFI data; unspecified registers at 0xc0666959.
Id Refs Address    Size     Name
 1   18 0xc040     6e15d0   kernel
 2    1 0xc0ae2000 5f60     snd_ich.ko
 3    2 0xc0ae8000 22b88    sound.ko
 4    2 0xc0b0b000 fd68     drm.ko
 5    1 0xc0b1b000 615c     i915.ko
 6    1 0xc0b22000 5984c    acpi.ko
 7    1 0xc3574000 16000    linux.ko
 8    1 0xc369b000 4000     logo_saver.ko
 9    1 0xc3816000 8000     ncp.ko
10    1 0xc381e000 3000     libmchain.ko
Select the list above with the mouse, paste into the screen
and then press ^D.  Yes, this is annoying.
 1   18 0xc040     6e15d0   kernel
 2    1 0xc0ae2000 5f60     snd_ich.ko
 3    2 0xc0ae8000 22b88    sound.ko
 4    2 0xc0b0b000 fd68     drm.ko
 5    1 0xc0b1b000 615c     i915.ko
 6    1 0xc0b22000 5984c    acpi.ko
 7    1 0xc3574000 16000    linux.ko
 8    1 0xc369b000 4000     logo_saver.ko
 9    1 0xc3816000 8000     ncp.ko
10    1 0xc381e000 3000     libmchain.ko
^Dadd symbol table from file 
"/usr/obj/usr/src/sys/GENERIC/modules/usr/src/sys/modules/acpi/acpi/acpi.ko.debug"
 at
 .text_addr = 0xc0b32de0
 .data_addr = 0xc0b661a0
 .bss_addr = 0xc0b684e0
add symbol table from file 
"/usr/obj/usr/src/sys/GENERIC/modules/usr/src/sys/modules/drm/drm/drm.ko.debug" 
at
 .text_addr = 0xc0b0e68c
 .data_addr = 0xc0b18100
 .bss_addr = 0xc0b18b24
add symbol table from file 
"/usr/obj/usr/src/sys/GENERIC/modules/usr/src/sys/modules/drm/i915/i915.ko.debug"
 at
 .text_addr = 0xc0b1c1b4
 .data_addr = 0xc0b1fd80
 .bss_addr = 0xc0b1fff4
add symbol table from file 
"/usr/obj/usr/src/sys/GENERIC/modules/usr/src/sys/modules/libmchain/libmchain.ko.debug"
 at
 .text_addr = 0xc381e60c
 .data_addr = 0xc38200b8
 .bss_addr = 0xc3820150
add symbol table from file 
"/usr/obj/usr/src/sys/GENERIC/modules/usr/src/sys/modules/linux/linux.ko.debug" 
at
 .text_addr = 0xc357ab44
 .data_addr = 0xc3588000
 .bss_addr = 0xc35896c0
add symbol table from file 
"/usr/obj/usr/src/sys/GENERIC/modules/usr/src/sys/modules/ncp/ncp.ko.debug" at
 .text_addr = 0xc3817e40
 .data_addr = 0xc381d280
 .bss_addr = 0xc381db80
add symbol table from file 
"/usr/obj/usr/src/sys/GENERIC/modules/usr/src/sys/modules/sound/sound/sound.ko.debug"
 at
 .text_addr = 0xc0af0c30
 .data_addr = 0xc0aff4c0
 .bss_addr = 0xc0b02d80
add symbol table from file 
"/usr/obj/usr/src/sys/GENERIC/modules/usr/src/sys/modules/sound/driver/ich/snd_ich.ko.debug"
 at
 .text_addr = 0xc0ae3b08
 .data_addr = 0xc0ae6960
  

Re: Safe card to replace for ICP Vortex GDT8514RZ ...

2006-08-02 Thread Martin Nilsson

Freddie Cash wrote:

> We've been bitten bad by these cards.  LSI MegaRAID 150-6 SATA RAID
> controllers (PCI-X format).  The management tools are crap, the
> throughput is crap, the onboard SATA chipset is a Silicon Image.

The design is three years old and is showing its age. Their performance
is not on par with modern cards like the 3ware 9550; the price, however,
is considerably lower. The newer MegaRAID 300-8x (and the coming -4e, -8e)
are modern designs with faster processors and acceptable performance.

> And they will not run reliably when plugged into a riser card.

Wasn't that problem caused by a Tyan motherboard with a substandard
riser? I have used lots of 150-4/6 cards with Supermicro boards and risers
without any problems.

> We lost data on several servers before narrowing down the cause to
> these cards.  We lost several weeks of time diagnosing these things.
> And we lost several hundred dollars when the vendor wouldn't take them
> all back (we got them to trade most of them for 3Ware 9550SX cards).

Buy things from vendors who know what they sell :-) It will be more
expensive, but in the long run you will save both money and especially time.

> In our experience, these cards are crap, and their tech support isn't
> much better.

LSI's tech support is top notch and very helpful. Tyan is the real
stinker in the business: uneducated, without any lab resources and hard
to get hold of (excluding the Taiwanese server wanna-be manufacturers,
whose support is virtually non-existent).


/Martin



Re: ncplogin panic

2006-08-02 Thread ejc

On 8/1/06, John Baldwin <[EMAIL PROTECTED]> wrote:

> On Tuesday 01 August 2006 14:28, ejc wrote:
> > On 8/1/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
> > >
> > > Hi,
> > >
> > > I had the same problem. See my thread on the freebsd-fs mailing list:
> > >
> > > http://lists.freebsd.org/pipermail/freebsd-fs/2006-July/002060.html
> > >
> > >
> > > After some research I am using the attached patch against ncp_sock.c.
> > >
> > > It is not a real solution to this problem; it only avoids the panics.
> > > I have been using it for quite a while without any other known
> > > problems.
> > > Hopefully someone with more knowledge can help on this.
> > >
> > > I also get some "md_get_mem(461): incomplete copy" messages, which seem
> > > to do no harm so far.
> > >
> > > Regards,
> > >
> > > Maik
> > >
> > >
> > > !!! Use at your own risk !!!
> > >
> > > --- ncp_sock.c.orig	Fri Jan  7 02:45:49 2005
> > > +++ ncp_sock.c	Thu Jul 20 14:12:45 2006
> > > @@ -189,7 +189,12 @@
> > >  	struct thread *td = curthread;
> > >  	struct ucred *cred = NULL;
> > >
> > > -	return so->so_proto->pr_usrreqs->pru_sopoll(so, events, cred, td);
> > > +	if ( td->td_selq.tqh_last == NULL ) {
> > > +		printf("ncp_poll: td->td_selq.tqh_last == NULL\n");
> > > +		return 0;
> > > +	}
> > > +
> > > +	return so->so_proto->pr_usrreqs->pru_sopoll(so, events, cred, td);
> > >  }
> > >
> > >  int
> > >
> > >  patch ends here ---
> >
> > After setting my BIOS to only use one CPU I was able to get a core
> > dump and the panic is happening at the exact same place as yours:
> > in selrecord (../../../kern/sys_generic.c:1105)
> > 1100             * it alone as we've already added pointed it at us and added it to
> > 1101             * our list.
> > 1102             */
> > 1103            if (sip->si_thread == NULL) {
> > 1104                    sip->si_thread = selector;
> > 1105                    TAILQ_INSERT_TAIL(&selector->td_selq, sip, si_thrlist);
> > 1106            } else if (sip->si_thread != selector) {
> > 1107                    sip->si_flags |= SI_COLL;
> > 1108            }
> > 1109
> >
> > I found your backtrace by digging a bit through the freebsd-fs list
> > and we appear to be reaching selrecord through different paths.  Mine
> > is in sopoll() at ../../../kern/uipc_socket.c:2059
> >
> > I don't know if it makes a difference, but I'm trying to use IP
> > instead of IPX to access our server.
> > My dump backtrace is attached.
>
> It would be very helpful if you could get the symbols loaded for the modules
> in your backtrace.  You can either compile everything into a static kernel or
> you can use the 'asf' tool to generate appropriate gdb script commands to
> source to get symbols for your modules.  You can find a kldstat gdb command
> in src/tools/debugscripts/ that would be helpful to use with asf.
>
> --
> John Baldwin


Backtrace with module symbols loaded.  It looks almost exactly the
same as Maik's.

Thanks
Eric


dump.out
Description: Binary data

Re: GEOM_BDE: where is my partition?

2006-08-02 Thread Felipe Neuwald

Hi Iulian,

ok, I'll try gpart here and tell you about the results.

Thank you,

Felipe Neuwald.

Iulian M wrote:

> On Tuesday 01 August 2006 17:57, Stefan Bethke wrote:
> > Maybe you got lucky, and only the first sector of the disk got lost
> > in that crash. If you know how you had partitioned that disk
> > *exactly*, or you have another disk of the same size that is
> > partitioned *exactly* the same, you might try to re-create the slices
> > using fdisk, or copying over the first sector with dd.  Otherwise,
> > you need to restore from backup.
>
> Not sure if it will help you, but there is a tool called gpart
> (sysutils/gpart) in the ports which tries to recover your slice/partition
> information so you can recreate them.


  




Re: ncplogin panic

2006-08-02 Thread ejc

On 8/2/06, ejc <[EMAIL PROTECTED]> wrote:

> Backtrace with module symbols loaded.  It looks almost exactly the
> same as Maik's.


I messed up that last one and loaded the local module symbols with
'kldsyms'.  Here's one with 'getsyms'.

Thanks
Eric


dump.out
Description: Binary data

re interface broken

2006-08-02 Thread Gerrit Kühn
Hi folks,

I've got a notebook (HP/Compaq nx7010) with which I'm tracking 6-stable.
Some days (maybe two weeks?) ago my internal network interface (re driver)
became inoperational. The hardware shows a link, but nothing goes over the
wire. I cannot ping, cannot get a dhcp address, nothing.
Meanwhile I did various updates using the old kernel (earlier 6.1-stable),
and I saw several changes affecting if*, re* and so on. However, even
after the latest cvsup today, the interface still doesn't work. Is there
work going on to fix this? Can I provide further information to help track
this problem down?


cu
  Gerrit


Re: ncplogin panic

2006-08-02 Thread John Baldwin
On Wednesday 02 August 2006 09:33, ejc wrote:
> On 8/2/06, ejc <[EMAIL PROTECTED]> wrote:
> > Backtrace with module symbols loaded.  It looks almost exactly the
> > same as Maik's .
> 
> I messed up that last one and loaded the local module symbols with
> 'kldsyms'.  Here's one with 'getsyms'.

Can you jump to the 'selrecord' frame and do 'p selector->td_selq' and
'p *sip'?

-- 
John Baldwin



ncplogin panic

2006-08-02 Thread m . ehinger

> Can you jump to the 'selrecord' frame and do 'p selector->td_selq' and
> 'p *sip'?

Here is what i get.

(kgdb) up 7
#7  0xc068a3fc in selrecord (selector=0xc33f9a80, sip=0xc363a8a8) at /usr/src/sys/kern/sys_generic.c:1105
1105            TAILQ_INSERT_TAIL(&selector->td_selq, sip, si_thrlist);
(kgdb) p selector->td_selq
$1 = {
  tqh_first = 0x0,
  tqh_last = 0x0
}
(kgdb) p *sip
$2 = {
  si_thrlist = {
    tqe_next = 0x0,
    tqe_prev = 0x0
  },
  si_thread = 0xc33f9a80,
  si_note = {
    kl_list = {
      slh_first = 0x0
    },
    kl_lock = 0xc064bf08 ,
    kl_unlock = 0xc064bf3c ,
    kl_locked = 0xc064bf78 ,
    kl_lockarg = 0xc363a8cc
  },
  si_flags = 0x0
}
(kgdb)
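
A note on what this output suggests, offered as a reading of the dump rather
than a confirmed diagnosis: both td_selq pointers are zero, which is what a
tail queue head looks like when TAILQ_INIT has never been run on it. In
<sys/queue.h>, TAILQ_INSERT_TAIL stores the new element through
head->tqh_last, so a NULL tqh_last becomes a write to address 0x0, which is
consistent with the "supervisor write" fault at virtual address 0x0 reported
earlier in this thread. The Python model below only illustrates the steps of
that macro; the class and function names are made up for the sketch and are
not kernel code.

```python
class TailqHead:
    """Model of a TAILQ_HEAD; both fields start zeroed (None), as in the dump."""

    def __init__(self):
        self.tqh_first = None
        self.tqh_last = None  # in C: address of the last "next" pointer


class Elem:
    """Model of a queue element with its TAILQ_ENTRY fields."""

    def __init__(self, name):
        self.name = name
        self.tqe_next = None
        self.tqe_prev = None


class NullWrite(Exception):
    """Stands in for the page fault caused by a store through NULL."""


def tailq_init(head):
    # TAILQ_INIT: empty list, tqh_last points at the head's own tqh_first.
    head.tqh_first = None
    head.tqh_last = (head, "tqh_first")


def tailq_insert_tail(head, elm):
    # Same order of operations as the TAILQ_INSERT_TAIL macro.
    elm.tqe_next = None
    elm.tqe_prev = head.tqh_last
    if head.tqh_last is None:
        # "*(head)->tqh_last = elm" with tqh_last == NULL: write to 0x0.
        raise NullWrite("store through NULL tqh_last")
    owner, slot = head.tqh_last
    setattr(owner, slot, elm)          # *(head)->tqh_last = elm
    head.tqh_last = (elm, "tqe_next")  # tqh_last = &elm->tqe_next
```

Inserting into a head that was never passed to tailq_init raises NullWrite,
while the same insert after tailq_init links the element normally. The
workaround patch posted earlier tests for exactly this NULL tqh_last before
calling into sopoll.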



Re: ncplogin panic

2006-08-02 Thread ejc

On 8/2/06, John Baldwin <[EMAIL PROTECTED]> wrote:

> On Wednesday 02 August 2006 09:33, ejc wrote:
> > On 8/2/06, ejc <[EMAIL PROTECTED]> wrote:
> > > Backtrace with module symbols loaded.  It looks almost exactly the
> > > same as Maik's.
> >
> > I messed up that last one and loaded the local module symbols with
> > 'kldsyms'.  Here's one with 'getsyms'.
>
> Can you jump to the 'selrecord' frame and do 'p selector->td_selq' and
> 'p *sip'?


(kgdb) frame 7
#7  0xc0507520 in selrecord (selector=0xc174d000, sip=0xc17755e0) at ../../../kern/sys_generic.c:1105
1105            TAILQ_INSERT_TAIL(&selector->td_selq, sip, si_thrlist);
(kgdb) p selector->td_selq
$1 = {
  tqh_first = 0x0,
  tqh_last = 0x0
}
(kgdb) p *sip
$2 = {
  si_thrlist = {
    tqe_next = 0x0,
    tqe_prev = 0x0
  },
  si_thread = 0xc174d000,
  si_note = {
    kl_list = {
      slh_first = 0x0
    },
    kl_lock = 0xc04c6b1c ,
    kl_unlock = 0xc04c6b54 ,
    kl_locked = 0xc04c6b90 ,
    kl_lockarg = 0xc1775604
  },
  si_flags = 0x0
}


Re: ncplogin panic

2006-08-02 Thread John Baldwin
On Wednesday 02 August 2006 11:43, ejc wrote:
> On 8/2/06, John Baldwin <[EMAIL PROTECTED]> wrote:
> > On Wednesday 02 August 2006 09:33, ejc wrote:
> > > On 8/2/06, ejc <[EMAIL PROTECTED]> wrote:
> > > > Backtrace with module symbols loaded.  It looks almost exactly the
> > > > same as Maik's .
> > >
> > > I messed up that last one and loaded the local module symbols with
> > > 'kldsyms'.  Here's one with 'getsyms'.
> >
> > Can you jump to the 'selrecord' frame and do 'p selector->td_selq' and
> > 'p *sip'?

Ok, can you try http://www.FreeBSD.org/~jhb/patches/ncp_poll.patch

-- 
John Baldwin


Re: ncplogin panic

2006-08-02 Thread ejc

On 8/2/06, John Baldwin <[EMAIL PROTECTED]> wrote:

> On Wednesday 02 August 2006 11:43, ejc wrote:
> > On 8/2/06, John Baldwin <[EMAIL PROTECTED]> wrote:
> > > On Wednesday 02 August 2006 09:33, ejc wrote:
> > > > On 8/2/06, ejc <[EMAIL PROTECTED]> wrote:
> > > > > Backtrace with module symbols loaded.  It looks almost exactly the
> > > > > same as Maik's.
> > > >
> > > > I messed up that last one and loaded the local module symbols with
> > > > 'kldsyms'.  Here's one with 'getsyms'.
> > >
> > > Can you jump to the 'selrecord' frame and do 'p selector->td_selq' and
> > > 'p *sip'?
>
> Ok, can you try http://www.FreeBSD.org/~jhb/patches/ncp_poll.patch


I had to massage the patch a bit to get it to apply to v1.15 of
ncp_sock.c  (I'm running 6.1-STABLE) but I don't get the panics
anymore.

Thanks
Eric


Re: re interface broken

2006-08-02 Thread Pyun YongHyeon
On Wed, Aug 02, 2006 at 04:16:34PM +0200, Gerrit Kühn wrote:
 > Hi folks,
 > 
 > I've got a notebook (HP/Compaq nx7010) with which I'm tracking 6-stable.
 > Some days (maybe two weeks?) ago my internal network interface (re driver)
 > became inoperational. The hardware shows a link, but nothing goes over the
 > wire. I cannot ping, cannot get a dhcp address, nothing.
 > Meanwhile I did various updates using the old kernel (earlier 6.1-stable),
 > and I saw several changes affecting if*, re* and so on. However, even
 > after the latest cvsup today, the interface still doesn't work. Is there
 > work going on to fix this, can I provide further information to track this
 > problem down?
 > 

I think you should show 'dmesg' and 'pciconf -lv' output first.

-- 
Regards,
Pyun YongHyeon


fdisk problem on 3ware based system (6.1-RELEASE-p3)

2006-08-02 Thread Michael R. Wayne

I seem to have an fdisk issue on one of our boxes with a 3ware card.  Relevant
lines from dmesg:
   twe0: <3ware Storage Controller. Driver version 1.50.01.002> port 
0x9c00-0x9c0f mem 0xfc00-0xfc7f irq 20 at device 1.0 on pci2
   twe0: 2 ports, Firmware FE8S 1.05.00.068, BIOS BE7X 1.08.00.048
   twed0:  on twe0
   twed0: 76318MB (156299440 sectors)

This started out because I am unable to create a new partition on
a machine running 6.1-RELEASE-p3 and am beginning to suspect something
is wrong in fdisk.  If I run sysinstall and go to the partition
editor, I get the following, which seems correct:

   Disk name:  twed0  FDISK Partition Editor
   DISK Geometry:  9729 cyls/255 heads/63 sectors = 156296385 sectors (76316MB)

       Offset   Size(ST)        End     Name  PType     Desc  Subtype  Flags

            0         63         62        -     12   unused        0
           63   31455207   31455269  twed0s1      8  freebsd      165
     31455270   58717575   90172844  twed0s2      8  freebsd      165
     90172845   66126595  156299439        -     12   unused        0

But, I am unable to create a third partition.  Every time I do that, I get:
   ERROR: Unable to write data to disk twed0!

This machine is not running with an elevated security level:
   kern.securelevel: -1


So, I decided to go in with fdisk and see what was up.  It looks
like fdisk is very confused on partition 2, which is likely
why I can not create a partition 3 (even using fdisk):

   > fdisk /dev/twed0
   *** Working on device /dev/twed0 ***
   parameters extracted from in-core disklabel are:
   cylinders=9729 heads=255 sectors/track=63 (16065 blks/cyl)

   Figures below won't work with BIOS for partitions not in cyl 1
   parameters to be used for BIOS calculations are:
   cylinders=9729 heads=255 sectors/track=63 (16065 blks/cyl)

   Media sector size is 512
   Warning: BIOS sector numbering starts with sector 1
   Information from DOS bootblock is:
   The data for partition 1 is:
   sysid 165 (0xa5),(FreeBSD/NetBSD/386BSD)
   start 63, size 31455207 (15358 Meg), flag 80 (active)
   beg: cyl 0/ head 1/ sector 1;
   end: cyl 1023/ head 254/ sector 63
   The data for partition 2 is:
   sysid 165 (0xa5),(FreeBSD/NetBSD/386BSD)
   start 31455270, size 58717575 (28670 Meg), flag 0
   beg: cyl 1023/ head 255/ sector 63;
   end: cyl 1023/ head 254/ sector 63   < !!
   The data for partition 3 is:
   
   The data for partition 4 is:
   

At this point, I'm suspecting that fdisk is computing something
incorrectly and am not sure how to proceed as I'd prefer not to
corrupt my disk label.  Before I consider filing a PR, is this a
known problem?
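
For what it is worth, the CHS fields in that output can be checked by hand.
The MBR stores the cylinder in 10 bits, so nothing past cylinder 1023 can be
represented, and tools commonly write a saturated placeholder for any slice
boundary beyond that point. With the 255-head, 63-sector geometry reported
above, slice 1 already ends at cylinder 1957 and slice 2 starts at cylinder
1958, so identical-looking 1023/254/63 entries do not by themselves show
corruption. The Python sketch below just redoes that arithmetic; the function
names and the clamping rule (1023, heads-1, sectors) are assumptions made for
illustration, not taken from the fdisk source, and it says nothing about why
the write to twed0 fails.

```python
HEADS = 255              # heads, from the fdisk output
SECTORS_PER_TRACK = 63   # sectors/track, from the fdisk output
MAX_MBR_CYL = 1023       # largest cylinder a 10-bit MBR field can hold


def lba_to_chs(lba, heads=HEADS, spt=SECTORS_PER_TRACK):
    """Convert a 0-based sector number to (cylinder, head, sector).

    Sectors are 1-based in CHS addressing; cylinders and heads are 0-based.
    """
    cyl, rem = divmod(lba, heads * spt)
    head, sec0 = divmod(rem, spt)
    return cyl, head, sec0 + 1


def mbr_chs(lba, heads=HEADS, spt=SECTORS_PER_TRACK):
    """CHS triple as it could be stored in an MBR entry.

    Assumed convention: addresses past cylinder 1023 are clamped to
    (1023, heads - 1, spt). Other tools may use a different placeholder.
    """
    cyl, head, sec = lba_to_chs(lba, heads, spt)
    if cyl > MAX_MBR_CYL:
        return MAX_MBR_CYL, heads - 1, spt
    return cyl, head, sec
```

Run against the numbers in the listing, lba_to_chs(63) gives cylinder 0,
head 1, sector 1, matching the "beg" line for partition 1, and both slices
end exactly on a cylinder boundary (head 254, sector 63).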

/\/\ \/\/