Re: 20TB Storage System (fsck????)

2003-09-04 Thread Terry Lambert
Max Clark wrote:
> Ohh, that's an interesting snag. I was under the impression that 5.x w/ PAE
> could address more than 4GB of Ram.

The kernel being able to address the RAM does not meant that
the KVA+UVA space is larger than 4G.  At best, you could take
the uiomove/copyin/copyout performance hit, and move both of
thse to 4G, each, rather than 4G total.  That still limits you
to 4G.


> If fsck requires 700K for each 1GB of Disk, we are talking about 7GB of Ram
> for 10TB of disk. Is this correct? Will PAE not function correctly to give
> me 8GB of Ram? To check 10TB of disk?

No, it will not.


> Is there anyway to bypass this requirement and split fsck into smaller
> chunks? Being able to fsck my disk is kinda important.

Yes.  Limit the number of CG bitmaps you examine simultaneously,
and make the operation multiple pass over the disk.  This is not
that hard a modification to fsck, and it can be done fairly
quickly by anyone who understands the code.  The code in time to
fsck the disk will go up inversely proportionally to the amount
of RAM it's allowed to use, which is limited to the UVA size
minus the fsck program size itself, and the fsck buffers used for
things like FS metadata for a given file/directory.


> I have zero experience with either itanium or opteron. What is the current
> status of support for these processors in FreeBSD? What would the preferred
> CPU be? Will there be PCI cards that I would not be able to use in either of
> these systems?

I have no idea whether these systems support a larger UVA size,
or how much memory you could jam into them...

-- Terry
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: possible change for 4.9 rc file?

2003-09-04 Thread Doug Barton
I oppose this going in prior to 4.9 just on general principles. The
concept looks sound on a very light review though. If you'd like more in
depth review, please forward your patch to [EMAIL PROTECTED]

Doug

-- 

This .signature sanitized for your protection

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: 20TB Storage System (fsck????)

2003-09-04 Thread Geoff Buckingham
On Thu, Sep 04, 2003 at 01:12:45AM -0700, Terry Lambert wrote:
> 
> Yes.  Limit the number of CG bitmaps you examine simultaneously,
> and make the operation multiple pass over the disk.  This is not
> that hard a modification to fsck, and it can be done fairly
> quickly by anyone who understands the code.  The code in time to
> fsck the disk will go up inversely proportionally to the amount
> of RAM it's allowed to use, which is limited to the UVA size
> minus the fsck program size itself, and the fsck buffers used for
> things like FS metadata for a given file/directory.
> 
> 
Pardon my ignorance but does the number of inodes in the filesystem have a
significant impact on the memory requirement of fsck?

I ask as it was previously stated the smallest file on the 10TB filessytem
would be 500MB which would enable a vastley reduced number of inodes and
possibly very large block fragment and cluster sizes?
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


RE: non reliable nmi

2003-09-04 Thread Don Bowman
> From: Mike Silbersack [mailto:[EMAIL PROTECTED]
> On Thu, 28 Aug 2003, Don Bowman wrote:
> 
> > I have machdep.ddb_on_nmi=1.
> > I can drop into the debugger with the magic
> > key sequence. However, when i hit the NMI
> > jumper, i don't always go there. Sometimes
> > I do.
> > The system is 4-way SMP [2xHTT xeon processors]
> > with 4.7.
> >
> > Any suggestion on where my NMI might be going?
> 
> Is your NMI about 106K in size, and does it have the subjects 
> "Thank you",
> "Wicked screensaver" and others?  If so, I think I know where it's
> going...

?
it was actually a serious question. The nmi header on
my board goes into the ICH-3 of my chipset, but shorting the
jumper out doesn't always enter the debugger. It does
sometimes. If I set the NMI_NOW bit in the ICH-3 I always
enter the debugger. I was curious if anyone else had seen
this behaviour.

--don
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: non reliable nmi

2003-09-04 Thread Peter Pentchev
On Thu, Sep 04, 2003 at 09:16:23AM -0400, Don Bowman wrote:
> > From: Mike Silbersack [mailto:[EMAIL PROTECTED]
> > On Thu, 28 Aug 2003, Don Bowman wrote:
> > 
> > > I have machdep.ddb_on_nmi=1.
> > > I can drop into the debugger with the magic
> > > key sequence. However, when i hit the NMI
> > > jumper, i don't always go there. Sometimes
> > > I do.
> > > The system is 4-way SMP [2xHTT xeon processors]
> > > with 4.7.
> > >
> > > Any suggestion on where my NMI might be going?
> > 
> > Is your NMI about 106K in size, and does it have the subjects 
> > "Thank you",
> > "Wicked screensaver" and others?  If so, I think I know where it's
> > going...
> 
> ?
> it was actually a serious question. The nmi header on
> my board goes into the ICH-3 of my chipset, but shorting the
> jumper out doesn't always enter the debugger. It does
> sometimes. If I set the NMI_NOW bit in the ICH-3 I always
> enter the debugger. I was curious if anyone else had seen
> this behaviour.

I haven't kept quite up to date on my x86 hardware lately (read: in
the past five to ten years), but I distinctly remember a time when
everyone referred to x86's NMI as a joke: a non-maskable interrupt that
anyone could mask using a simple CLI instruction.  Not sure if this is
still the case, others would have to say if today's processors still may
get so wedged that a NMI request would simply be ignored.

The other possibility is some kind of kernel mess-up, bad enough that an
NMI does indeed reach the processor, the processor does invoke the NMI
handler, yet the handler (DDB) is quite unable to actually do any useful
work - messed up data structures and such.

G'luck,
Peter

-- 
Peter Pentchev  [EMAIL PROTECTED][EMAIL PROTECTED][EMAIL PROTECTED]
PGP key:http://people.FreeBSD.org/~roam/roam.key.asc
Key fingerprint FDBA FD79 C26F 3C51 C95E  DF9E ED18 B68D 1619 4553
This sentence no verb.


pgp0.pgp
Description: PGP signature


Re: FYI - Just got a kernel panic - RELENG_4

2003-09-04 Thread Marc Ramirez
On Fri, 29 Aug 2003, Kris Kennaway wrote:

> On Thu, Aug 28, 2003 at 05:34:29PM -0400, Marc Ramirez wrote:
> >
> > supped as of ~ 1:40pm EST today
> >
> > The panic:
> >
> > Fatal trap 12: page fault while in kernel mode
>
> This is probably hardware-related, but it's possible it may be due to
> a kernel problem.  As a first step, you need to try and get a gdb
> traceback with debugging symbols as explained in the developer's
> handbook on the website.

I forgot to turn off the cvsup cron job; I went to rebuild the kernel with
DDB, and found out that my code base had changed...

Everything works now.  I have been unable to reproduce the crash...
Thanks for taking the time, though.

Marc.

--
Marc Ramirez
Blue Circle Software Corporation
513-688-1070 (main)
513-382-1270 (direct)
http://www.bluecirclesoft.com
http://www.mrami.com (personal)
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Reminder: BSDCon next week in San Mateo!

2003-09-04 Thread Robert Watson

This is just a friendly reminder e-mail that the BSD Conference is taking
place in San Mateo next week, and that if you're planning to attend and
haven't yet registered, you might want to.  Or, just turn up and register
at the door.

There's a really strong lineup of FreeBSD-related papers, especially
relating to new features in the 5-CURRENT development line. I've attached
a list of just some of the interesting things that will be going on there:
they include a number of tutorials relating to development and
administration, technical session presentations relating to the
development of FreeBSD, development of products using FreeBSD, and the
deployment of FreeBSD-based systems.  And, as always, there will be a
variety of invited talks, BoFs and work-in-progress sessions. 

USENIX has extended their early registration pricing, and also (I believe) 
has an online registration discount.  Multi-employee discounts are also
available for companies sending more than one employee.  You can find out
more about the location, schedule of events, etc, at: 

  http://www.usenix.org/events/bsdcon03/

I look forward to seeing you there!

Robert N M Watson FreeBSD Core Team, TrustedBSD Projects
[EMAIL PROTECTED]  Network Associates Laboratories


  Several excellent tutorials including one on developing storage
extensions using GEOM

  Keynote: Computing Fallacies (or, What Is the World Coming To?)

  Reasoning about SMP in FreeBSD

  devd-A Device Configuration Daemon

  ULE: A Modern Scheduler for FreeBSD

  An Automated Binary Security Update System for FreeBSD

  Building a High-performance Computing Cluster Using FreeBSD

  build.sh: Cross-building NetBSD

  Invited Talk: Long Range 802.11 WANs

  BSD Status Reports

  GBDE-GEOM Based Disk Encryption

  Cryptographic Device Support for FreeBSD

  Enhancements to the Fast Filesystem to Support Multi-Terabyte Storage
Systems

  Invited Talk: Social and Technical Implications of Nonproprietary
Software

  Running BSD Kernels as User Processes by Partial Emulation and Rewriting
of Machine Instructions

  A Digital Preservation Network Appliance Based on OpenBSD

  Using FreeBSD to Render Realtime Localized Audio and Video

  Work in Progess Reports (WiPs)

  Tagging Data in the Network Stack: mbuf_tags

  Fast IPSec: A High-Performance IPsec Implementation

  The WHBA Project: Experiences "deeply embedding" NetBSD

  Invited Talk: Post-Digital Possibilities


___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: non reliable nmi

2003-09-04 Thread cjarvis
In <[EMAIL PROTECTED]>, on 09/04/03 
   at 09:12 AM, Peter Pentchev <[EMAIL PROTECTED]> said:

>I haven't kept quite up to date on my x86 hardware lately (read: in the
>past five to ten years), but I distinctly remember a time when everyone
>referred to x86's NMI as a joke: a non-maskable interrupt that anyone
>could mask using a simple CLI instruction.  Not sure if this is still the
>case, others would have to say if today's processors still may get so
>wedged that a NMI request would simply be ignored.

CLI doesn't stop an NMI, but you can mask off NMI in the CMOS RAM.  On
standard PC-AT platforms anyway.

-- 
Clark




**
This email, and any files transmitted with it, are confidential
and intended solely for the use of the individual or entity to
whom they are addressed. If you have received this email
in error please advise [EMAIL PROTECTED]
**

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


getblk() may return NULL, but most caller's don't check for it

2003-09-04 Thread Bob Willcox
Hi All,

While perusing the FreeBSD kernel source code today I noticed that
there are several reasons that the getblk() function in vfs_bio.c might
decide to return a NULL return code. However, checking both 4-stable and
5-current I can find only place (in 5-curren) where the return value
from getblk() is actually checked for the possibility of being NULL (in
function cluster_rbuild() in vfs_cluster.c near line 389). In all other
instances that I can find the returned value is quickly dereferenced w/o
any check.

What am I missing here? Is this really a bug, or are there extenuating
circumstances that prevent getblk() from returning NULL in actual
operation?

Thanks for any insight that you may provide.
Bob

-- 
Bob Willcox  Maslow's Maxim:
[EMAIL PROTECTED]   If the only tool you have is a hammer,
Austin, TX   you treat everything like a nail.
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: 20TB Storage System (fsck????)

2003-09-04 Thread Julian Elischer


On Wed, 3 Sep 2003, Tim Kientzle wrote:

> Max Clark wrote:
> > Ohh, that's an interesting snag. I was under the impression that 5.x w/ PAE
> > could address more than 4GB of Ram.
> 
> That's >4G of memory in the system.  32-bit processors
> are still limited to 4G processor address space, which means
> <3G per process (allowing some memory for kernel operations).
> You can't get around that unless you either go for a 64-bit
> processor or do some complex coding to break your application
> storage across multiple processes.


It's worse than that, becasue I think that to handle >4GB of ram you
need to limit your processes to about 2G of virtual space.



___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: non reliable nmi

2003-09-04 Thread John Baldwin

On 04-Sep-2003 [EMAIL PROTECTED] wrote:
> In <[EMAIL PROTECTED]>, on 09/04/03 
>at 09:12 AM, Peter Pentchev <[EMAIL PROTECTED]> said:
> 
>>I haven't kept quite up to date on my x86 hardware lately (read: in the
>>past five to ten years), but I distinctly remember a time when everyone
>>referred to x86's NMI as a joke: a non-maskable interrupt that anyone
>>could mask using a simple CLI instruction.  Not sure if this is still the
>>case, others would have to say if today's processors still may get so
>>wedged that a NMI request would simply be ignored.
> 
> CLI doesn't stop an NMI, but you can mask off NMI in the CMOS RAM.  On
> standard PC-AT platforms anyway.

The bug is probably in FreeBSD in that we mask the NMI LVT entry in the
local APIC on all the APs and only leave it enabled in the BSP.  Thus,
if your NMI gets sent to one of the APs, it is effectively ignored.  You
can try fixing this in mpapic.c like so:

Index: mpapic.c
===
RCS file: /usr/cvs/src/sys/i386/i386/mpapic.c,v
retrieving revision 1.64
diff -u -r1.64 mpapic.c
--- mpapic.c25 Aug 2003 09:48:47 -  1.64
+++ mpapic.c4 Sep 2003 18:40:47 -
@@ -74,7 +74,9 @@
/* setup LVT2 as NMI, masked till later... */
temp = lapic.lvt_lint1;
temp &= ~(APIC_LVT_M | APIC_LVT_TM | APIC_LVT_IIPP | APIC_LVT_DM);
-   temp |= 0x00010400; /* masked, edge trigger, active hi */
+   temp |= 0x0400; /* edge trigger, active hi */
+   if (PCPU_GET(cpuid) == 0)
+   temp |= APIC_LVT_MSET;  /* masked till later */
 
lapic.lvt_lint1 = temp;

-- 

John Baldwin <[EMAIL PROTECTED]>  <><  http://www.FreeBSD.org/~jhb/
"Power Users Use the Power to Serve!"  -  http://www.FreeBSD.org/
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: 20TB Storage System (fsck????)

2003-09-04 Thread Andrew Kinney
On 4 Sep 2003, at 11:53, Julian Elischer wrote:

> On Wed, 3 Sep 2003, Tim Kientzle wrote:
> 
> > Max Clark wrote:
> > > Ohh, that's an interesting snag. I was under the impression that
> > > 5.x w/ PAE could address more than 4GB of Ram.
> > 
> > That's >4G of memory in the system.  32-bit processors
> > are still limited to 4G processor address space, which means
> > <3G per process (allowing some memory for kernel operations).
> > You can't get around that unless you either go for a 64-bit
> > processor or do some complex coding to break your application
> > storage across multiple processes.
> 
> 
> It's worse than that, becasue I think that to handle >4GB of ram you
> need to limit your processes to about 2G of virtual space.
> 

Our experience has been that with 4GB of RAM (or more) you 
really must increase your KVA to 2GB, leaving only 2GB of UVA.  
So, I would concur with what Julian said.

 ;-)

With the lack of third party filesystem support in FreeBSD, might 
you be better served by looking at a Linux system running 
ReiserFS or one of the other file systems designed for such 
behemoth disk systems?

These days, I think Sun even gives away Solaris licenses with their 
low end x86 servers, so that might even be an option.

UFS is great, but there are other filesystems out there that have 
already addressed such problems from their use in academic, 
government, and scientific computing where gigantic filesystems 
tend to be more prevalent.

>From my brief research on the subject, the FreeBSD community 
has been highly resistant to supporting third party filesystems 
precisely because nobody with such needs as yours has ever 
contributed the code necessary to make third party filesystem 
support a reality.  The response is usually something like "if you 
want it, go program it and submit it to a committer for review," 
which is really the only reasonable response that can be made 
given FreeBSD's goals.

C'est la vie.

Sincerely,
Andrew Kinney
President and
Chief Technology Officer
Advantagecom Networks, Inc.
http://www.advantagecom.net

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: 20TB Storage System (fsck????)

2003-09-04 Thread Julian Elischer


On Thu, 4 Sep 2003, Andrew Kinney wrote:
> > 
> 
> Our experience has been that with 4GB of RAM (or more) you 
> really must increase your KVA to 2GB, leaving only 2GB of UVA.  
> So, I would concur with what Julian said.
> 
>  thrown> ;-)
> 
> With the lack of third party filesystem support in FreeBSD, might 
> you be better served by looking at a Linux system running 
> ReiserFS or one of the other file systems designed for such 
> behemoth disk systems?
> 
> These days, I think Sun even gives away Solaris licenses with their 
> low end x86 servers, so that might even be an option.
> 
> UFS is great, but there are other filesystems out there that have 
> already addressed such problems from their use in academic, 
> government, and scientific computing where gigantic filesystems 
> tend to be more prevalent.
> 

UFS2 will make the filesystem..
All we need is a way to FIX such a filesystem.

My brief analysis of this indicates that a 'serial' fsck should be
possible.

What this would do is read through the filesystem metadata, creating 
several 'list' files on another filesystem. These would then be
duplicated and sorted on  several different fields, and then
recombined in a 'merge' manner, to produce lists of unallocated files,
bad directory entries, duplicate allocated blocks etc. etc.

This would probably be workable in a similar order of magnitute
of time as a normal fsck, except 'offline' and able to handle a much
larger filesystem.

julian


___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: 20TB Storage System (fsck????)

2003-09-04 Thread David Schultz
On Thu, Sep 04, 2003, Andrew Kinney wrote:
> Our experience has been that with 4GB of RAM (or more) you 
> really must increase your KVA to 2GB, leaving only 2GB of UVA.  
> So, I would concur with what Julian said.
> 
>  thrown> ;-)
> 
> With the lack of third party filesystem support in FreeBSD, might 
> you be better served by looking at a Linux system running 
> ReiserFS or one of the other file systems designed for such 
> behemoth disk systems?

I understand that ReiserFS fares rather poorly on large
filesystems.  People have complained about it overflowing the
kernel heap when they try to create massive volumes.  Perhaps more
recent versions have addressed this, and I haven't verified this
myself.

> These days, I think Sun even gives away Solaris licenses with their 
> low end x86 servers, so that might even be an option.

Umm...the Sun UFS implementation doesn't support >2TB volumes,
since it uses 32-bit block numbers.

> UFS is great, but there are other filesystems out there that have 
> already addressed such problems from their use in academic, 
> government, and scientific computing where gigantic filesystems 
> tend to be more prevalent.

Yep.  The most significant problem I see with UFS is that
recovering from a crash or power failure is an O(data) operation,
and that just doesn't scale well.  Other designs (the most
prevalent of which involve logging) support O(1) recovery.

> >From my brief research on the subject, the FreeBSD community 
> has been highly resistant to supporting third party filesystems 
> precisely because nobody with such needs as yours has ever 
> contributed the code necessary to make third party filesystem 
> support a reality.  The response is usually something like "if you 
> want it, go program it and submit it to a committer for review," 
> which is really the only reasonable response that can be made 
> given FreeBSD's goals.

There's probably only a handful of committers who have access to
the quantities of storage that would make this kind of filesystem
development interesting.  Meanwhile, UFS2 is perfectly adequate
for most systems.  I would imagine that an interested company
would have to step forward and fund this sort of work.  Moreover,
there are licensing issues that would have to be considered to do
a port of an existing filesystem...
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"