On 11/12/17 14:13, Otto Moerbeek wrote:
> On Sun, Nov 12, 2017 at 01:28:39PM -0500, Nick Holland wrote:
>
>> Help.
>>
>> I was upgrading a few very similar machines to -current today.
>> ONE of the three decided to be unpleasant. The thing has a
>> serial console, but it is about 370km from me. :-/
>>
>> Upgrade from Sep 9 current to today's current via bsd.rd, just
>> like the other two.
>>
>> Upon reboot, it does this (from /boot) :
>>
>> booting hd0a:/bsd: 8484712+2429968+244048+0+667648 [636809heap full
>> (0x9d304+65536)
>>
>> And then reboots the system, as if from power-down/power-up.
>> (already something I haven't seen before)
>>
>> Reboot from "bsd.rd" and "bsd.sp", same results. reboot from "obsd"
>> (Sept 9), same results. Not a kernel problem, it seems. About this
>> point, I'm starting to think how the serial console has let me down.
>>
>> I remember how to bring up a DRAC remote CD image via ssh tunnels
>> to the drac and how to run java in a windows browser, and
>> reboot off the remote CD image, do another upgrade, all goes fine
>> (again), but upon reboot, same results... "heap full" and reboot.
>>
>> Boot from remote CD, at the boot> prompt, enter "boot hd0a:/bsd",
>> and it boots Just Fine from the local hard disk (only boot pulled
>> from the remote CD). Boot loader! Reinstalled boot:
>>
>> # installboot -v sd0
>> Using / as root
>> installing bootstrap on /dev/rsd0c
>> using first-stage /usr/mdec/biosboot, second-stage /usr/mdec/boot
>> copying /usr/mdec/boot to /boot
>> /boot is 3 blocks x 32768 bytes
>> fs block shift 3; part offset 64; inode block 24, offset 2088
>> master boot record (MBR) at sector 0
>> partition 3: type 0xA6 offset 64 size 2000397671
>> /usr/mdec/biosboot will be written at sector 64
>>
>> good, right?
>>
>> Reboot off local hard disk, boom. problem is still there. maybe
>> not the boot loader. :-/
>>
>> Verified /boot on trouble system and good system are the same.
>>
>> I'm not going to cry "bug", since there are two nearly identical
>> systems working just fine. But I can't think of what I did wrong
>> or what to do to fix it.
>>
>> Suggestions?
>
> You are hitting -DHEAP_LIMIT=0xA0000 in /boot. The code is in libsa/alloc.c
>
> No idea why. But something in that system is different.
>
> You do have one weird line in your disklabel output: a filesystem
> mounted on swap?

that's an mfs. This application has one directory that benefits
HUGELY from being an MFS for tmp files. Though the reboot happens
long before the mfs is created.

$ more /etc/fstab
cde728ba2c9bbe7.b none swap sw
ccde728ba2c9bbe7.a / ffs rw,noatime 1 1
ccde728ba2c9bbe7.h /home ffs rw,noatime,nodev,nosuid 1 2
ccde728ba2c9bbe7.e /tmp ffs rw,noatime,nodev,nosuid 1 2
ccde728ba2c9bbe7.d /usr ffs rw,noatime,nodev 1 2
ccde728ba2c9bbe7.f /var ffs rw,noatime,nodev,nosuid 1 2
ccde728ba2c9bbe7.g /repo ffs rw,noatime,nodev 1 2
ccde728ba2c9bbe7.i /repo/anoncvs/dev ffs rw,noatime,nosuid 1 2
/dev/sd0b /repo/anoncvs/tmp mfs rw,nodev,nosuid,-m=1,-s=3072000,-i=2048 0 0

> Can you boot into single user mode?

nope. Considering how fast the reboot happens, I wouldn't have
expected it to, unless something is very different very early in
the boot process. This is what happened:

On the console:

Using drive 0, partion 3.
Loading...
probing: pc0 com0 com1 mem[631K 3038M 2M 68K 72K 176k 64K 13312M a20=on]
disk: fd0 hd0+
>> OpenBSD/amd64 BOOT 3.33
switching console to com0

and then on the serial console:

>> OpenBSD/amd64 BOOT 3.33
boot> boot -s
booting hd0a:/bsd: 8484304+2429960+244080+0+667648 [643739heap full
(0x9d4fc+65536)

(boom. reboot)

here's a dmesg diff between the "good" and "bad" machines...

$ diff -u dmesg.good dmesg.bad
--- dmesg.good Sun Nov 12 14:51:30 2017
+++ dmesg.bad Sun Nov 12 14:51:21 2017
@@ -1,7 +1,7 @@
 OpenBSD 6.2-current (GENERIC.MP) #203: Sat Nov 11 19:01:19 MST 2017
     dera...@amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP
 real mem = 17131339776 (16337MB)
-avail mem = 16605302784 (15836MB)
+avail mem = 16605294592 (15836MB)
 mpath0 at root
 scsibus0 at mpath0: 256 targets
 mainbus0 at root
@@ -16,46 +16,46 @@
 acpihpet0 at acpi0: 14318179 Hz
 acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat
 cpu0 at mainbus0: apid 0 (boot processor)
-cpu0: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3193.18 MHz
+cpu0: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3193.22 MHz
 cpu0: FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,DEADLINE,AES,XSAVE,AVX,NXE,RDTSCP,LONG,LAHF,PERF,ITSC,SENSOR,ARAT
 cpu0: 256KB 64b/line 8-way L2 cache
-acpihpet0: recalibrated TSC frequency 3192750214 Hz
+acpihpet0: recalibrated TSC frequency 3192750287 Hz
 cpu0: smt 0, core 0, package 0
 mtrr: Pentium Pro MTRR support, 10 var ranges, 88 fixed ranges
 cpu0: apic clock running at 99MHz
 cpu0: mwait min=64, max=64, C-substates=0.2.1.1, IBE
 cpu1 at mainbus0: apid 1 (application processor)
-cpu1: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.75 MHz
+cpu1: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.76 MHz
 cpu1: FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,DEADLINE,AES,XSAVE,AVX,NXE,RDTSCP,LONG,LAHF,PERF,ITSC,SENSOR,ARAT
 cpu1: 256KB 64b/line 8-way L2 cache
 cpu1: smt 1, core 0, package 0
 cpu2 at mainbus0: apid 2 (application processor)
-cpu2: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.75 MHz
+cpu2: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.76 MHz
 cpu2: FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,DEADLINE,AES,XSAVE,AVX,NXE,RDTSCP,LONG,LAHF,PERF,ITSC,SENSOR,ARAT
 cpu2: 256KB 64b/line 8-way L2 cache
 cpu2: smt 0, core 1, package 0
 cpu3 at mainbus0: apid 3 (application processor)
-cpu3: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.75 MHz
+cpu3: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.76 MHz
 cpu3: FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,DEADLINE,AES,XSAVE,AVX,NXE,RDTSCP,LONG,LAHF,PERF,ITSC,SENSOR,ARAT
 cpu3: 256KB 64b/line 8-way L2 cache
 cpu3: smt 1, core 1, package 0
 cpu4 at mainbus0: apid 4 (application processor)
-cpu4: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.75 MHz
+cpu4: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.76 MHz
 cpu4: FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,DEADLINE,AES,XSAVE,AVX,NXE,RDTSCP,LONG,LAHF,PERF,ITSC,SENSOR,ARAT
 cpu4: 256KB 64b/line 8-way L2 cache
 cpu4: smt 0, core 2, package 0
 cpu5 at mainbus0: apid 5 (application processor)
-cpu5: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.75 MHz
+cpu5: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.76 MHz
 cpu5: FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,DEADLINE,AES,XSAVE,AVX,NXE,RDTSCP,LONG,LAHF,PERF,ITSC,SENSOR,ARAT
 cpu5: 256KB 64b/line 8-way L2 cache
 cpu5: smt 1, core 2, package 0
 cpu6 at mainbus0: apid 6 (application processor)
-cpu6: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.75 MHz
+cpu6: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.76 MHz
 cpu6: FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,DEADLINE,AES,XSAVE,AVX,NXE,RDTSCP,LONG,LAHF,PERF,ITSC,SENSOR,ARAT
 cpu6: 256KB 64b/line 8-way L2 cache
 cpu6: smt 0, core 3, package 0
 cpu7 at mainbus0: apid 7 (application processor)
-cpu7: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.75 MHz
+cpu7: Intel(R) Xeon(R) CPU E31230 @ 3.20GHz, 3192.76 MHz
 cpu7: FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,DEADLINE,AES,XSAVE,AVX,NXE,RDTSCP,LONG,LAHF,PERF,ITSC,SENSOR,ARAT
 cpu7: 256KB 64b/line 8-way L2 cache
 cpu7: smt 1, core 3, package 0
@@ -121,16 +121,15 @@
 wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
 pcib0 at pci0 dev 31 function 0 "Intel C202 LPC" rev 0x04
 ahci0 at pci0 dev 31 function 2 "Intel 6 Series AHCI" rev 0x04: msi, AHCI 1.3
-ahci0: port 2: 3.0Gb/s
-ahci0: port 3: 3.0Gb/s
+ahci0: port 0: 3.0Gb/s
+ahci0: port 1: 1.5Gb/s
 scsibus1 at ahci0: 32 targets
-sd0 at scsibus1 targ 2 lun 0: <ATA, Samsung SSD 850, EXM0> SCSI3 0/direct fixed naa.50025388400562d4
+sd0 at scsibus1 targ 0 lun 0: <ATA, Samsung SSD 850, EXM0> SCSI3 0/direct fixed naa.50025388400563fe
 sd0: 976762MB, 512 bytes/sector, 2000409264 sectors, thin
-sd1 at scsibus1 targ 3 lun 0: <ATA, Samsung SSD 850, EXM0> SCSI3 0/direct fixed naa.5002538c70007b02
-sd1: 1953514MB, 512 bytes/sector, 4000797360 sectors, thin
+cd0 at scsibus1 targ 1 lun 0: <PLDS, DVD+-RW DS-8A8SH, KD51> ATAPI 5/cdrom removable
 ichiic0 at pci0 dev 31 function 3 "Intel 6 Series SMBus" rev 0x04: apic 0 int 19
 iic0 at ichiic0
-sdtemp0 at iic0 addr 0x18: mcp98243
+sdtemp0 at iic0 addr 0x18: stts2002
 sdtemp1 at iic0 addr 0x19: mcp98243
 sdtemp2 at iic0 addr 0x1a: mcp98243
 sdtemp3 at iic0 addr 0x1b: stts2002
@@ -141,7 +140,6 @@
 isa0 at pcib0
 isadma0 at isa0
 com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
-com0: console
 com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
 pckbc0 at isa0 port 0x60/5 irq 1 irq 12
 pcppi0 at isa0 port 0x61
@@ -155,14 +153,23 @@
 uhidev1: iclass 3/1
 ums0 at uhidev1: 3 buttons, Z dir
 wsmouse0 at ums0 mux 0
+umass0 at uhub2 port 2 configuration 1 interface 0 "Avocent USB Composite Device-1" rev 2.00/0.00 addr 4
+umass0: using SCSI over Bulk-Only
+scsibus2 at umass0: 2 targets, initiator 0
+sd1 at scsibus2 targ 1 lun 0: <iDRAC, LCDRIVE, 0323> SCSI0 0/direct removable
+umass1 at uhub2 port 2 configuration 1 interface 1 "Avocent USB Composite Device-1" rev 2.00/0.00 addr 4
+umass1: using SCSI over Bulk-Only
+scsibus3 at umass1: 2 targets, initiator 0
+cd1 at scsibus3 targ 1 lun 0: <iDRAC, Virtual CD, 0323> SCSI0 5/cdrom removable
+sd2 at scsibus3 targ 1 lun 1: <iDRAC, Virtual Floppy, 0323> SCSI0 0/direct removable
 uhub3 at uhub1 port 1 configuration 1 interface 0 "Intel Rate Matching Hub" rev 2.00/0.00 addr 2
 uhub4 at uhub3 port 5 configuration 1 interface 0 "Standard Microsystems product 0x2514" rev 2.00/0.00 addr 3
 vscsi0 at root
-scsibus2 at vscsi0: 256 targets
+scsibus4 at vscsi0: 256 targets
 softraid0 at root
-scsibus3 at softraid0: 256 targets
-root on sd0a (ff6add5e908e72c7.a) swap on sd0b dump on sd0b
-bnx0: address d4:ae:52:b9:6a:80
+scsibus5 at softraid0: 256 targets
+root on sd0a (ccde728ba2c9bbe7.a) swap on sd0b dump on sd0b
+bnx0: address d4:ae:52:b9:6a:10
 brgphy0 at bnx0 phy 1: BCM5709 10/100/1000baseT PHY, rev. 8
-bnx1: address d4:ae:52:b9:6a:81
+bnx1: address d4:ae:52:b9:6a:11
 brgphy1 at bnx1 phy 1: BCM5709 10/100/1000baseT PHY, rev. 8

Nick.
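
As a quick sanity check on the two "heap full" messages above, the numbers can be compared with the HEAP_LIMIT=0xA0000 that Otto quoted. The sketch below is only arithmetic, written in Python. It assumes (not verified against the libsa source) that the two values in the message are the address the allocation would start at and the requested size in bytes, and that the loader complains once start + size passes the limit. The function name overshoot is made up for illustration.

```python
# Arithmetic check of the "heap full (0x...+65536)" messages.
# ASSUMPTION: the message prints (start address + requested size), and the
# boot loader panics when start + size exceeds HEAP_LIMIT. This is a reading
# of the output, not a statement about the actual libsa code.

HEAP_LIMIT = 0xA0000  # value quoted from the /boot build flags (640 KiB)


def overshoot(start, size, limit=HEAP_LIMIT):
    """Return how many bytes (start + size) lies beyond limit.

    A positive result means the request would not fit under the limit.
    """
    return start + size - limit


# The two (start, size) pairs taken from the failed boots in this thread.
reports = [
    (0x9D304, 65536),  # first report, plain boot
    (0x9D4FC, 65536),  # second report, "boot -s"
]

for start, size in reports:
    end = start + size
    print(f"start={start:#x} size={size} end={end:#x} "
          f"limit={HEAP_LIMIT:#x} over_by={overshoot(start, size)}")
```

Under those assumptions both requests would end well past 0xA0000, and the heap was already within a few KB of the limit before the 64KB request arrived, which fits with the failure being independent of which kernel is loaded.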