Boot-looping or hanging Debian 11 FAI UEFI PXE

2024-05-07 Diskussionsfäden Nicolas Formichella
Hello,

I am wanting to test a deployment for a Debian 11 KVM VM using a Debian 12 KVM 
VM, but it crashes and bootloops with seemingly no reasoning.

- Debian 12 FAI was booting correctly on the same VM
- Switching nfsroot config from Bookworm to Bullseye made it bootloop
- Replacing `rootovl` by `rd.live.overlay.overlayfs=1 
rd.live.overlay.readonly=1`[1] did not help
- Remaking the NFSROOT doesn't help
- faiserver and fai are both in `NFSROOT_EXTRA_HOSTS` and were correctly 
contacted in Bookworm

The crash seems to come right after NFS mounts but even debug log seem to say 
nothing, and sometimes it just straight up hangs at IPv6 configuration.

Hanging log using `systemd.log_level=debug systemd.log_target=console 
systemd.journald.forward_to_console=1` : [2]
Crash log using `systemd.log_level=debug systemd.log_target=console 
systemd.journald.forward_to_console=1` : [3]

pxelinux.cfg config (tried with a fresh `fai-chboot - : 
```
append initrd=initrd.img-5.10.0-29-amd64 ip=dhcp  root=/srv/fai/nfsroot:vers=3 
rd.live.overlay.overlayfs=1 rd.live.overlay.readonly=1 console=tty0 
console=ttyS0,115200  FAI_FLAGS=debug,sshd,createvt 
FAI_CONFIG_SRC=nfs://faiserver/srv/fai/fai_config/ FAI_ACTION=install 
systemd.log_level=debug systemd.log_target=console 
systemd.journald.forward_to_console=1
```

(nfs://faiserver/srv/fai/fai_config/ is correct and is a NFS share)

Are there 

[1]: https://github.com/dracutdevs/dracut/issues/1565
[2]: Wasn't able to capture that
[3]: https://paste.opensuse.org/pastes/28fcc183cacb

Re: Boot-looping or hanging Debian 11 FAI UEFI PXE

2024-05-07 Diskussionsfäden Dirk Geschke
Hi Nicolas,

> The crash seems to come right after NFS mounts but even debug log seem to say 
> nothing, and sometimes it just straight up hangs at IPv6 configuration.
> 
> pxelinux.cfg config (tried with a fresh `fai-chboot - : 
> ```
> append initrd=initrd.img-5.10.0-29-amd64 ip=dhcp  
> root=/srv/fai/nfsroot:vers=3 rd.live.overlay.overlayfs=1 
> rd.live.overlay.readonly=1 console=tty0 console=ttyS0,115200  
> FAI_FLAGS=debug,sshd,createvt 
> FAI_CONFIG_SRC=nfs://faiserver/srv/fai/fai_config/ FAI_ACTION=install 
> systemd.log_level=debug systemd.log_target=console 
> systemd.journald.forward_to_console=1
> ```

shouldn't there be something like an NFS server in the configuration?

E.g. root=nfs::/srv/fai/nfsroot:vers=3

Or how should it find the the NFS server?

Best regards

Dirk

-- 
+--+
| Dr. Dirk Geschke   / Plankensteinweg 61/ 85435 Erding|
| Telefon: 08122-559448  / Mobil: 0176-96906350 / Fax: 08122-9818106   |
| d...@geschke-online.de / d...@lug-erding.de  / kont...@lug-erding.de |
+--+



Re: Boot-looping or hanging Debian 11 FAI UEFI PXE

2024-05-07 Diskussionsfäden Thomas Lange
> On Tue, 7 May 2024 08:13:42 +, Nicolas Formichella 
>  said:

> Hello,
> I am wanting to test a deployment for a Debian 11 KVM VM using a Debian 
12 KVM VM, but it crashes and bootloops with seemingly no reasoning.

> ```
> append initrd=initrd.img-5.10.0-29-amd64 ip=dhcp  
root=/srv/fai/nfsroot:vers=3 rd.live.overlay.overlayfs=1 
rd.live.overlay.readonly=1 console=tty0 console=ttyS0,115200  
FAI_FLAGS=debug,sshd,createvt 
FAI_CONFIG_SRC=nfs://faiserver/srv/fai/fai_config/ FAI_ACTION=install 
systemd.log_level=debug systemd.log_target=console 
systemd.journald.forward_to_console=1
> ```

FAI never used these options:
  rd.live.overlay.overlayfs=1 rd.live.overlay.readonly=1
I'm pretty sure they will break things.
Instead you have to use the option rootovl.

There may be problems if your nfsroot is not build properly. First,
which FAI version are you using? Do you use systemd inside the nfsroot?
FAI 6.2 uses systemd during the installation and if you still have an
older /etc/fai/NFSROOT config file systsemd may be missing inside the
nfsroot. If you still have this line in NFSROOT (when using FAI 6.2),
please remove it abd rebuild your nfsroot.

sysvinit-core systemd-sysv-


If you are using FAI < 6.2, the nfsroot does not use systemd and I
wonder why you have added some systemd.log... options to the kernel
command line.
-- 
regards Thomas


Re: Boot-looping or hanging Debian 11 FAI UEFI PXE

2024-05-07 Diskussionsfäden Nicolas Formichella
Hello Thomas and thanks all for your answers

I am using FAI v.6.2.2 on a Debian 12 base

>Do you use systemd inside the nfsroot? Yes
> FAI never used these options:
  rd.live.overlay.overlayfs=1 rd.live.overlay.readonly=1
I'm pretty sure they will break things.
Instead you have to use the option rootovl. : This was a debugging attempt, 
resetting it to

```
append initrd=initrd.img-5.10.0-29-amd64 ip=dhcp root=/srv/fai/nfsroot:vers=3 
rootovl  FAI_FLAGS=verbose,sshd,createvt 
FAI_CONFIG_SRC=nfs://faiserver/srv/fai/fai_config/ FAI_ACTION=install
```

Did not fix anything and exhibits the same crashing behaviour, which did not 
occur when running

> If you still have this line in NFSROOT (when using FAI 6.2),
please remove it abd rebuild your nfsroot.

> sysvinit-core systemd-sysv-

This VM was clean-installed with 6.2.2 and does not contain the line in the 
/etc/fai/NFSROOT file

Regards,
Nicolas


From: linux-fai  on behalf of Thomas Lange 

Sent: Tuesday 7 May 2024 11:09
To: fully automatic installation for Linux 
Subject: Re: Boot-looping or hanging Debian 11 FAI UEFI PXE
 
> On Tue, 7 May 2024 08:13:42 +, Nicolas Formichella 
>  said:

    > Hello,
    > I am wanting to test a deployment for a Debian 11 KVM VM using a Debian 
12 KVM VM, but it crashes and bootloops with seemingly no reasoning.

    > ```
    > append initrd=initrd.img-5.10.0-29-amd64 ip=dhcp  
root=/srv/fai/nfsroot:vers=3 rd.live.overlay.overlayfs=1 
rd.live.overlay.readonly=1 console=tty0 console=ttyS0,115200  
FAI_FLAGS=debug,sshd,createvt 
FAI_CONFIG_SRC=nfs://faiserver/srv/fai/fai_config/ FAI_ACTION=install 
systemd.log_level=debug systemd.log_target=console 
systemd.journald.forward_to_console=1
    > ```

FAI never used these options:
  rd.live.overlay.overlayfs=1 rd.live.overlay.readonly=1
I'm pretty sure they will break things.
Instead you have to use the option rootovl.

There may be problems if your nfsroot is not build properly. First,
which FAI version are you using? Do you use systemd inside the nfsroot?
FAI 6.2 uses systemd during the installation and if you still have an
older /etc/fai/NFSROOT config file systsemd may be missing inside the
nfsroot. If you still have this line in NFSROOT (when using FAI 6.2),
please remove it abd rebuild your nfsroot.

sysvinit-core systemd-sysv-


If you are using FAI < 6.2, the nfsroot does not use systemd and I
wonder why you have added some systemd.log... options to the kernel
command line.
--
regards Thomas

Re: Boot-looping or hanging Debian 11 FAI UEFI PXE

2024-05-07 Diskussionsfäden Thomas Lange
> On Tue, 7 May 2024 09:59:35 +, Nicolas Formichella 
>  said:

> This VM was clean-installed with 6.2.2 and does not contain the line in 
the /etc/fai/NFSROOT file
Please put the log of fai-make-nfsroot -fv (it's also in /var/log/fai/)
onto paste.debian.net so I can check if the nfsroot was build
properly.

regards Thomas


Re: Boot-looping or hanging Debian 11 FAI UEFI PXE

2024-05-07 Diskussionsfäden Nicolas Formichella
Hello Thomas,

`fai-make-nfsroot -fv` : https://paste.debian.net/hidden/5f60f92a/

Regards,
Nicolas


From: linux-fai  on behalf of Thomas Lange 

Sent: Tuesday 7 May 2024 12:49
To: fully automatic installation for Linux 
Subject: Re: Boot-looping or hanging Debian 11 FAI UEFI PXE
 
> On Tue, 7 May 2024 09:59:35 +, Nicolas Formichella 
>  said:

    > This VM was clean-installed with 6.2.2 and does not contain the line in 
the /etc/fai/NFSROOT file
Please put the log of fai-make-nfsroot -fv (it's also in /var/log/fai/)
onto paste.debian.net so I can check if the nfsroot was build
properly.

regards Thomas

Re: Boot-looping or hanging Debian 11 FAI UEFI PXE

2024-05-07 Diskussionsfäden Thomas Lange
> On Tue, 7 May 2024 12:50:34 +, Nicolas Formichella 
>  said:

> Hello Thomas,
> `fai-make-nfsroot -fv` : https://paste.debian.net/hidden/5f60f92a/

I wonder why your log does not end like mine. This is mine:


dracut: Generating /boot/initrd.img-5.10.0-9-amd64
dracut: dracut-squash only supports systemd bases initramfs
/usr/lib/dracut/modules.d/45url-lib/module-setup.sh: line 35: warning: command 
substitution: ignored null byte in input
/usr/lib/dracut/modules.d/45url-lib/module-setup.sh: line 35: warning: command 
substitution: ignored null byte in input
'/srv/fai/nfsroot/boot/vmlinuz-5.10.0-9-amd64' -> 
'/srv/tftp/fai/vmlinuz-5.10.0-9-amd64'
'/srv/fai/nfsroot/boot/initrd.img-5.10.0-9-amd64' -> 
'/srv/tftp/fai/initrd.img-5.10.0-9-amd64'
TFTP environment prepared. Enable DHCP and start the TFTP daemon on root 
/srv/tftp/fai.
FAI packages and related packages inside the nfsroot:
dracut 051-1
dracut-live051-1
dracut-network 051-1
dracut-squash  051-1
fai-client 5.10.3
fai-nfsroot5.10.3
fai-setup-storage  5.10.3
Waiting for background jobs to finish
[2]+  Running nice xz -q $NFSROOT/var/tmp/base.tar &  (wd: 
/srv/fai/nfsroot)
fai-make-nfsroot finished properly.

-- 
regards Thomas