Andreas Enge <andr...@enge.fr> writes:

> On Tue, Oct 29, 2024 at 12:42:32PM +0100, Andreas Enge wrote:
>> I have restarted nginx and nar-herder (just an educated guess), but
>> to no avail.
>
> I think the problem is further upstream, in the place where the nars are
> actually stored (hatysa). When I download a file that is in the cache,
> such as
>    wget https://bordeaux.guix.gnu.org/nar/zstd/000z8mclk0p7jmrijj6l54bpnam29bqj-r-minfi-1.48.0
> it works.

Hmm, yeah, the nar-herder/nginx on hatysa seems to be stuck. It looks
like quite a low-level problem, since I see btrfs-related errors in
/var/log/messages:

Oct 29 10:14:07 localhost vmunix: [2910798.796657] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
Oct 29 10:14:07 localhost vmunix: [2910798.802941] rcu:         Tasks blocked on level-0 rcu_node (CPUs 0-15): P30086
Oct 29 10:14:07 localhost vmunix: [2910798.809730] rcu:         (detected by 11, t=5252 jiffies, g=253094433, q=9317 ncpus=16)
Oct 29 10:14:07 localhost vmunix: [2910798.817292] task:nginx           state:R  running task     stack:0     pid:30086 tgid:30086 ppid:30085  flags:0x0000000c
Oct 29 10:14:07 localhost vmunix: [2910798.828334] Call trace:
Oct 29 10:14:07 localhost vmunix: [2910798.830945]  dump_backtrace+0x90/0xe8
Oct 29 10:14:07 localhost vmunix: [2910798.834789]  show_stack+0x18/0x24
Oct 29 10:14:07 localhost vmunix: [2910798.838273]  sched_show_task+0x12c/0x18c
Oct 29 10:14:07 localhost vmunix: [2910798.842364]  rcu_sched_clock_irq+0xd00/0xdcc
Oct 29 10:14:07 localhost vmunix: [2910798.846804]  update_process_times+0x68/0xac
Oct 29 10:14:07 localhost vmunix: [2910798.851155]  tick_sched_handle+0x34/0x58
Oct 29 10:14:07 localhost vmunix: [2910798.855245]  tick_nohz_highres_handler+0x50/0xa8
Oct 29 10:14:07 localhost vmunix: [2910798.860030]  __hrtimer_run_queues+0x138/0x1b0
Oct 29 10:14:07 localhost vmunix: [2910798.864556]  hrtimer_interrupt+0xe8/0x244
Oct 29 10:14:07 localhost vmunix: [2910798.868736]  arch_timer_handler_phys+0x34/0x44
Oct 29 10:14:07 localhost vmunix: [2910798.873349]  handle_percpu_devid_irq+0x84/0x130
Oct 29 10:14:07 localhost vmunix: [2910798.878048]  generic_handle_domain_irq+0x2c/0x44
Oct 29 10:14:07 localhost vmunix: [2910798.882834]  gic_handle_irq+0x4c/0x110
Oct 29 10:14:07 localhost vmunix: [2910798.886748]  call_on_irq_stack+0x24/0x4c
Oct 29 10:14:07 localhost vmunix: [2910798.890839]  do_interrupt_handler+0x80/0x84
Oct 29 10:14:07 localhost vmunix: [2910798.895189]  el1_interrupt+0x34/0x68
Oct 29 10:14:07 localhost vmunix: [2910798.898934]  el1h_64_irq_handler+0x18/0x24
Oct 29 10:14:07 localhost vmunix: [2910798.903195]  el1h_64_irq+0x64/0x68
Oct 29 10:14:07 localhost vmunix: [2910798.906763]  xas_descend+0x20/0x8c
Oct 29 10:14:07 localhost vmunix: [2910798.910332]  filemap_get_entry+0x5c/0x1ac
Oct 29 10:14:07 localhost vmunix: [2910798.914513]  __filemap_get_folio+0x4c/0x394
Oct 29 10:14:07 localhost vmunix: [2910798.918864]  alloc_extent_buffer+0x210/0x860 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.924037]  read_tree_block+0x20/0x16c [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.928766]  btrfs_release_path+0x2ac/0x3b4 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.933843]  btrfs_search_slot+0x340/0xb24 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.938835]  btrfs_delete_one_dir_name+0x194/0x4c8 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.944516]  btrfs_lookup_bio_sums+0x1e4/0x40c [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.949854]  __btrfs_panic+0xb1c/0xec4 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.954498]  btrfs_submit_bio+0x28/0x40 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.959224]  ordered_data_exit+0x638/0x2e28 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.964300]  extent_readahead+0x34c/0x384 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.969203]  btrfs_transaction_exit+0x1a3c/0x3f08 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910798.974796]  read_pages+0x80/0x2e0
Oct 29 10:14:07 localhost vmunix: [2910798.978374]  page_cache_ra_unbounded+0x160/0x1e4
Oct 29 10:14:07 localhost vmunix: [2910798.983159]  page_cache_ra_order+0x8c/0x2f8
Oct 29 10:14:07 localhost vmunix: [2910798.987510]  ondemand_readahead+0x170/0x30c
Oct 29 10:14:07 localhost vmunix: [2910798.991859]  page_cache_sync_ra+0x74/0xa4
Oct 29 10:14:07 localhost vmunix: [2910798.996036]  filemap_get_pages+0xc4/0x66c
Oct 29 10:14:07 localhost vmunix: [2910799.000215]  filemap_read+0xbc/0x398
Oct 29 10:14:07 localhost vmunix: [2910799.003959]  btrfs_release_file+0xc84/0x13d8 [btrfs]
Oct 29 10:14:07 localhost vmunix: [2910799.009128]  vfs_read+0x24c/0x2ec
Oct 29 10:14:07 localhost vmunix: [2910799.012617]  ksys_pread64+0x78/0xbc
Oct 29 10:14:07 localhost vmunix: [2910799.016271]  __arm64_sys_pread64+0x20/0x2c
Oct 29 10:14:07 localhost vmunix: [2910799.020533]  invoke_syscall+0x48/0x118
Oct 29 10:14:07 localhost vmunix: [2910799.024451]  el0_svc_common.constprop.0+0x40/0xe8
Oct 29 10:14:07 localhost vmunix: [2910799.029322]  do_el0_svc+0x20/0x2c
Oct 29 10:14:07 localhost vmunix: [2910799.032803]  el0_svc+0x34/0xb8
Oct 29 10:14:07 localhost vmunix: [2910799.036025]  el0t_64_sync_handler+0x13c/0x158
Oct 29 10:14:07 localhost vmunix: [2910799.040547]  el0t_64_sync+0x190/0x194
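
To check whether a log holds more reports like the one above, something
along these lines might help. It is only a rough sketch and was not run on
hatysa. It assumes syslog-style lines in the format shown above, with each
kernel message on a single line (wrapped lines would need joining first).
The names summarize_stalls, STALL_RE and FRAME_RE are made up for this
example.

```python
import re

# Start of an RCU stall report, e.g.
# "... rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:"
STALL_RE = re.compile(r"rcu: INFO: \w+ detected stalls")

# A call-trace frame after the kernel timestamp, e.g.
# "... [2910798.924037]  read_tree_block+0x20/0x16c [btrfs]"
FRAME_RE = re.compile(
    r"\[\s*\d+\.\d+\]\s+"                            # kernel timestamp
    r"(?P<symbol>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"   # symbol+offset/size
    r"(?:\s+\[(?P<module>\w+)\])?\s*$"               # optional [module]
)


def summarize_stalls(lines):
    """Return one dict per stall report: frame count and frames per module."""
    reports = []
    current = None
    for line in lines:
        if STALL_RE.search(line):
            # A new stall report begins; later frames are counted against it.
            current = {"frames": 0, "modules": {}}
            reports.append(current)
            continue
        if current is None:
            continue  # ignore anything before the first stall report
        match = FRAME_RE.search(line)
        if match:
            current["frames"] += 1
            module = match.group("module")
            if module:
                current["modules"][module] = current["modules"].get(module, 0) + 1
    return reports
```

Passing an open file object for /var/log/messages to summarize_stalls
would give one entry per stall report, with a count of how many frames
came from each kernel module such as btrfs. Frames are attributed to the
most recent stall line, so unrelated traces that follow a stall report
would be counted too.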


> As far as I know, only Chris has access to this machine.

We should be able to get other people SSH access. I'll try restarting
the machine.
