Hello,
I've recently experienced three segfaults from ods-signer. In all three cases
the log contains errors about keys that cannot be found. I initially assumed
the keys had been erroneously deleted from the HSM, but 'ods-hsmutil' is able
to find them. After a restart the signer seems to work fine.
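
For anyone who wants to repeat that comparison, here is a minimal sketch
(plain Python, not part of OpenDNSSEC) that collects the key IDs the signer
reports as not found, so they can be checked against the output of
'ods-hsmutil list'. The function name and the log path are assumptions; the
pattern is taken from the signer messages quoted further down, and it expects
one log entry per line, as in the syslog file itself rather than as wrapped in
this mail.

```python
import re
import sys
from collections import Counter

# Pattern copied from the ods-signerd messages quoted in this mail.
KEY_NOT_FOUND = re.compile(r"unable to get key: key ([0-9a-f]{32}) not found")


def missing_keys(lines):
    """Count how often each key ID is reported as not found."""
    counts = Counter()
    for line in lines:
        match = KEY_NOT_FOUND.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python3 missing_keys.py /var/log/syslog
    with open(sys.argv[1], errors="replace") as log:
        for key_id, count in missing_keys(log).most_common():
            print(key_id, count)
```

Each key ID printed this way can then be looked up in the HSM listing to see
whether the key is really gone or only temporarily invisible to the signer.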

I'm using OpenDNSSEC 1.3.9-5 and SoftHSM 1.3.3-2 as provided by Debian/wheezy.

The logs below have been abbreviated and redacted; I can provide the full logs
upon request. I've also increased the log level and enabled core dumps, so if
it happens again I should have more information.

Crash on host Ramanujan
Mar 18 00:10:53 ramanujan ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 00:10:53 ramanujan ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 00:10:53 ramanujan ods-enforcerd: Key remove successful.
Mar 18 00:10:53 ramanujan ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 00:10:53 ramanujan ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 00:10:53 ramanujan ods-enforcerd: Key remove successful.
Mar 18 00:10:54 ramanujan ods-signerd: [hsm] idle libhsm connection, trying to 
reopen
Mar 18 00:10:55 ramanujan ods-signerd: [hsm] hsm_get_slot_id(): could not find 
token with the name LocalHSM
Mar 18 00:10:55 ramanujan ods-signerd: [hsm] unable to get key: key 
d1f03a7b14eac19b355e23ce1b47d0d9 not found
Mar 18 00:10:55 ramanujan ods-signerd: [hsm] unable to sign: get key failed
Mar 18 00:10:55 ramanujan ods-signerd: [rrset] unable to sign RRset[6]: error 
creating RRSIG RR
Mar 18 00:10:55 ramanujan ods-signerd: [worker[4]] sign zone example3.nl 
failed: 1 of 11 signatures failed
Mar 18 00:10:55 ramanujan ods-signerd: [worker[4]] backoff task [sign] for zone 
example3.nl with 60 seconds
... same for many other zones ...
Mar 18 00:14:02 ramanujan ods-signerd: [hsm] unable to get key: key 
d56c511a78d2a9406b0d135edc80a758 not found
Mar 18 00:14:02 ramanujan ods-signerd: [hsm] unable to get key: key 
d56c511a78d2a9406b0d135edc80a758 not found
Mar 18 00:14:02 ramanujan ods-signerd: [hsm] unable to sign: get key failed
Mar 18 00:14:02 ramanujan ods-signerd: [rrset] unable to sign RRset[6]: error 
creating RRSIG RR
Mar 18 00:14:02 ramanujan ods-signerd: [worker[7]] sign zone example3.org 
failed: 1 of 10 signatures failed
Mar 18 00:14:02 ramanujan ods-signerd: [worker[7]] backoff task [sign] for zone 
example3.org with 120 seconds
Mar 18 00:14:03 ramanujan ods-signerd: [hsm] unable to get key: key 
cf06b88e9a3867461af1bd628fda4d51 not found
Mar 18 00:14:03 ramanujan ods-signerd: [hsm] unable to get key: key 
cf06b88e9a3867461af1bd628fda4d51 not found
Mar 18 00:14:03 ramanujan ods-signerd: [hsm] unable to sign: get key failed
Mar 18 00:14:03 ramanujan ods-signerd: [rrset] unable to sign RRset[6]: error 
creating RRSIG RR
Mar 18 00:14:03 ramanujan ods-signerd: [worker[7]] sign zone example4.org 
failed: 1 of 13 signatures failed
Mar 18 00:14:03 ramanujan ods-signerd: [rrset] unable to sign RRset[6]: error 
creating RRSIG RR
Mar 18 00:14:03 ramanujan ods-signerd: [worker[7]] sign zone example4.org 
failed: 1 of 13 signatures failed
Mar 18 00:14:03 ramanujan ods-signerd: [worker[7]] backoff task [sign] for zone 
example4.org with 120 seconds
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to get key: key 
462167fef14dff802a768a2234003d60 not found
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to get key: key 
462167fef14dff802a768a2234003d60 not found
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to sign: get key failed
Mar 18 00:14:20 ramanujan ods-signerd: [rrset] unable to sign RRset[6]: error 
creating RRSIG RR
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to get key: key 
462167fef14dff802a768a2234003d60 not found
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to get key: key 
462167fef14dff802a768a2234003d60 not found
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to sign: get key failed
Mar 18 00:14:20 ramanujan ods-signerd: [rrset] unable to sign RRset[12]: error 
creating RRSIG RR
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to get key: key 
462167fef14dff802a768a2234003d60 not found
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to get key: key 
462167fef14dff802a768a2234003d60 not found
Mar 18 00:14:20 ramanujan ods-signerd: [hsm] unable to get key: key 
462167fef14dff802a768a2234003d60 not found
Mar 18 00:14:20 ramanujan kernel: [1059710.358951] ods-signerd[24797]: segfault 
at 10000000010 ip 00007fbb353b206a sp 00007fbb2ed4d490 error 4 in 
libc-2.13.so[7fbb3533b000+180000]



First crash on host Metagross
Mar 14 05:10:36 metagross ods-signerd: [STATS] example.eu RR[count=0 
time=0(sec)] NSEC3[count=0 time=0(sec)] RRSIG[new=1 reused=9 time=3(sec) 
avg=0(sig/sec)] AUDIT[time=0(sec)] TOTAL[time=3(sec)] 
Mar 14 05:10:36 metagross ods-enforcerd: Purging keys...
Mar 14 05:10:36 metagross ods-signerd: [STATS] example2.eu RR[count=0 
time=0(sec)] NSEC3[count=0 time=0(sec)] RRSIG[new=1 reused=8 time=2(sec) 
avg=0(sig/sec)] AUDIT[time=0(sec)] TOTAL[time=2(sec)] 
Mar 14 05:10:37 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 14 05:10:37 metagross ods-signerd: [hsm] idle libhsm connection, trying to 
reopen
Mar 14 05:10:37 metagross ods-signerd: [hsm] idle libhsm connection, trying to 
reopen
Mar 14 05:10:37 metagross ods-signerd: ../../../signer/src/daemon/engine.c at 
367 could not pthread_join(engine->drudgers[i]->thread_id, NULL): Invalid 
argument
Mar 14 05:10:37 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 14 05:10:37 metagross ods-enforcerd: Key remove successful.
Mar 14 05:10:38 metagross ods-enforcerd: Policy standbyyourkey found.
Mar 14 05:10:38 metagross ods-enforcerd: Key sharing is Off.
Mar 14 05:10:38 metagross ods-enforcerd: No zones on policy standbyyourkey, 
skipping...
Mar 14 05:10:38 metagross ods-enforcerd: Purging keys...
Mar 14 05:10:38 metagross ods-signerd: ../../../signer/src/daemon/engine.c at 
367 could not pthread_join(engine->drudgers[i]->thread_id, NULL): Invalid 
argument
Mar 14 05:10:39 metagross kernel: [52691.560115] ods-signerd[3896]: segfault at 
7f30a8a3a9d0 ip 00007f30b83bad8c sp 00007f30b4a41dc0 error 4 in 
libpthread-2.13.so[7f30b83b3000+17000]





Second crash on host Metagross
Mar 18 05:10:28 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:28 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:28 metagross ods-enforcerd: Key remove successful.
... many more ...
Mar 18 05:10:41 metagross ods-signerd: [hsm] idle libhsm connection, trying to 
reopen
Mar 18 05:10:41 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:41 metagross ods-enforcerd: Key remove successful.
Mar 18 05:10:41 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:41 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:41 metagross ods-enforcerd: Key remove successful.
Mar 18 05:10:41 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:42 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:42 metagross ods-enforcerd: Key remove successful.
Mar 18 05:10:42 metagross ods-signerd: [hsm] idle libhsm connection, trying to 
reopen
Mar 18 05:10:42 metagross kernel: [398294.951400] ods-signerd[28480]: segfault 
at 7f352f47b9d0 ip 00007f3535df9d8c sp 00007f3531c7fdc0 error 4 in 
libpthread-2.13.so[7f3535df2000+17000]
Mar 18 05:10:42 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:42 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:42 metagross ods-enforcerd: Key remove successful.
Mar 18 05:10:42 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:42 metagross ods-enforcerd: SoftHSM: C_DestroyObject: An object 
has been destroyed
Mar 18 05:10:42 metagross ods-enforcerd: Key remove successful.
... enforcer continues destroying ...



-- 
Casper Gielen <cgie...@uvt.nl> | LIS UNIX
PGP fingerprint = 16BD 2C9F 8156 C242 F981  63B8 2214 083C F80E 4AF7

Universiteit van Tilburg | Postbus 90153, 5000 LE
Warandelaan 2 | Telefoon 013 466 4100 | G 236 | http://www.uvt.nl

