from:"Aleksa Sarai"

[PATCH 2/2] selftests/mount_setattr: add smoke tests for open_tree_attr(2) bug

2025-08-07 Thread Aleksa Sarai

: Aleksa Sarai --- .../selftests/mount_setattr/mount_setattr_test.c | 77 ++ 1 file changed, 64 insertions(+), 13 deletions(-) diff --git a/tools/testing/selftests/mount_setattr/mount_setattr_test.c b/tools/testing/selftests/mount_setattr/mount_setattr_test.c index

[PATCH 1/2] open_tree_attr: do not allow id-mapping changes without OPEN_TREE_CLONE

2025-08-07 Thread Aleksa Sarai

nd was likely unintentional. Fixes: 2462651ffa76 ("fs: allow changing idmappings") Cc: # v6.15+ Signed-off-by: Aleksa Sarai --- fs/namespace.c | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/fs/namespace.c b/fs/namespace.c index 55f28cebbe7d..196a7873897b 100644 --- a/f

[PATCH 0/2] open_tree_attr: do not allow id-mapping changes without OPEN_TREE_CLONE

2025-08-07 Thread Aleksa Sarai

ikely unintentional. For what it's worth, I found this while working on the open_tree_attr(2) man page, and was trying to figure out what open_tree_attr(2)'s behaviour was in the (slightly fruity) ~OPEN_TREE_CLONE case. Signed-off-by: Aleksa Sarai --- Aleksa Sarai (2): open_tree_

Re: [PATCH v3 1/2] fscontext: do not consume log entries when returning -EMSGSIZE

2025-08-06 Thread Aleksa Sarai

On 2025-08-06, Al Viro wrote: > On Thu, Aug 07, 2025 at 03:55:23AM +1000, Aleksa Sarai wrote: > > > - goto err_free; > > - ret = -EFAULT; > > - if (copy_to_user(_buf, p, n) != 0) > > - goto err_free; > > + if (copy_to_user(_buf

[PATCH v3 1/2] fscontext: do not consume log entries when returning -EMSGSIZE

2025-08-06 Thread Aleksa Sarai

Howells Cc: # v5.2+ Signed-off-by: Aleksa Sarai --- fs/fsopen.c | 54 ++ 1 file changed, 30 insertions(+), 24 deletions(-) diff --git a/fs/fsopen.c b/fs/fsopen.c index 1aaf4cb2afb2..538fdf814fbf 100644 --- a/fs/fsopen.c +++ b/fs/fsopen.c @@ -18,47

[PATCH v3 2/2] selftests/filesystems: add basic fscontext log tests

2025-08-06 Thread Aleksa Sarai

Signed-off-by: Aleksa Sarai --- tools/testing/selftests/filesystems/.gitignore | 1 + tools/testing/selftests/filesystems/Makefile | 2 +- tools/testing/selftests/filesystems/fclog.c| 130 + 3 files changed, 132 insertions(+), 1 deletion(-) diff --git a/tools

[PATCH v3 0/2] fscontext: do not consume log entries when returning -EMSGSIZE

2025-08-06 Thread Aleksa Sarai

before we remove the message from the ring buffer, let's just do that instead. While we're at it, refactor some fscontext_read() into a separate helper to make the ring buffer logic a bit easier to read. Fixes: 007ec26cdc9f ("vfs: Implement logging through fs_context") Signed-

[PATCH v2 1/2] fscontext: do not consume log entries when returning -EMSGSIZE

2025-08-05 Thread Aleksa Sarai

Howells Cc: # v5.2+ Signed-off-by: Aleksa Sarai --- fs/fsopen.c | 54 ++ 1 file changed, 30 insertions(+), 24 deletions(-) diff --git a/fs/fsopen.c b/fs/fsopen.c index 1aaf4cb2afb2..538fdf814fbf 100644 --- a/fs/fsopen.c +++ b/fs/fsopen.c @@ -18,47

[PATCH v2 2/2] selftests/filesystems: add basic fscontext log tests

2025-08-05 Thread Aleksa Sarai

Signed-off-by: Aleksa Sarai --- tools/testing/selftests/filesystems/.gitignore | 1 + tools/testing/selftests/filesystems/Makefile | 2 +- tools/testing/selftests/filesystems/fclog.c| 135 + 3 files changed, 137 insertions(+), 1 deletion(-) diff --git a/tools

[PATCH v2 0/2] fscontext: do not consume log entries when returning -EMSGSIZE

2025-08-05 Thread Aleksa Sarai

before we remove the message from the ring buffer, let's just do that instead. While we're at it, refactor some fscontext_read() into a separate helper to make the ring buffer logic a bit easier to read. Fixes: 007ec26cdc9f ("vfs: Implement logging through fs_context") Signed-

[PATCH 2/2] selftests/filesystems: add basic fscontext log tests

2025-08-05 Thread Aleksa Sarai

Signed-off-by: Aleksa Sarai --- tools/testing/selftests/filesystems/.gitignore | 1 + tools/testing/selftests/filesystems/Makefile | 2 +- tools/testing/selftests/filesystems/fclog.c| 137 + 3 files changed, 139 insertions(+), 1 deletion(-) diff --git a/tools

[PATCH 1/2] fscontext: do not consume log entries for -EMSGSIZE case

2025-08-05 Thread Aleksa Sarai

we remove the message from the ring buffer, let's just do that instead. Fixes: 007ec26cdc9f ("vfs: Implement logging through fs_context") Cc: David Howells Cc: # v5.2+ Signed-off-by: Aleksa Sarai --- fs/fsopen.c | 22 ++ 1 file changed, 14 insertions(+), 8 del

[PATCH 0/2] fscontext: do not consume log entries for -EMSGSIZE case

2025-08-05 Thread Aleksa Sarai

we remove the message from the ring buffer, let's just do that instead. Fixes: 007ec26cdc9f ("vfs: Implement logging through fs_context") Signed-off-by: Aleksa Sarai --- Aleksa Sarai (2): fscontext: do not consume log entries for -EMSGSIZE case selftests/filesys

Re: (subset) [PATCH RFC v3 00/10] extensible syscalls: CHECK_FIELDS to allow for easier feature detection

2024-10-21 Thread Aleksa Sarai

On 2024-10-21, Christian Brauner wrote: > On Thu, 10 Oct 2024 07:40:33 +1100, Aleksa Sarai wrote: > > This is something that I've been thinking about for a while. We had a > > discussion at LPC 2020 about this[1] but the proposals suggested there > > never materialised

[PATCH RFC v3 10/10] selftests: mount_setattr: add CHECK_FIELDS selftest

2024-10-09 Thread Aleksa Sarai

While we're at it -- to make testing for error codes with ASSERT_* easier, make the sys_* wrappers return -errno in case of an error. For some reason, the include doesn't correct import , leading to compilation errors -- so just import manually. Signed-off-by: Aleksa Sarai --- too

[PATCH RFC v3 09/10] mount_setattr: add CHECK_FIELDS flag to usize argument

2024-10-09 Thread Aleksa Sarai

mfollow_supported = ((attr.attr_clr | attr.attr_set) & MOUNT_ATTR_NOSYMFOLLOW); mountattr_idmap_supported = ((attr.attr_clr | attr.attr_set) & MOUNT_ATTR_IDMAP) && (attr.userns_fd != 0); break; } } Signed-off-by: Aleks

[PATCH RFC v3 08/10] selftests: clone3: add CHECK_FIELDS selftests

2024-10-09 Thread Aleksa Sarai

Signed-off-by: Aleksa Sarai --- tools/testing/selftests/clone3/.gitignore | 1 + tools/testing/selftests/clone3/Makefile| 4 +- .../testing/selftests/clone3/clone3_check_fields.c | 264 + 3 files changed, 267 insertions(+), 2 deletions(-) diff --git

[PATCH RFC v3 07/10] clone3: add CHECK_FIELDS flag to usize argument

2024-10-09 Thread Aleksa Sarai

roup_supported = (how.flags & CLONE_INTO_CGROUP) && (how.cgroup != 0); break; } } Signed-off-by: Aleksa Sarai --- kernel/fork.c | 30 -- 1 file changed, 28 insertions(+), 2 deletions(-) diff --git a/kernel/fo

[PATCH RFC v3 06/10] selftests: openat2: add CHECK_FIELDS selftests

2024-10-09 Thread Aleksa Sarai

Signed-off-by: Aleksa Sarai --- tools/testing/selftests/openat2/Makefile | 2 + tools/testing/selftests/openat2/openat2_test.c | 161 - 2 files changed, 161 insertions(+), 2 deletions(-) diff --git a/tools/testing/selftests/openat2/Makefile b/tools/testing

[PATCH RFC v3 05/10] selftests: openat2: add 0xFF poisoned data after misaligned struct

2024-10-09 Thread Aleksa Sarai

We should also verify that poisoned data after a misaligned struct is also handled correctly by is_zeroed_user(). This test passes with no kernel changes needed, so is_zeroed_user() was correct already. Fixes: b28a10aedcd4 ("selftests: add openat2(2) selftests") Signed-off-by: Al

[PATCH RFC v3 04/10] openat2: add CHECK_FIELDS flag to usize argument

2024-10-09 Thread Aleksa Sarai

ps://lwn.net/Articles/830666/ Signed-off-by: Aleksa Sarai --- arch/alpha/include/uapi/asm/errno.h| 3 +++ arch/mips/include/uapi/asm/errno.h | 3 +++ arch/parisc/include/uapi/asm/errno.h | 3 +++ arch/sparc/include/uapi/asm/errno.h| 3 +++ fs/open.c

[PATCH RFC v3 03/10] openat2: explicitly return -E2BIG for (usize > PAGE_SIZE)

2024-10-09 Thread Aleksa Sarai

("open: introduce openat2(2) syscall") Signed-off-by: Aleksa Sarai --- fs/open.c | 2 ++ 1 file changed, 2 insertions(+) diff --git a/fs/open.c b/fs/open.c index 22adbef7ecc2..30bfcddd505d 100644 --- a/fs/open.c +++ b/fs/open.c @@ -1458,6 +1458,8 @@ SYSCALL_DEFINE4(openat2, int, dfd,

[PATCH RFC v3 02/10] sched_getattr: port to copy_struct_to_user

2024-10-09 Thread Aleksa Sarai

sched_getattr(2) doesn't care about trailing non-zero bytes in the (ksize > usize) case, so just use copy_struct_to_user() without checking ignored_trailing. Signed-off-by: Aleksa Sarai --- kernel/sched/syscalls.c | 42 ++ 1 file changed, 2 in

[PATCH RFC v3 01/10] uaccess: add copy_struct_to_user helper

2024-10-09 Thread Aleksa Sarai

elpers to make the necessary scanning easier, but it should be fairly easy to add some if necessary. Signed-off-by: Aleksa Sarai --- include/linux/uaccess.h | 97 + 1 file changed, 97 insertions(+) diff --git a/include/linux/uaccess.h b/include/lin

[PATCH RFC v3 00/10] extensible syscalls: CHECK_FIELDS to allow for easier feature detection

2024-10-09 Thread Aleksa Sarai

row flags in the near future: * openat2 * clone3 * mount_setattr [1]: https://lwn.net/Articles/830666/ [2]: https://youtu.be/ggD-eb3yPVs Signed-off-by: Aleksa Sarai --- Changes in v3: - Fix copy_struct_to_user() return values in case of clear_user() failure. - v2: <https://lore.kernel.org/r

Re: [PATCH v3 1/2] exec: fix up /proc/pid/comm in the execveat(AT_EMPTY_PATH) case

2024-10-01 Thread Aleksa Sarai

On 2024-10-01, Aleksa Sarai wrote: > On 2024-10-01, Tycho Andersen wrote: > > From: Tycho Andersen > > > > Zbigniew mentioned at Linux Plumber's that systemd is interested in > > switching to execveat() for service execution, but can't, because the > &

Re: [PATCH v3 1/2] exec: fix up /proc/pid/comm in the execveat(AT_EMPTY_PATH) case

2024-10-01 Thread Aleksa Sarai

m for most cases and so we can revisit the argv[0] discussion in another 10 years. :D Reviewed-by: Aleksa Sarai > Signed-off-by: Tycho Andersen > Suggested-by: Zbigniew Jędrzejewski-Szmek > CC: Aleksa Sarai > Link: https://github.com/uapi-group/kernel-features#set-comm-field-before-exe

Re: [RFC PATCH 0/3] introduce PIDFD_SELF

2024-09-30 Thread Aleksa Sarai

this when running Go code in runc that did precisely this -- /proc/self gave you the wrong fd table in very specific circumstances that were annoying to debug. For languages with green-threading you can't turn off (like Go) these kinds of issues pop up surprisingly often. > We can adjust the pidfd_send_signal() call to infer the correct scope > (actually nicely we can do that without any change there, by having > __pidfd_get_pid() set f_flags accordingly). > > So TL;DR: I agree, I will respin with PIDFD_SELF referring to the thread. > > My question in return here then is - should we introduce PIDFD_SELF_PROCESS > also (do advise if you feel this naming isn't quite right) - to provide > thread group leader behaviour? Sorry to bike-shed, but to match /proc/self and /proc/thread-self, maybe they should be called PIDFD_SELF (for tgid) and PIDFD_THREAD_SELF (for current's tid)? In principle I guess users might use PIDFD_SELF by accident but if we mirror the naming with /proc/{,thread-}self that might not be that big of an issue? Just a thought. > > Thanks! > -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

[PATCH RFC v2 10/10] selftests: mount_setattr: add CHECK_FIELDS selftest

2024-09-05 Thread Aleksa Sarai

While we're at it -- to make testing for error codes with ASSERT_* easier, make the sys_* wrappers return -errno in case of an error. For some reason, the include doesn't correct import , leading to compilation errors -- so just import manually. Signed-off-by: Aleksa Sarai --- too

[PATCH RFC v2 09/10] mount_setattr: add CHECK_FIELDS flag to usize argument

2024-09-05 Thread Aleksa Sarai

mfollow_supported = ((attr.attr_clr | attr.attr_set) & MOUNT_ATTR_NOSYMFOLLOW); mountattr_idmap_supported = ((attr.attr_clr | attr.attr_set) & MOUNT_ATTR_IDMAP) && (attr.userns_fd != 0); break; } } Signed-off-by: Aleks

[PATCH RFC v2 08/10] selftests: clone3: add CHECK_FIELDS selftests

2024-09-05 Thread Aleksa Sarai

Signed-off-by: Aleksa Sarai --- tools/testing/selftests/clone3/.gitignore | 1 + tools/testing/selftests/clone3/Makefile| 4 +- .../testing/selftests/clone3/clone3_check_fields.c | 264 + 3 files changed, 267 insertions(+), 2 deletions(-) diff --git

[PATCH RFC v2 07/10] clone3: add CHECK_FIELDS flag to usize argument

2024-09-05 Thread Aleksa Sarai

roup_supported = (how.flags & CLONE_INTO_CGROUP) && (how.cgroup != 0); break; } } Signed-off-by: Aleksa Sarai --- kernel/fork.c | 30 -- 1 file changed, 28 insertions(+), 2 deletions(-) diff --git a/kernel/fo

[PATCH RFC v2 06/10] selftests: openat2: add CHECK_FIELDS selftests

2024-09-05 Thread Aleksa Sarai

Signed-off-by: Aleksa Sarai --- tools/testing/selftests/openat2/Makefile | 2 + tools/testing/selftests/openat2/openat2_test.c | 161 - 2 files changed, 161 insertions(+), 2 deletions(-) diff --git a/tools/testing/selftests/openat2/Makefile b/tools/testing

[PATCH RFC v2 05/10] selftests: openat2: add 0xFF poisoned data after misaligned struct

2024-09-05 Thread Aleksa Sarai

We should also verify that poisoned data after a misaligned struct is also handled correctly by is_zeroed_user(). This test passes with no kernel changes needed, so is_zeroed_user() was correct already. Fixes: b28a10aedcd4 ("selftests: add openat2(2) selftests") Signed-off-by: Al

[PATCH RFC v2 04/10] openat2: add CHECK_FIELDS flag to usize argument

2024-09-05 Thread Aleksa Sarai

ps://lwn.net/Articles/830666/ Signed-off-by: Aleksa Sarai --- arch/alpha/include/uapi/asm/errno.h| 3 +++ arch/mips/include/uapi/asm/errno.h | 3 +++ arch/parisc/include/uapi/asm/errno.h | 3 +++ arch/sparc/include/uapi/asm/errno.h| 3 +++ fs/open.c

[PATCH RFC v2 03/10] openat2: explicitly return -E2BIG for (usize > PAGE_SIZE)

2024-09-05 Thread Aleksa Sarai

("open: introduce openat2(2) syscall") Signed-off-by: Aleksa Sarai --- fs/open.c | 2 ++ 1 file changed, 2 insertions(+) diff --git a/fs/open.c b/fs/open.c index 22adbef7ecc2..30bfcddd505d 100644 --- a/fs/open.c +++ b/fs/open.c @@ -1458,6 +1458,8 @@ SYSCALL_DEFINE4(openat2, int, dfd,

[PATCH RFC v2 02/10] sched_getattr: port to copy_struct_to_user

2024-09-05 Thread Aleksa Sarai

sched_getattr(2) doesn't care about trailing non-zero bytes in the (ksize > usize) case, so just use copy_struct_to_user() without checking ignored_trailing. Signed-off-by: Aleksa Sarai --- kernel/sched/syscalls.c | 42 ++ 1 file changed, 2 in

[PATCH RFC v2 01/10] uaccess: add copy_struct_to_user helper

2024-09-05 Thread Aleksa Sarai

elpers to make the necessary scanning easier, but it should be fairly easy to add some if necessary. Signed-off-by: Aleksa Sarai --- include/linux/uaccess.h | 98 + 1 file changed, 98 insertions(+) diff --git a/include/linux/uaccess.h b/include/lin

[PATCH RFC v2 00/10] extensible syscalls: CHECK_FIELDS to allow for easier feature detection

2024-09-05 Thread Aleksa Sarai

row flags in the near future: * openat2 * clone3 * mount_setattr [1]: https://lwn.net/Articles/830666/ [2]: https://youtu.be/ggD-eb3yPVs Signed-off-by: Aleksa Sarai --- Changes in v2: - Add CHECK_FIELDS support to mount_setattr(2). - Fix build failure on architectures with custom errno values.

Re: [PATCH] syscalls: Document OCI seccomp filter interactions & workaround

2020-11-24 Thread Aleksa Sarai

ls > - > @@ -575,3 +609,6 @@ References and Sources > - Recommendation from Linus Torvalds that x32 system calls should prefer > compatibility with 64-bit versions rather than 32-bit versions: > https://lkml.org/lkml/2011/8/31/244 > + - Linux Configuration se

[PATCH] openat2: reject RESOLVE_BENEATH|RESOLVE_IN_ROOT

2020-10-07 Thread Aleksa Sarai

user of openat2(2) at the moment is LXC which doesn't specify both flags and so no userspace programs will break as a result. Cc: # v5.6+ Fixes: fddb5d430ad9 ("open: introduce openat2(2) syscall") Acked-by: Christian Brauner Signed-off-by: Aleksa Sarai

Re: [PATCH v2 2/2] vfs: add fchmodat2 syscall

2020-09-15 Thread Aleksa Sarai

umode_t mode, int flags); > asmlinkage long sys_fchownat(int dfd, const char __user *filename, uid_t > user, >gid_t group, int flag); > asmlinkage long sys_fchown(unsigned int fd, uid_t user, gid_t group); > diff --git a/include/uapi/asm-generic/unistd.h > b/include/uapi/asm-generic/unistd.h > index 995b36c2ea7d..ebf5cdb3f444 100644 > --- a/include/uapi/asm-generic/unistd.h > +++ b/include/uapi/asm-generic/unistd.h > @@ -859,9 +859,11 @@ __SYSCALL(__NR_openat2, sys_openat2) > __SYSCALL(__NR_pidfd_getfd, sys_pidfd_getfd) > #define __NR_faccessat2 439 > __SYSCALL(__NR_faccessat2, sys_faccessat2) > +#define __NR_fchmodat2 440 > +__SYSCALL(__NR_fchmodat2, sys_fchmodat2) > > #undef __NR_syscalls > -#define __NR_syscalls 440 > +#define __NR_syscalls 441 > > /* > * 32 bit systems traditionally used different > -- > 2.21.0 > -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: [PATCH v8 1/2] Add a "nosymfollow" mount option.

2020-08-27 Thread Aleksa Sarai

t->mnt_flags makes more sense. The restriction should apply in the context of whatever filesystem contains the symlink, and that would matches FreeBSD's semantics (at least as far as I can tell from a quick look at sys/kern/vfs_lookup.c). -- Aleksa Sarai Senior Software Engineer (Containers)

Re: [PATCH RESEND] fs: Move @f_count to different cacheline with @f_mode

2020-08-26 Thread Aleksa Sarai

rate to ASLR or other base-related randomisation. However it depends on having CONFIG_GCC_PLUGIN_RANDSTRUCT=y and I believe (at least for distribution kernels) this isn't a widely-used configuration. -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyp

Re: [PATCH] MAINTAINERS: add namespace entry

2020-08-26 Thread Aleksa Sarai

ink this is an acceptable way to talk about a peer within the kernel community -- attributing malicious intent without any justification other than "I feel this is the case" is little more than a character assassination, and I don't see why you would feel that such a statement is just

Re: [PATCH v7] Add a "nosymfollow" mount option.

2020-08-11 Thread Aleksa Sarai

n the history and motivation for this patch can be > found here: > > https://sites.google.com/a/chromium.org/dev/chromium-os/chromiumos-design-docs/hardening-against-malicious-stateful-data#TOC-Restricting-symlink-traversal Looks good. Did you plan to add an in-tree test for this (you co

Re: [PATCH] Userfaultfd: Avoid double free of userfault_ctx and remove O_CLOEXEC

2020-08-06 Thread Aleksa Sarai

On 2020-08-04, Eric Biggers wrote: > On Wed, Aug 05, 2020 at 01:47:58PM +1000, Aleksa Sarai wrote: > > On 2020-08-04, Lokesh Gidra wrote: > > > when get_unused_fd_flags returns error, ctx will be freed by > > > userfaultfd's release function, which is indirectly

Re: [PATCH] Userfaultfd: Avoid double free of userfault_ctx and remove O_CLOEXEC

2020-08-04 Thread Aleksa Sarai

; if (fd < 0) { > fput(file); > - goto out; > + return fd; > } > > ctx->owner = file_inode(file); > fd_install(fd, file); > - > -out: > - if (fd < 0) { > - mmdrop(ctx->mm); > - k

Re: [RFC][PATCH] exec: Freeze the other threads during a multi-threaded exec

2020-07-28 Thread Aleksa Sarai

lly all container workloads. -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: strace of io_uring events?

2020-07-16 Thread Aleksa Sarai

kely inform what it would mean to map ptrace events > to CQEs. So, I think they're related, and we should get seccomp hooked > up right away, and that might help us see how (if) ptrace should be > attached. We could just emulate the seccomp-bpf API with the pseudo-syscalls done as a result of CQEs, though I'm not sure how happy folks will be with this kind of glue code in "seccomp-uring" (though in theory it would allow us to attach existing filters to io_uring...). -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: [PATCH 0/5] RFC: connector: Add network namespace awareness

2020-07-13 Thread Aleksa Sarai

't > > look like the patches were ever developed further. From reading the > > discussion thread on that patch set it appears that I should be doing > > some form of polling on the /proc files. > > Recently Christian Brauner implemented pidfd complete with a poll > operation that reports when a process terminates. > > If you are willing to change your userspace code switching to pidfd > should be all that you need. While this does solve the problem of getting exit notifications in general, you cannot get the exit code. But if they don't care about that then we can solve that problem another time. :D -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: [PATCH 0/5] RFC: connector: Add network namespace awareness

2020-07-02 Thread Aleksa Sarai

In 2017, I wrote that GNU Shepherd uses cn_proc, however I'm pretty sure (looking at the code now) that it wasn't true then and isn't true now (Shepherd seems to just do basic pidfile liveliness checks). So even the niche example I used then doesn't actually use cn_proc. [1]: https://lore.kernel.org/lkml/a2fa1602-2280-c5e8-cac9-b718eaea5...@suse.de/ -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

[PATCH] symlink.7: document magic-links more completely

2020-06-09 Thread Aleksa Sarai

nat2.2. Signed-off-by: Aleksa Sarai --- man7/symlink.7 | 31 ++- 1 file changed, 22 insertions(+), 9 deletions(-) diff --git a/man7/symlink.7 b/man7/symlink.7 index 07b1db3a3764..ed99bc4236f1 100644 --- a/man7/symlink.7 +++ b/man7/symlink.7 @@ -84,6 +84,21 @@ as th

Re: seccomp feature development

2020-05-19 Thread Aleksa Sarai

On 2020-05-19, Alexei Starovoitov wrote: > On Wed, May 20, 2020 at 11:20:45AM +1000, Aleksa Sarai wrote: > > No it won't become copy_from_user(), nor will there be a TOCTOU race. > > > > The idea is that seccomp will proactively copy the struct (and > > recurs

Re: seccomp feature development

2020-05-19 Thread Aleksa Sarai

On 2020-05-19, Alexei Starovoitov wrote: > On Mon, May 18, 2020 at 7:53 PM Aleksa Sarai wrote: > > > > On 2020-05-19, Jann Horn wrote: > > > On Mon, May 18, 2020 at 11:05 PM Kees Cook wrote: > > > > ## deep argument inspection > > > > >

Re: seccomp feature development

2020-05-19 Thread Aleksa Sarai

On 2020-05-19, Aleksa Sarai wrote: > On 2020-05-19, Christian Brauner wrote: > > On Tue, May 19, 2020 at 05:09:29PM +1000, Aleksa Sarai wrote: > > > On 2020-05-18, Kees Cook wrote: > > > > - the sizes of these EA structs are, by design, growable over time. > &

Re: seccomp feature development

2020-05-19 Thread Aleksa Sarai

On 2020-05-19, Christian Brauner wrote: > On Tue, May 19, 2020 at 05:09:29PM +1000, Aleksa Sarai wrote: > > On 2020-05-18, Kees Cook wrote: > > > - the sizes of these EA structs are, by design, growable over time. > > > seccomp and its users need to be handle thi

Re: seccomp feature development

2020-05-19 Thread Aleksa Sarai

an "accept" bitmask addition without processing the attached > filters... You could use the accept mask -- take the logical and of all the filters' masks and that set is the ones you can skip and auto-accept. > - process attached filters! Each time a filter is added, have the

Re: seccomp feature development

2020-05-18 Thread Aleksa Sarai

memory; they could be inlined while loading > the program. Actually, not even the registers would have to exist in a > seccomp_data struct in memory, we could just replace the loads with > reads from the pt_regs, too.) > > > (However, this needs to be carefully considered given that seccomp_data > > is embedded in user_notif... should the EA struct from userspace also be > > copied into user_notif? More thoughts on this below...) > > > > For user_notif, I think we need something in and around these options: > > > > - make a new API that explicitly follows EA struct design > > (and while read()/write() might be easier[4], I tend to agree with > > Jann and we need to stick to ioctl(): as Tycho noted, "read/write is > > for data". Though I wonder if read() could be used for the notifications, > > which ARE data, and use ioctl() for the responses?) > > Just as a note: If we use read() there, we'll never be able to > transfer things like FDs through that API. And we run into the age-old "read() for management can be a bit hairy" problem. -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: How about just O_EXEC? (was Re: [PATCH v5 3/6] fs: Enable to enforce noexec mounts or file exec through O_MAYEXEC)

2020-05-18 Thread Aleksa Sarai

whether O_MAYEXEC is usable on a system depends on the distribution) but it means that any program which uses O_MAYEXEC cannot rely on it to provide the security guarantees they expect. Even if the program goes and reads the sysctl value, it could change underneath them. If this is just meant to be

Re: [PATCH] seccomp: Add group_leader pid to seccomp_notif

2020-05-17 Thread Aleksa Sarai

: on a new version and switch to copy_struct_from_user(). I was a little surprised when I found out that user_notif doesn't do it this way a while ago (and although in theory it is userspace's fault, ideally we could have an API that doesn't have built-in footguns). -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: [PATCH v5 0/6] Add support for O_MAYEXEC

2020-05-06 Thread Aleksa Sarai

) did not return EINVAL when > VALID_OPEN_FLAGS check failed. > Now it does, so I do not see a problem that interpreter will use > simple open(), ( Although that path might be manipulated, but file > contents will be verified by IMA) You don't get -EINVAL from open() in the case of unknown flags, that's something only openat2() does in the open*() family. Hence why it's only introduced for openat2(). > 2. When you apply a new flag to mount, it means that IMA will check > all files under this mount and it does not matter whether the file in > question is a script or not. > IMHO it is too hard overhead for performance reasons. > > Regards, > LEv -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: [PATCH v3 4/7] linux/signal.h: Ignore SIGINFO by default in new tasks

2020-04-30 Thread Aleksa Sarai

ay Devil's advocate -- pid1 has also always had a default-ignore signal mask (which included SIGPWR), so any pid1 that obeyed SIGPWR already had a non-default signal mask (and thus wouldn't be affected by this patch). But I do agree that this seems like a strange change to ma

Re: [PATCH v3 0/5] Add support for RESOLVE_MAYEXEC

2020-04-29 Thread Aleksa Sarai

patchset does[1] and I think it's a reasonable way of solving such problems. -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: [LTP] [fs] ce436509a8: ltp.openat203.fail

2020-04-28 Thread Aleksa Sarai

ing on the > E2BIG we are testing for? Does the kernel export it somewhere? No, you would have to effectively binary search on -E2BIG at the moment. CHECK_FIELDS is a proposal I have which would allow you to get get the size of the in-kernel struct, but it's still a proposal. In

[PATCH v2] cgroup: pids: use atomic64_t for pids->limit

2019-10-16 Thread Aleksa Sarai

Because pids->limit can be changed concurrently (but we don't want to take a lock because it would be needlessly expensive), use atomic64_ts instead. Fixes: commit 49b786ea146f ("cgroup: implement the PIDs subsystem") Cc: sta...@vger.kernel.org # v4.3+ Signed-off-by: Al

Re: [PATCH] cgroup: pids: use {READ,WRITE}_ONCE for pids->limit operations

2019-10-16 Thread Aleksa Sarai

On 2019-10-17, Aleksa Sarai wrote: > On 2019-10-16, Tejun Heo wrote: > > Hello, Aleksa. > > > > On Wed, Oct 16, 2019 at 07:32:19PM +1100, Aleksa Sarai wrote: > > > Maybe I'm misunderstanding Documentation/atomic_t.txt, but it looks to > > > me like

Re: [PATCH] cgroup: pids: use {READ,WRITE}_ONCE for pids->limit operations

2019-10-16 Thread Aleksa Sarai

On 2019-10-16, Tejun Heo wrote: > Hello, Aleksa. > > On Wed, Oct 16, 2019 at 07:32:19PM +1100, Aleksa Sarai wrote: > > Maybe I'm misunderstanding Documentation/atomic_t.txt, but it looks to > > me like it's explicitly saying that I shouldn't use atomic64_t i

Re: [PATCH v2] usercopy: Avoid soft lockups in test_check_nonzero_user()

2019-10-16 Thread Aleksa Sarai

AGE_SIZE the test takes multiple seconds. Instead > tweak it to only scan a 1024 byte region, but make it cross the > page boundary. > > Fixes: f5a1a536fa14 ("lib: introduce copy_struct_from_user() helper") > Suggested-by: Aleksa Sarai > Signed-off-by: Michael

Re: [PATCH] cgroup: pids: use {READ,WRITE}_ONCE for pids->limit operations

2019-10-16 Thread Aleksa Sarai

On 2019-10-14, Tejun Heo wrote: > Hello, Aleksa. > > On Tue, Oct 15, 2019 at 02:59:31AM +1100, Aleksa Sarai wrote: > > On 2019-10-14, Tejun Heo wrote: > > > On Sat, Oct 12, 2019 at 12:05:39PM +1100, Aleksa Sarai wrote: > > > > Because pids->limit can be

Re: [PATCH] cgroup: pids: use {READ,WRITE}_ONCE for pids->limit operations

2019-10-14 Thread Aleksa Sarai

On 2019-10-14, Tejun Heo wrote: > On Sat, Oct 12, 2019 at 12:05:39PM +1100, Aleksa Sarai wrote: > > Because pids->limit can be changed concurrently (but we don't want to > > take a lock because it would be needlessly expensive), use the > > appropriate memory barr

Re: [PATCH] usercopy: Avoid soft lockups in test_check_nonzero_user()

2019-10-12 Thread Aleksa Sarai

On 2019-10-12, Michael Ellerman wrote: > Aleksa Sarai writes: > > On 2019-10-11, Michael Ellerman wrote: > >> On a machine with a 64K PAGE_SIZE, the nested for loops in > >> test_check_nonzero_user() can lead to soft lockups, eg: > ... > >>

[PATCH] cgroup: pids: use {READ,WRITE}_ONCE for pids->limit operations

2019-10-11 Thread Aleksa Sarai

Because pids->limit can be changed concurrently (but we don't want to take a lock because it would be needlessly expensive), use the appropriate memory barriers. Fixes: commit 49b786ea146f ("cgroup: implement the PIDs subsystem") Cc: sta...@vger.kernel.org # v4.3+ Signed-of

Re: [PATCH 1/2] clone3: add CLONE3_CLEAR_SIGHAND

2019-10-11 Thread Aleksa Sarai

On 2019-10-11, Michael Kerrisk wrote: > Why CLONE3_CLEAR_SIGHAND rather than just CLONE_CLEAR_SIGHAND? There are no more flag bits left for the classic clone()/clone2() (the last one was used up by CLONE_PIDFD) -- thus this flag is clone3()-only. -- Aleksa Sarai Senior Software Engin

Re: [PATCH] usercopy: Avoid soft lockups in test_check_nonzero_user()

2019-10-10 Thread Aleksa Sarai

AGE_SIZE the test takes multiple seconds. Instead > tweak it to only scan a 1024 byte region, but make it cross the > page boundary. > > Fixes: f5a1a536fa14 ("lib: introduce copy_struct_from_user() helper") > Suggested-by: Aleksa Sarai > Signed-off-by: Michael Ellerman

Re: [PATCH v4 1/4] lib: introduce copy_struct_from_user() helper

2019-10-10 Thread Aleksa Sarai

On 2019-10-10, Michael Ellerman wrote: > Aleksa Sarai writes: > > A common pattern for syscall extensions is increasing the size of a > > struct passed from userspace, such that the zero-value of the new fields > > result in the old kernel behaviour (allowing for a

Re: [PATCH 3/3] bpf: use copy_struct_from_user() in bpf() syscall

2019-10-10 Thread Aleksa Sarai

oing. > Note that copy_struct_from_user() is calling min() already. So > technically, the min_t() call could go. But the size is used further > below so leave it. > > [1]: f5a1a536fa14 ("lib: introduce copy_struct_from_user() helper") > Signed-off-by: Christian Brauner

Re: [PATCH 2/3] bpf: use copy_struct_from_user() in bpf_prog_get_info_by_fd()

2019-10-10 Thread Aleksa Sarai

r() is doing. > Note that copy_struct_from_user() is calling min() already. So > technically, the min_t() call could go. But the info_len is used further > below so leave it. > > [1]: f5a1a536fa14 ("lib: introduce copy_struct_from_user() helper") > Signed-off-by: Ch

Re: [PATCH 1/3] bpf: use check_zeroed_user() in bpf_check_uarg_tail_zero()

2019-10-10 Thread Aleksa Sarai

nd-rolled version. > > [1]: f5a1a536fa14 ("lib: introduce copy_struct_from_user() helper") > Signed-off-by: Christian Brauner Acked-by: Aleksa Sarai > --- > kernel/bpf/syscall.c | 22 +++--- > 1 file changed, 7 insertions(+), 15 deletions(-) >

Re: [PATCH RFC 3/3] openat2.2: document new openat2(2) syscall

2019-10-09 Thread Aleksa Sarai

On 2019-10-09, Michael Kerrisk (man-pages) wrote: > Hello Aleksa, > > Thanks for this. It's a great piece of documentation work! > > I would prefer the path_resolution(7) piece as a separate patch. Thanks, and will do. > On 10/3/19 4:55 PM, Aleksa Sarai wrote: > >

Re: [PATCH RFC 2/3] open.2: add O_EMPTYPATH documentation

2019-10-09 Thread Aleksa Sarai

re though, and the magic-link changes (plus O_EMPTYPATH) will probably land after openat2(2) since there is some remaining work to do. > On 10/3/19 4:55 PM, Aleksa Sarai wrote: > > Some of the wording around empty paths in path_resolution(7) also needed > > to be reworked since it&#x

Re: [PATCH RFC 1/3] symlink.7: document magic-links more completely

2019-10-09 Thread Aleksa Sarai

On 2019-10-09, Michael Kerrisk (man-pages) wrote: > On 10/3/19 4:55 PM, Aleksa Sarai wrote: > > Traditionally, magic-links have not been a well-understood topic in > > Linux. Given the new changes in their semantics (related to the link > > mode of trailing magic-links),

Re: [PATCH] proc:fix confusing macro arg name

2019-10-08 Thread Aleksa Sarai

e wrong position, fix it. > > Signed-off-by: Miaohe Lin > Reviewed-by: Andrew Morton > Cc: Alexey Dobriyan > Signed-off-by: Andrew Morton Looks reasonable. Acked-by: Aleksa Sarai > --- > > include/linux/proc_fs.h | 4 ++-- > 1 file changed, 2 insertions(+), 2

Re: [PATCH RFC 1/3] symlink.7: document magic-links more completely

2019-10-07 Thread Aleksa Sarai

On 2019-10-07, Jann Horn wrote: > On Thu, Oct 3, 2019 at 4:56 PM Aleksa Sarai wrote: > > Traditionally, magic-links have not been a well-understood topic in > > Linux. Given the new changes in their semantics (related to the link > > mode of trailing magic-links),

Re: [PATCH] lib: test_user_copy: style cleanup

2019-10-06 Thread Aleksa Sarai

On 2019-10-06, Christian Brauner wrote: > On Sun, Oct 06, 2019 at 10:30:28AM +1100, Aleksa Sarai wrote: > > While writing the tests for copy_struct_from_user(), I used a construct > > that Linus doesn't appear to be too fond of: > > > > On 2019-10-04, Linus Torva

[PATCH] lib: test_user_copy: style cleanup

2019-10-05 Thread Aleksa Sarai

ssignment in test_copy_struct_from_user") Fixes: f5a1a536fa14 ("lib: introduce copy_struct_from_user() helper") Signed-off-by: Aleksa Sarai --- lib/test_user_copy.c | 15 +-- 1 file changed, 9 insertions(+), 6 deletions(-) diff --git a/lib/test_user_copy.c b/lib/te

Re: [GIT PULL] usercopy structs for v5.4-rc2

2019-10-04 Thread Aleksa Sarai

you add a layer of parentheses. You're quite right -- I was mindlessly copying the "ret |=" logic the rest of test_user_copy.c does without thinking about it. I'll include a cleanup for it in the openat2(2) series. -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: [PATCH] usercopy: Add parentheses around assignment in test_copy_struct_from_user

2019-10-03 Thread Aleksa Sarai

> > Link: https://github.com/ClangBuiltLinux/linux/issues/731 > > Signed-off-by: Nathan Chancellor > > > > I'll take this. Aleksa, can I get your ack too, please? > > Acked-by: Christian Brauner Acked-by: Aleksa Sarai -- Aleksa Sarai Senior Software Engineer (Containers) SUSE Linux GmbH <https://www.cyphar.com/> signature.asc Description: PGP signature

Re: [PATCH] Documentation: update about adding syscalls

2019-10-03 Thread Aleksa Sarai

rmation on how to ensure that syscalls with structure > > arguments are extensible and add a section about naming conventions to > > follow when adding revised versions of already existing syscalls. > > > > Co-Developed-by: Aleksa Sarai > > Signed

Re: [PATCH RFC 3/3] openat2.2: document new syscall

2019-10-03 Thread Aleksa Sarai

Ignore this one (it's an older version of the openat2.2 patch) -- I sent it by accident. On 2019-10-04, Aleksa Sarai wrote: > Signed-off-by: Aleksa Sarai > --- > man2/open.2| 5 + > man2/openat2.2 | 381 + > ma

[PATCH RFC 0/3] document openat2(2) patch series

2019-10-03 Thread Aleksa Sarai

0190-1-cyp...@cyphar.com/ Aleksa Sarai (3): symlink.7: document magic-links more completely open.2: add O_EMPTYPATH documentation openat2.2: document new openat2(2) syscall man2/open.2| 47 - man2/openat2.2 | 381 + man7/path_re

[PATCH RFC 3/3] openat2.2: document new syscall

2019-10-03 Thread Aleksa Sarai

Signed-off-by: Aleksa Sarai --- man2/open.2| 5 + man2/openat2.2 | 381 + man7/path_resolution.7 | 57 -- 3 files changed, 426 insertions(+), 17 deletions(-) create mode 100644 man2/openat2.2 diff --git a/man2/open.2 b/man2

[PATCH RFC 3/3] openat2.2: document new openat2(2) syscall

2019-10-03 Thread Aleksa Sarai

or common errors. Signed-off-by: Aleksa Sarai --- man2/open.2| 5 + man2/openat2.2 | 381 + man7/path_resolution.7 | 57 -- 3 files changed, 426 insertions(+), 17 deletions(-) create mode 100644 man2/openat2.2 diff --git a/man2

[PATCH RFC 1/3] symlink.7: document magic-links more completely

2019-10-03 Thread Aleksa Sarai

Traditionally, magic-links have not been a well-understood topic in Linux. Given the new changes in their semantics (related to the link mode of trailing magic-links), it seems like a good opportunity to shine more light on magic-links and their semantics. Signed-off-by: Aleksa Sarai --- man7

[PATCH RFC 2/3] open.2: add O_EMPTYPATH documentation

2019-10-03 Thread Aleksa Sarai

Some of the wording around empty paths in path_resolution(7) also needed to be reworked since it's now legal (if you pass O_EMPTYPATH). Signed-off-by: Aleksa Sarai --- man2/open.2| 42 +- man7/path_resolution.7 | 17 - 2

[PATCH v4 3/4] sched_setattr: switch to copy_struct_from_user()

2019-09-30 Thread Aleksa Sarai

could come up with a more sane idea for how the syscall interface should look. [1]: commit 1251201c0d34 ("sched/core: Fix uclamp ABI bug, clean up and robustify sched_read_attr() ABI logic and code") Reviewed-by: Kees Cook Signed-off-by: Aleksa Sarai --- kernel/sched/c

[PATCH v4 4/4] perf_event_open: switch to copy_struct_from_user()

2019-09-30 Thread Aleksa Sarai

The change is very straightforward, and helps unify the syscall interface for struct-from-userspace syscalls. Reviewed-by: Kees Cook Signed-off-by: Aleksa Sarai --- kernel/events/core.c | 47 +--- 1 file changed, 9 insertions(+), 38 deletions(-) diff

[PATCH v4 2/4] clone3: switch to copy_struct_from_user()

2019-09-30 Thread Aleksa Sarai

The change is very straightforward, and helps unify the syscall interface for struct-from-userspace syscalls. Additionally, explicitly define CLONE_ARGS_SIZE_VER0 to match the other users of the struct-extension pattern. Reviewed-by: Kees Cook Signed-off-by: Aleksa Sarai --- include/uapi/linux

[PATCH v4 1/4] lib: introduce copy_struct_from_user() helper

2019-09-30 Thread Aleksa Sarai

Villemoes Signed-off-by: Aleksa Sarai --- include/linux/bitops.h | 7 +++ include/linux/uaccess.h | 70 + lib/strnlen_user.c | 8 +-- lib/test_user_copy.c| 136 ++-- lib/usercopy.c | 55 5 files ch

[PATCH v4 0/4] lib: introduce copy_struct_from_user() helper

2019-09-30 Thread Aleksa Sarai

s to copy_struct_from_user() while rt_sigprocmask(2) always rejects differently-sized struct arguments. Aleksa Sarai (4): lib: introduce copy_struct_from_user() helper clone3: switch to copy_struct_from_user() sched_setattr: switch to copy_struct_from_user() perf_event_open: switch to copy_

1 2 3 4 5 6 >

1 - 100 of 541 matches

Mail list logo