Hacker News

Anyone know if it's possible to update the Linux kernel so that namespaces are hard security boundaries? I wonder what that would entail.


When we speak of 'hard security boundaries', most people in this space are comparing containers to existing hardware-backed isolation such as virtual machines. There are many container escapes each year because the chunk of API that containers are required to cover is so large, but more importantly because they have no isolation at the CPU level (e.g. Intel VT-x instructions such as VMREAD, VMWRITE, VMLAUNCH, VMXOFF, VMXON).

This is what the entire public cloud is built on. You rarely read articles about someone breaking VM isolation on AWS and spying on the other tenants on the same server.


> There are many container escapes each year because the chunk of api that they are required to cover is so large

What API? The kernel syscall API?

If we assume for a moment that there are no bugs in the Linux namespace implementation, would containers be as safe as virtual machines?


No. As I'm writing this response, Qualys just announced three new bypasses today: https://seclists.org/oss-sec/2025/q1/253


Sorry, can you elaborate? Your answer is not really clear. Why is it not possible for Linux namespaces to be secure?


> This is what the entire public cloud is built on.

Well... the entire public cloud except Azure. They've been caught multiple times with vulnerabilities stemming from a lack of hardware-backed isolation between tenants.


Azure has the same level of isolation for VMs at a hardware level as AWS.


How Azure isolates VMs is completely unrelated, because containers are not VMs. And if you meant to assert that Azure uses hardware-assisted isolation between tenants in general, that was not the case for Azurescape [1] or ChaosDB [2].

[1] https://unit42.paloaltonetworks.com/azure-container-instance...

[2] https://www.wiz.io/blog/chaosdb-explained-azures-cosmos-db-v...


It is the case for VMs that customers create.

It hasn't always been the case for managed services, but I don't think that's true for AWS either.


Unmanaged VMs created directly by customers still aren't relevant to this discussion. The whole point here is that everyone else uses some form of hardware-assisted isolation between tenants, even in managed services that vend containers or other higher-order compute primitives (i.e. Lambda, Cloud Functions, and hosted notebooks/shells).

Between first- and second-hand experience I can confidently say that, at a bare minimum, the majority of managed services at AWS, GCP, and even OCI use VMs to isolate tenant workloads. I'm less sure about OCI, but at least in GCP and AWS, the security teams that review your service will assume customers will break out of containers no matter how the container's capabilities, permissions, and configs are locked down.


A lot of use cases don't want that, though. It's nice having lightweight network namespaces, for example, just to separate the network stack for tunneling while X and Wayland keep working fine with the applications running there.


Have a look at gVisor for one approach.



