CLOUDP-347194 - enable Pod Security Admission at restricted level
#473
base: master
Conversation
```go
SecurityContext: &corev1.SecurityContext{
	ReadOnlyRootFilesystem:   ptr.To(true),
	AllowPrivilegeEscalation: ptr.To(false),
	Capabilities: &corev1.Capabilities{
```
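For reviewers skimming the diff, a hedged sketch of the container `securityContext` these Go defaults roughly render to; the `Capabilities` struct is truncated above, so the `drop: [ALL]` value is assumed from the changelog entry further down.

```yaml
# Approximate rendered container securityContext (a sketch, not copied from
# operator output); drop: [ALL] is assumed from the changelog entry below.
securityContext:
  readOnlyRootFilesystem: true
  allowPrivilegeEscalation: false
  capabilities:
    drop:
      - ALL
```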
Is there any potential that adding this default will break a customer's workload (or rather prevent the operator from deploying, or the workload StatefulSet from restarting) and require some manual intervention? Just thinking about our semver guarantees.
I think this is our own deployment, which we manage. If the customer wants a `managedSecurityContext` they are allowed to use one, but otherwise we should be able to modify the one we provide.
These are our defaults. The only problem I see with this is that some customers may now need to set capabilities explicitly.
Maybe this should be considered a security fix? In that case we should be able to overwrite our defaults if they are not secure, even if this forces customers to explicitly specify custom capabilities. What do you think?
Still, do we force customers (who don't care about it) to do any manual fix when upgrading? If yes, we need to bump the major version.
I don't understand what we would break here. We are changing our default `SecurityContext` for the operator and the other pods we create. If a customer wants a dedicated `SecurityContext` or `PodSecurityContext`, they need to set the `MANAGED_SECURITY_CONTEXT` env var and our defaults will be completely overwritten. If they don't set `MANAGED_SECURITY_CONTEXT`, every change they make to the `SecurityContext` manually will be overwritten by our defaults.
Code that handles securityContext settings:
mongodb-kubernetes/mongodb-community-operator/pkg/kube/podtemplatespec/podspec_template.go
Lines 312 to 322 in 917723b
```go
func WithDefaultSecurityContextsModifications() (Modification, container.Modification) {
	managedSecurityContext := envvar.ReadBool(ManagedSecurityContextEnv) // nolint:forbidigo
	configureContainerSecurityContext := container.NOOP()
	configurePodSpecSecurityContext := NOOP()
	if !managedSecurityContext {
		configurePodSpecSecurityContext = WithSecurityContext(DefaultPodSecurityContext())
		configureContainerSecurityContext = container.WithSecurityContext(container.DefaultSecurityContext())
	}
	return configurePodSpecSecurityContext, configureContainerSecurityContext
}
```
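For context, a minimal sketch of how a customer could opt out of these defaults by setting the env var read above on the operator Deployment; the deployment and container names are illustrative, and Helm users would set the equivalent chart value.

```yaml
# Illustrative operator Deployment fragment (not a complete manifest).
# Setting MANAGED_SECURITY_CONTEXT makes the operator skip its default
# Pod/container securityContext modifications, per the code above.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: mongodb-kubernetes-operator        # illustrative name
spec:
  template:
    spec:
      containers:
        - name: mongodb-kubernetes-operator   # illustrative name
          env:
            - name: MANAGED_SECURITY_CONTEXT
              value: "true"
```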
We have discussed with @lsierant that the change to the more strict `Capabilities` is only applied to the db/om containers, not the whole Pod. This will not affect other containers in the Pod, i.e. security or istio sidecars that the customer may have. The only change on the Pod level is adding `seccompProfile: type: RuntimeDefault`. We can do two things with it (both options are sketched below):
- move the `seccompProfile: type: RuntimeDefault` to container level and don't specify it on pod level. Our containers would have secure seccomp settings, but if the customer adds any sidecar to the Pod it would not have seccomp settings applied
- keep it as is and secure the entire Pod
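To make the two options concrete, a hedged sketch of where the field would live in the generated pod template; container names are illustrative fragments, not the actual StatefulSet output.

```yaml
# Option 1: container-level only - our containers are covered, but a
# customer-added sidecar without its own setting keeps the runtime default
# (no seccomp profile applied by us).
spec:
  containers:
    - name: mongodb-agent            # illustrative container name
      securityContext:
        seccompProfile:
          type: RuntimeDefault
    - name: customer-sidecar         # illustrative; no seccompProfile applied
---
# Option 2: pod-level (keep as is) - every container in the Pod, including
# customer sidecars, inherits RuntimeDefault.
spec:
  securityContext:
    seccompProfile:
      type: RuntimeDefault
  containers:
    - name: mongodb-agent            # illustrative container name
```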
@mircea-cosbuc looking for guidance here on how to proceed.
I think it's best to set it at the pod level. Based on @lsierant's comment, this needs clarity on what customers might need to change on upgrade (if anything), outlining those scenarios and deciding whether it's a breaking change.
I've checked the consequences of using `seccompProfile: type: RuntimeDefault`: it defaults to whatever profile the container runtime in use provides. containerd and Docker, for example, have very similar default seccomp profiles -> https://docs.docker.com/engine/security/seccomp/#significant-syscalls-blocked-by-the-default-profile
Based on what I have found in official Kubernetes docs:
> These profiles may differ between runtimes like CRI-O or containerd. They also differ for its used hardware architectures. But generally speaking, those default profiles allow a common amount of syscalls while blocking the more dangerous ones, which are unlikely or unsafe to be used in a containerized application.
Additionally, on Red Hat OpenShift Container Platform, `RuntimeDefault` is often enforced by default via Security Context Constraints (SCCs).
To summarise, it is unlikely that users of our Operator require more syscall permissions in MongoDB workloads than what the `RuntimeDefault` seccomp profile allows. Nevertheless, I should add a comment in the changelog on how to relax the `securityContext` defaults by using `managedSecurityContext`.
@lsierant @mircea-cosbuc let me know if that justifies approving the PR. I have already edited the changelog.
LGTM!
* To follow the [Pod Security Standards](https://v1-32.docs.kubernetes.io/docs/concepts/security/pod-security-standards/) more secure default pod `securityContext` settings were added.
  Operator deployment `securityContext` settings that have changed:
  - `allowPrivilegeEscalation: false`
  - `capabilities.drop: [ ALL ]`
  - `seccompProfile.type: RuntimeDefault`

  Other workloads:
  - `capabilities.drop: [ ALL ]` - container level
  - `seccompProfile.type: RuntimeDefault` - pod level

  > **Note**: If you require less restrictive `securityContext` settings please use `template` or `podTemplate` overrides.
  > Detailed information about overrides can be found in [Modify Ops Manager or MongoDB Kubernetes Resource Containers](https://www.mongodb.com/docs/kubernetes/current/tutorial/modify-resource-image/).
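As an illustration of the note above, a hedged sketch of a `podTemplate` override that relaxes the container defaults; the resource kind, field path, container name, and chosen capability are assumptions here, so the linked docs should be checked for the exact structure of each resource type.

```yaml
# Illustrative override adding back a capability dropped by the new defaults.
apiVersion: mongodb.com/v1
kind: MongoDB
metadata:
  name: my-replica-set                        # illustrative name
spec:
  # ...other required fields (version, type, members) omitted for brevity
  podSpec:
    podTemplate:
      spec:
        containers:
          - name: mongodb-enterprise-database   # container name is an assumption
            securityContext:
              capabilities:
                add: ["NET_BIND_SERVICE"]       # example capability only
```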
LGTM!
…nit` containers (#474)

# Summary

We are using `Istio` as a service mesh provider for our Multi Cluster tests. The way it works by default is that `Istio` adds a privileged `istio-init` container to every Pod, which configures the network accordingly.

> By default Istio injects an init container, istio-init, in pods deployed in the mesh. The istio-init container sets up the pod network traffic redirection to/from the Istio sidecar proxy. This requires the user or service-account deploying pods to the mesh to have sufficient Kubernetes RBAC permissions to deploy [containers with the NET_ADMIN and NET_RAW capabilities](https://kubernetes.io/docs/tasks/configure-pod-container/security-context/#set-capabilities-for-a-container).

While this works fine, it does not meet the [PSS](https://v1-32.docs.kubernetes.io/docs/concepts/security/pod-security-standards/) restricted level, which makes it less secure. Related: [HELP-81729](https://jira.mongodb.org/browse/HELP-81729) and #473, which enables the `restricted` level in `warn` mode. Additionally, we provide the Istio sidecar configuration as an example in our code snippets, thus not following the best practice.

There is another way to configure an Istio mesh that does not require the `istio-init` init-container: using the [Istio CNI node agent](https://istio.io/latest/docs/setup/additional-setup/cni/#using-the-istio-cni-node-agent). This PR configures our e2e tests and code snippets that way. A great blog entry about the difference between the `istio-init` and Istio CNI node agent architectures -> https://www.solo.io/blog/traffic-ambient-mesh-istio-cni-node-configuration

With `istio-init`: (architecture diagram: https://github.com/user-attachments/assets/026350af-3b51-4fe9-9cb8-c8911e661eca)

With `Istio CNI node agent`: (architecture diagram: https://github.com/user-attachments/assets/37733169-7737-4063-90a0-de3d116402a9)

⚠️ Init containers execute before the sidecar proxy starts, which can result in traffic loss during their execution. This can be avoided by setting `runAsUser: 1337`. More info -> https://istio.io/latest/docs/setup/additional-setup/cni/#compatibility-with-application-init-containers

## Proof of Work

Passing CI is enough. Since `private_gke_code_snippets` are not run automatically in CI, I've triggered a manual patch to test this -> https://spruce.mongodb.com/version/68d50e694baed3000742566d/tasks?sorts=STATUS%3AASC%3BBASE_STATUS%3ADESC

## Checklist

- [ ] Have you linked a jira ticket and/or is the ticket in the title?
- [x] Have you checked whether your jira ticket required DOCSP changes?
- [x] Have you added changelog file?
  - use `skip-changelog` label if not needed
  - refer to [Changelog files and Release Notes](https://github.com/mongodb/mongodb-kubernetes/blob/master/CONTRIBUTING.md#changelog-files-and-release-notes) section in CONTRIBUTING.md for more details
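For readers unfamiliar with the CNI setup, a hedged sketch of enabling the Istio CNI node agent through an `IstioOperator` resource; whether our tests use this resource or the equivalent `istioctl --set` flag is not shown here, so treat it as an assumption.

```yaml
# Illustrative IstioOperator config enabling the CNI node agent, so sidecar
# traffic redirection no longer needs a privileged istio-init container with
# NET_ADMIN/NET_RAW capabilities in every application Pod.
apiVersion: install.istio.io/v1alpha1
kind: IstioOperator
spec:
  components:
    cni:
      enabled: true
```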
# Summary

Based on the HELP-81729 ticket I investigated whether our workloads align with the `restricted` Pod Security Standards security level. Unfortunately, we cannot easily test enforcing the rules and guarantee meeting the restricted profile. This is mainly because of how our e2e tests are set up. For example, we are using Istio, which adds `istio-init` containers to provide service mesh network capabilities, and Istio containers do not follow the restricted profile. Our `tests` pod also does not follow the PSS requirements. There are also other issues we have faced when testing the enforcement; this requires more time allocation, and we cannot promise timelines and priorities.

Because of this I have enabled the `warn` mode for the `restricted` security level instead of `enforce`. For one complex test, `e2e_om_ops_manager_backup_sharded_cluster`, I have enforced the `restricted` level, and only in single cluster, so that we can monitor our PSS alignment. More about levels and modes can be found here.
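For reference, a minimal sketch of the namespace labels this maps to; the namespace name is illustrative.

```yaml
# Pod Security Admission in warn mode: violations of the restricted profile are
# reported as warnings without blocking Pod creation. Switching "warn" to
# "enforce" is what the single enforced test does.
apiVersion: v1
kind: Namespace
metadata:
  name: mongodb-test                               # illustrative namespace name
  labels:
    pod-security.kubernetes.io/warn: restricted
    pod-security.kubernetes.io/warn-version: latest
```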
## Proof of Work

Passing CI for all tests that have the warning + passing enforcement for the `e2e_om_ops_manager_backup_sharded_cluster` test.

Example warning from the `e2e_om_ops_manager_backup_tls` test:
## Checklist

- Have you added changelog file?
  - use `skip-changelog` label if not needed