The proposed "netdevices" field provides a declarative way to specify which host network devices should be moved into a container's network namespace. This approach is similar than the existing "devices" field used for block devices but uses a dictionary keyed by the interface name instead. The proposed scheme is based on the existing representation of network device by the `struct net_device` https://docs.kernel.org/networking/netdevices.html. This proposal focuses solely on moving existing network devices into the container namespace. It does not cover the complexities of network configuration or network interface creation, emphasizing the separation of device management and network configuration. Signed-off-by: Antonio Ojea <aojea@google.com>
7.6 KiB
Linux Features Structure
This document describes the Linux-specific section of the Features structure.
Namespaces
namespaces
(array of strings, OPTIONAL) The recognized names of the namespaces, including namespaces that might not be supported by the host operating system. The runtime MUST recognize the elements in this array as thetype
oflinux.namespaces
objects inconfig.json
.
Example
"namespaces": [
"cgroup",
"ipc",
"mount",
"network",
"pid",
"user",
"uts"
]
Capabilities
capabilities
(array of strings, OPTIONAL) The recognized names of the capabilities, including capabilities that might not be supported by the host operating system. The runtime MUST recognize the elements in this array in theprocess.capabilities
object ofconfig.json
.
Example
"capabilities": [
"CAP_CHOWN",
"CAP_DAC_OVERRIDE",
"CAP_DAC_READ_SEARCH",
"CAP_FOWNER",
"CAP_FSETID",
"CAP_KILL",
"CAP_SETGID",
"CAP_SETUID",
"CAP_SETPCAP",
"CAP_LINUX_IMMUTABLE",
"CAP_NET_BIND_SERVICE",
"CAP_NET_BROADCAST",
"CAP_NET_ADMIN",
"CAP_NET_RAW",
"CAP_IPC_LOCK",
"CAP_IPC_OWNER",
"CAP_SYS_MODULE",
"CAP_SYS_RAWIO",
"CAP_SYS_CHROOT",
"CAP_SYS_PTRACE",
"CAP_SYS_PACCT",
"CAP_SYS_ADMIN",
"CAP_SYS_BOOT",
"CAP_SYS_NICE",
"CAP_SYS_RESOURCE",
"CAP_SYS_TIME",
"CAP_SYS_TTY_CONFIG",
"CAP_MKNOD",
"CAP_LEASE",
"CAP_AUDIT_WRITE",
"CAP_AUDIT_CONTROL",
"CAP_SETFCAP",
"CAP_MAC_OVERRIDE",
"CAP_MAC_ADMIN",
"CAP_SYSLOG",
"CAP_WAKE_ALARM",
"CAP_BLOCK_SUSPEND",
"CAP_AUDIT_READ",
"CAP_PERFMON",
"CAP_BPF",
"CAP_CHECKPOINT_RESTORE"
]
Cgroup
cgroup
(object, OPTIONAL) represents the runtime's implementation status of cgroup managers.
Irrelevant to the cgroup version of the host operating system.
v1
(bool, OPTIONAL) represents whether the runtime supports cgroup v1.v2
(bool, OPTIONAL) represents whether the runtime supports cgroup v2.systemd
(bool, OPTIONAL) represents whether the runtime supports system-wide systemd cgroup manager.systemdUser
(bool, OPTIONAL) represents whether the runtime supports user-scoped systemd cgroup manager.rdma
(bool, OPTIONAL) represents whether the runtime supports RDMA cgroup controller.
Example
"cgroup": {
"v1": true,
"v2": true,
"systemd": true,
"systemdUser": true,
"rdma": false
}
Seccomp
seccomp
(object, OPTIONAL) represents the runtime's implementation status of seccomp.
Irrelevant to the kernel version of the host operating system.
enabled
(bool, OPTIONAL) represents whether the runtime supports seccomp.actions
(array of strings, OPTIONAL) The recognized names of the seccomp actions. The runtime MUST recognize the elements in this array in thesyscalls[].action
property of thelinux.seccomp
object inconfig.json
.operators
(array of strings, OPTIONAL) The recognized names of the seccomp operators. The runtime MUST recognize the elements in this array in thesyscalls[].args[].op
property of thelinux.seccomp
object inconfig.json
.archs
(array of strings, OPTIONAL) The recognized names of the seccomp architectures. The runtime MUST recognize the elements in this array in thearchitectures
property of thelinux.seccomp
object inconfig.json
.knownFlags
(array of strings, OPTIONAL) The recognized names of the seccomp flags. The runtime MUST recognize the elements in this array in theflags
property of thelinux.seccomp
object inconfig.json
.supportedFlags
(array of strings, OPTIONAL) The recognized and supported names of the seccomp flags. This list may be a subset ofknownFlags
due to some flags not supported by the current kernel and/or libseccomp. The runtime MUST recognize and support the elements in this array in theflags
property of thelinux.seccomp
object inconfig.json
.
Example
"seccomp": {
"enabled": true,
"actions": [
"SCMP_ACT_ALLOW",
"SCMP_ACT_ERRNO",
"SCMP_ACT_KILL",
"SCMP_ACT_LOG",
"SCMP_ACT_NOTIFY",
"SCMP_ACT_TRACE",
"SCMP_ACT_TRAP"
],
"operators": [
"SCMP_CMP_EQ",
"SCMP_CMP_GE",
"SCMP_CMP_GT",
"SCMP_CMP_LE",
"SCMP_CMP_LT",
"SCMP_CMP_MASKED_EQ",
"SCMP_CMP_NE"
],
"archs": [
"SCMP_ARCH_AARCH64",
"SCMP_ARCH_ARM",
"SCMP_ARCH_MIPS",
"SCMP_ARCH_MIPS64",
"SCMP_ARCH_MIPS64N32",
"SCMP_ARCH_MIPSEL",
"SCMP_ARCH_MIPSEL64",
"SCMP_ARCH_MIPSEL64N32",
"SCMP_ARCH_PPC",
"SCMP_ARCH_PPC64",
"SCMP_ARCH_PPC64LE",
"SCMP_ARCH_S390",
"SCMP_ARCH_S390X",
"SCMP_ARCH_X32",
"SCMP_ARCH_X86",
"SCMP_ARCH_X86_64"
],
"knownFlags": [
"SECCOMP_FILTER_FLAG_LOG"
],
"supportedFlags": [
"SECCOMP_FILTER_FLAG_LOG"
]
}
AppArmor
apparmor
(object, OPTIONAL) represents the runtime's implementation status of AppArmor.
Irrelevant to the availability of AppArmor on the host operating system.
enabled
(bool, OPTIONAL) represents whether the runtime supports AppArmor.
Example
"apparmor": {
"enabled": true
}
SELinux
selinux
(object, OPTIONAL) represents the runtime's implementation status of SELinux.
Irrelevant to the availability of SELinux on the host operating system.
enabled
(bool, OPTIONAL) represents whether the runtime supports SELinux.
Example
"selinux": {
"enabled": true
}
Intel RDT
intelRdt
(object, OPTIONAL) represents the runtime's implementation status of Intel RDT.
Irrelevant to the availability of Intel RDT on the host operating system.
enabled
(bool, OPTIONAL) represents whether the runtime supports Intel RDT.
Example
"intelRdt": {
"enabled": true
}
MountExtensions
mountExtensions
(object, OPTIONAL) represents whether the runtime supports certain mount features, irrespective of the availability of the features on the host operating system.
idmap
(object, OPTIONAL) represents whether the runtime supports idmap mounts using theuidMappings
andgidMappings
properties of the mount.enabled
(bool, OPTIONAL) represents whether the runtime parses and attempts to use theuidMappings
andgidMappings
properties of mounts if provided. Note that it is possible for runtimes to have partial implementations of id-mapped mounts support (such as only allowing mounts which have mappings matching the container's user namespace, or only allowing the id-mapped bind-mounts). In such cases, runtimes MUST still set this value totrue
, to indicate that the runtime recognises theuidMappings
andgidMappings
properties.
Example
"mountExtensions": {
"idmap":{
"enabled": true
}
}
NetDevices
netDevices
(object, OPTIONAL) represents the runtime's implementation status of Linux network devices.
enabled
(bool, OPTIONAL) represents whether the runtime supports the capability to move Linux network devices into the container's network namespace.
Example
"netDevices": {
"enabled": true
}