ch09

2025-01-01 15:38:42 +01:00 · 2025-01-01 15:38:42 +01:00 · 3715a3b62b
parent f4f6fa119a
commit 3715a3b62b
9 changed files with 148 additions and 0 deletions
--- a/09_secure_virtualised_workloads.md
+++ b/09_secure_virtualised_workloads.md
@@ -0,0 +1,148 @@
 # Secure virtualised workloads
 * **workload**: any job/payload that needs to be executed on infrastructure
    * straight on OS
    * in a VM
    * in a container
 ## Virtual machines
 ![VM architecture](./img/ch09/vm_diagram.png)
 * physical hardware
    * CPU, memory, chipset, I/O...
    * resources often underutilized
    * no isolation
 * hardware-level abstraction
    * virtual hardware
    * encapsulate all OS and application state
 * virtualization software
    * hypervisor/VMM
    * extra level of indirection to decouple hardware and OS
    * strong isolation between VMs
    * improves utilization
 * secure multiplexing
    * isolation on hardware level
    * failure of one VM does not affect others
 * entire VM is a file
    * easy to snapshot, clone, move, distribute
 * create once, run anywhere (well we try)
 * types
    * **type 1**: hypervisor runs on bare metal (no host OS) (VMware ESXi, Microsoft
      Hyper-V, KVM...)
    * **type 2**: hypervisor runs on host OS (VirtualBox, VMware Workstation...)
        * relies on host OS to manage calls to hardware
        * adds latency
        * security risks of host OS exploitable
        * aimed towards developers
 ![Type 1 virtualisation](./img/ch09/type_1_hypervisor.png){width=50%} \ ![Type 2 virtualisation](./img/ch09/type_2_hypervisor.png){width=50%}
 ## Containers
 * virtualization on OS level
 * much more lightweight -> more dense utilization
 * share same host OS / kernel
 * advantages
    * much faster startup
    * easier to manage
    * more containers per host than VMs
 * no hardware-enforced isolation: all containers share one kernel, so a kernel exploit can compromise the host
 * the future
    * blur the line between containers and VMs
    * **Kata-containers**: lightweight VM per container (better security)
    * **Microsoft Hyper-V**: can wrap containers in a lightweight VM (Hyper-V isolation)
 * Linux Security Modules (LSM)
    * hostile processes can break out of container (badly configured
      namespaces, kernel exploits...)
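    * LSM framework hooks into the kernel to enforce mandatory access control (e.g. SELinux, AppArmor)
    * policy lists what each process is allowed to do (files, capabilities, network...)
    * defined by sysadmin
    * often combined with seccomp syscall filters, which prevent niche syscalls from being exploited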
 * types
    * **OS-level containerization**: spawn containers straight on host OS + kernel
        * isolation using kernel functionality (namespaces, cgroups...)
        * no need for full guest OS
        * no hardware extensions
        * attackers could escape container and compromise host
        * Docker
    * **micro-VM**: containers in lightweight VMs on host
        * utilizes hardware-enforced isolation
        * containers do not share kernel
        * safer
        * slower startup, worse performance
    * **unikernel**: application compiled together with tailored kernel
        * monitor which syscalls the application uses
        * once known, construct tailored kernel and fixed-purpose image
        * no user space, only kernel space
        * much smaller attack surface (kernel only contains what's necessary)
        * runs straight on hypervisor or bare metal
        * small footprint, quick to start
    * **sandboxing**: container in sandbox running copy of host kernel
        * syscalls translated to host kernel
        * good isolation
        * slow
        * not all syscalls supported (yet)
 ![Container layout](./img/ch09/container.png){width=50%} \ ![Micro-VM layout](./img/ch09/micro_vm.png){width=50%}
 ![Unikernel layout](./img/ch09/unikernel.png){width=50%} \ ![Sandbox layout](./img/ch09/sandbox.png){width=50%}
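
The allowlist idea that recurs above (observe which syscalls an application really uses, then deny everything else) can be sketched as a toy model. This is plain Python and never talks to the kernel; on a real system such filtering is enforced inside the kernel, for example by seccomp filters. The function names `build_allowlist` and `is_allowed` and the sample trace are hypothetical and only serve the illustration.

```python
from typing import FrozenSet, Iterable


def build_allowlist(observed_syscalls: Iterable[str]) -> FrozenSet[str]:
    """Collect the distinct syscall names seen while profiling an application."""
    return frozenset(observed_syscalls)


def is_allowed(syscall: str, allowlist: FrozenSet[str]) -> bool:
    """Default-deny: anything not observed during profiling is rejected."""
    return syscall in allowlist


# Hypothetical trace recorded during a profiling run (made-up data).
trace = ["read", "write", "openat", "read", "close", "write"]
allowlist = build_allowlist(trace)

for call in ("read", "ptrace", "mount"):
    verdict = "allow" if is_allowed(call, allowlist) else "deny"
    print(f"{call}: {verdict}")
```

The default-deny design is the point: a syscall that never showed up during profiling is rejected even if nobody thought to blocklist it.
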
 ## Linux kernel isolation support
 * <https://linuxcontainers.org/>
 * built into Linux kernel
 * LXC (Linux Containers)
    * OS-level virtualization for running containers on Linux host
    * low-level, difficult to use
 * LXD (Linux Container Hypervisor)
    * built on top of LXC
    * Canonical development
    * focus on containerising entire operating systems, not individual applications
 ### Cgroups
 * control groups
 * Linux feature to separate processes into groups
    * resource limiting e.g. cpu shares
    * prioritization e.g. cpu pinning
    * device access
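
As a rough illustration of resource limiting and CPU shares, the sketch below interprets two cgroup v2 interface files: `cpu.max` (a hard limit written as `<quota> <period>` in microseconds, or `max` for no limit) and `cpu.weight` (a proportional share among sibling groups). It assumes a cgroup v2 host, the input strings are made-up sample values rather than data read from a real machine, and the helper names `parse_cpu_max` and `cpu_share` are hypothetical.

```python
from typing import List, Optional


def parse_cpu_max(text: str) -> Optional[float]:
    """Return the CPU limit in cores, or None when the group is unlimited.

    cgroup v2 writes the limit as "<quota> <period>" in microseconds and
    uses the literal word "max" as quota when no limit is set.
    """
    quota, period = text.split()
    if quota == "max":
        return None
    return int(quota) / int(period)


def cpu_share(weight: int, sibling_weights: List[int]) -> float:
    """Fraction of contended CPU time a group gets relative to its siblings."""
    return weight / (weight + sum(sibling_weights))


# Made-up sample values, not read from a real system.
print(parse_cpu_max("50000 100000"))   # half a core
print(parse_cpu_max("max 100000"))     # no hard limit
print(cpu_share(100, [100, 200]))      # share when all siblings are busy
```

The weight only matters under contention: an otherwise idle host lets a low-weight group use the whole CPU, while the `cpu.max` quota applies regardless of load.
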
 ### Namespaces
 * provide isolated view of global resources for a group of processes
    * only see other processes in the same namespace
    * only see allowed devices, users, file system...
    * 2 PIDs: global one and one within namespace
    * own root file system (copy of host root)
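
The "2 PIDs" point can be made concrete with a small parsing sketch. On Linux, `/proc/<pid>/status` carries an `NSpid` field listing the process ID as seen from each nested PID namespace, and the symlinks under `/proc/<pid>/ns/` read like `pid:[4026531836]`, where the number identifies the namespace. The sketch below works on made-up sample strings instead of a live `/proc`, and the helper names `parse_nspid` and `parse_ns_link` are hypothetical.

```python
from typing import List, Tuple


def parse_nspid(status_text: str) -> List[int]:
    """Extract the NSpid field from the text of /proc/<pid>/status.

    The list runs from the outermost visible PID namespace to the innermost.
    """
    for line in status_text.splitlines():
        if line.startswith("NSpid:"):
            return [int(field) for field in line.split()[1:]]
    return []


def parse_ns_link(link: str) -> Tuple[str, int]:
    """Split a namespace symlink target such as 'pid:[4026531836]'."""
    kind, _, rest = link.partition(":")
    return kind, int(rest.strip("[]"))


# Made-up excerpt for a process that is PID 1 inside its container.
sample_status = "Name:\tnginx\nPid:\t48213\nNSpid:\t48213\t1\n"
pids = parse_nspid(sample_status)
print(f"host view: {pids[0]}, container view: {pids[-1]}")

# Two processes share a namespace when type and identifier both match.
print(parse_ns_link("pid:[4026531836]") == parse_ns_link("pid:[4026532201]"))
```
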
 ## WebAssembly
 * W3C standard for portable high-performance applications
 * binary code
    * compiled to virtual CPU
    * runs in runtime
 * portable compilation target
 * near-native performance
 * WebAssembly System Interface (WASI): OS-level functionality + integrated
  security
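
To make "binary code" tangible: every WebAssembly binary module starts with the four magic bytes `\0asm` followed by a little-endian 32-bit version number, currently 1. The sketch below only checks that header; it does not parse sections or execute anything, and the helper name `read_wasm_header` is hypothetical.

```python
import struct

WASM_MAGIC = b"\x00asm"


def read_wasm_header(blob: bytes) -> int:
    """Return the binary format version, or raise ValueError for non-modules."""
    if len(blob) < 8 or blob[:4] != WASM_MAGIC:
        raise ValueError("not a WebAssembly binary module")
    (version,) = struct.unpack("<I", blob[4:8])
    return version


# The smallest valid module: magic number plus version, no sections at all.
empty_module = WASM_MAGIC + struct.pack("<I", 1)
print(read_wasm_header(empty_module))
```
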
 ## Trusted execution environment
 * confidential computing: protect data in use
    * at-rest data: data on storage, just encrypt it
    * in-transit data: use encryption (e.g. TLS)
    * in-use data: needs to be decrypted before it can be used in application
    * TEE looks to address data in use security concern
 * protect *guest* from untrustworthy *host*
    * confidentiality: unauthorized entities cannot view data used in TEE, data
      is encrypted in-memory
    * integrity: prevent tampering (checksums)
    * provable origin: hardware-signed evidence of origin and current state so
      client can verify and decide to trust code running in TEE
 * AMD Secure Encrypted Virtualization (SEV, SEV-ES)
 * Intel Software Guard Extensions (SGX)
 * Intel Trust Domain Extensions (TDX)
 ![Container architecture](./img/ch09/container_diagram.png)
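
The "provable origin" property of a TEE can be sketched as a measure, sign, verify flow. This is a simplified model under loud assumptions: real TEEs sign evidence with an asymmetric key rooted in the hardware and certified by the vendor, whereas this sketch substitutes an HMAC with a shared secret purely to stay inside the standard library. `HARDWARE_KEY` and all function names are hypothetical.

```python
import hashlib
import hmac

# Stand-in for a key fused into the hardware (assumption: real TEEs use an
# asymmetric, vendor-certified key instead of a shared secret).
HARDWARE_KEY = b"hypothetical-hardware-root-key"


def measure(code: bytes) -> bytes:
    """Hash the loaded code; plays the role of the TEE's launch measurement."""
    return hashlib.sha256(code).digest()


def sign_evidence(measurement: bytes, nonce: bytes) -> bytes:
    """Hardware side: bind the measurement to a client-chosen nonce."""
    return hmac.new(HARDWARE_KEY, measurement + nonce, hashlib.sha256).digest()


def verify_evidence(expected_code: bytes, nonce: bytes, evidence: bytes) -> bool:
    """Client side: recompute the expected evidence, compare in constant time."""
    expected = sign_evidence(measure(expected_code), nonce)
    return hmac.compare_digest(expected, evidence)


trusted_code = b"def handler(): return 'ok'"
nonce = b"fresh-client-nonce"

good = sign_evidence(measure(trusted_code), nonce)
tampered = sign_evidence(measure(trusted_code + b"# backdoor"), nonce)

print(verify_evidence(trusted_code, nonce, good))
print(verify_evidence(trusted_code, nonce, tampered))
```

The nonce is what keeps the evidence fresh: without it, a host could replay evidence recorded before it tampered with the workload.
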
--- a/img/ch09/container.png
+++ b/img/ch09/container.png
--- a/img/ch09/container_diagram.png
+++ b/img/ch09/container_diagram.png
--- a/img/ch09/micro_vm.png
+++ b/img/ch09/micro_vm.png
--- a/img/ch09/sandbox.png
+++ b/img/ch09/sandbox.png
--- a/img/ch09/type_1_hypervisor.png
+++ b/img/ch09/type_1_hypervisor.png
--- a/img/ch09/type_2_hypervisor.png
+++ b/img/ch09/type_2_hypervisor.png
--- a/img/ch09/unikernel.png
+++ b/img/ch09/unikernel.png
--- a/img/ch09/vm_diagram.png
+++ b/img/ch09/vm_diagram.png