Helm Charts

Helm charts are provided to facilitate the deployment and configuration of SysFlow on Kubernetes.

These charts have been tested on minikube and IBM Cloud Kubernetes Service. They shoud work on vanilla Kubernetes installations but it’s possible that minor differences in how authentication is handled by different cloud providers require small modifications to the charts.

These scripts have been tested with helm versions 2 and 3. Some helm commands may not work with other versions of helm.

Prerequisites

kubectl (installing kubectl)
Helm (installing helm)
Docker (optional)

Install minikube (optional)

To deploy SysFlow on a local Kubernetes instance (for development or testing), start by installing minikube in your macOS, Linux, or Windows system.

For example, to install minikube in Linux distributions, run:

curl -LO https://storage.googleapis.com/minikube/releases/latest/minikube-linux-amd64
sudo install minikube-linux-amd64 /usr/local/bin/minikube

Then, start your cluster:

minikube start

Note: to install SysFlow on minikube, set sfcollector.ebpf and sfcollector.mountEtc to true in values.yaml located inside each chart.

Check the minikube docs for additional installation options.

Tip: run eval $(minikube docker-env) to allow your Docker CLI to connect to minikube’s Docker environment.

The recommended driver for minikube is VirtualBox. Check the VirtualBox docs for installation instructions for your environment.

A note about Docker pull limits: If you run into an error when deploying SysFlow on minikube, check the logs to see if it’s related to the Docker pull limit being reached. It most likely is. To work around this inconvenience, connect to Minikube’s Docker environment (see above), log into Docker with docker login command, and pull the desired images manually, before installing the helm charts. Make sure the images pull policies are set to the default value IfNotPresent.

Deploy SysFlow

The SysFlow agent can be deployed in S3 (batch) or rsyslog (stream) export configurations.

Setup

Clone this repository and change directory as follows:

git clone https://github.com/sysflow-telemetry/sf-deployments.git
cd sf-deployments/helm

Installing the SysFlow agent with S3 Exporter

In this configuration, SysFlow exports the collected telemetry as trace files (batches of SysFlow records) to any S3-compliant object storage service.

This chart is located in charts/sf-exporter-chart, which deploys the SysFlow Collector and Exporter as a daemonset. The collector monitors the node, and writes trace files to a shared memory volume /mnt/data which the exporter manages and reads from to push completed traces to a S3-compliant object storage. The /mnt/data/ is mapped to a tmpfs filesystem, and you can specify its size using the tmpfsSize.

Installation scripts are provided to make installation easier. These scripts set up the environment including k8s secrets for S3 authentication. To connect to an S3-compliant data store, first take note of which port the S3 data store (s3Port) is configured. Minio installations listen on port 9000 by default. Also, if TLS is enabled on the S3 datastore, ensure s3Secure is true. Ensure that the s3Bucket is set to the desired S3 bucket location. The s3Location (aka s3_region), s3AccessKey and s3SecretKey and s3Endpoint are each passed in through the installation script if you use it.

To deploy the SysFlow agent with S3 export:

./scripts/installExporterChart.sh <s3_region> <s3_access_key> <s3_secret_key> <s3_endpoint> <s3_bucket>

Installing the SysFlow agent with rsyslog exporter

In this configuration, SysFlow exports the collected telemetry as events streamed to a rsyslog collector. This deployment enables the creation of customized edge pipelines, and offers a built-in policy engine to filter, enrich, and alert on SysFlow records.

This chart is located in charts/sf-processor-chart, which deploys the SysFlow Collector and Processor as a daemonset. The collector monitors the node, and streams SysFlow records to the processor, which executes a configurable edge analytic pipeline and export events to a rsyslog endpoint.

To deploy the SysFlow agent with rsyslog export:

./scripts/installProcessorChart.sh <syslog_host> <syslog_port> <syslog_proto>

Checking installation

To check that the install worked, run:

kubectl get pods -n sysflow

To check the log output of the collector container in a pod:

kubectl logs -f -c sfcollector <podname>  -n sysflow

To check the log output of the exporter container in a pod:

kubectl logs -f -c sfexporter <podname>  -n sysflow

To check the log output of the processor container in a pod:

kubectl logs -f -c sfprocessor <podname>  -n sysflow

Removing the SysFlow agent

To remove the SysFlow agent:

./scripts/deleteChart.sh

Advanced customizations

Most of the defaults should work out of the box. The collector is currently set to rotating files in 5 min intervals (or 300 seconds). CGroup resource limits can be set on the collector, exporter, and processor to limit resource usage. These can be adjusted depending on requirements and resources limitations.

Note: sfcollector.dropMode is set to true by default for performance considerations.

Kubernetes can use different container runtimes. Older versions used the docker runtime; however, newer versions typically run either containerd or crio. It’s important to know which runtime you have if you want to get the full benefits of SysFlow. You tell the collector which runtime you are using based on the sock file you refer to in the criPath variable. If you are using the docker runtime, leave criPath blank. If you are using containerd, set criPath to “/var/run/containerd/containerd.sock” and if you are using crio, set criPath to “/var/run/crio/crio.sock”. If SysFlow files are empty or the container name variable is set to incomplete in SysFlow traces, this typically means that the runtime socket is not connected properly.

Note: the installation script installs the pods into a K8s namespace called sysflow.

Below is the list of customizable attributes for the charts, organized by component. These can be modified directly into the values.yaml located in each chart’s directory. They can also be set directly into the helm command invoked by our installation scripts through --set <attribute>=<value> parameters.

SysFlow Collector

parameter	description	default
sfcollector.imagepullpolicy	Pull policy for image (Always\|Never\|IfNotPresent)	Always
sfcollector.repository	Image repository	sysflowtelemetry/sf-collector
sfcollector.tag	Image tag	latest
sfcollector.interval	Interval in seconds to roll new trace files	300
sfcollector.outDir	Directory in which collector writes trace files	/mnt/data/
sfcollector.filter	Filter expression	“"container.type!=host and container.name!=sfexporter and container.name!=sfcollector"”
sfcollector.criPath	Container runtime socket path. Use this “/var/run/containerd/containerd.sock”if running containerd runtime. Use “/var/run/crio/crio.sock” if running crio runtime.	“”
sfcollector.dropMode	Drop mode filters syscalls in the kernel before they are passed up to the collector, resulting in much better performance and fewer event drops. Note: It filters mmap system calls from the event stream.	true
sfcollector.fileOnly	Filters out any descriptor that is not a file, including unix sockets and pipes	false
sfcollector.procFlow	Enables the creation of process flows	false
sfcollector.readMode	Sets mode for reads: `0` enables recording all file reads as flows. `1` disables all file reads. `2` disables recording file reads to noisy directories: “/proc/”, “/dev/”, “/sys/”, “//sys/”, “/lib/”, “/lib64/”, “/usr/lib/”, “/usr/lib64/”.	0
sfcollector.ebpf	Enables ebpf probe (required for minikube deployment)	false
sfcollector.mountEtc	Mounts etc directory in container (required for minikube and Google COS)	false
sfcollector.collectionMode	Template modes for enabling certain system calls. Currently supports 3 modes: flow” - full sysflows, “consume” - file reads, writes, closes turned off, “nofiles” - no fileevents or fileflows	flow
sfcollector.enableStats	When enabled, logs stats on containers, processes, networkflows, fileflows and records written at interval set by “interval” attribute	false

SysFlow Exporter

parameter	description	default
sfexporter.enabled	Indicates whether the exporter will be used in the k8s deployment	false
sfexporter.imagepullpolicy	Pull policy for image (Always\|Never\|IfNotPresent)	Always
sfexporter.repository	Image repository	sysflowtelemetry/sf-exporter
sfexporter.tag	Image tag	latest
sfexporter.log	Exporter logging level. Can be DEBUG, INFO, WARNING, ERROR, CRITICAL	INFO
sfexporter.type	Type of trace export - “s3” to export to S3 storage, “local” for local copy	s3
sfexporter.interval	Interval in seconds to check whether to export trace files	5
sfexporter.outDir	Directory shared between the collector and exporter and where collector writes	/mnt/data/
sfexporter.dirs	Directories (comma separated) from which exporter will copy	/mnt/data
sfexporter.toDir	Directories (comma separated) to copy trace too - only used when type = “local”. Must have same number of entries as dirs attribute	commented out
sfexporter.mode	modes of copy (comma separated) move-del - move and delete file once finished writing - this is the only mode local copy supports. cont-update - continuously copy file over at interval (s3), cont-update-recur - continously update a directory structure recursively (s3). Must have same number of entries as dirs attribute	move-del
sfexporter.s3Endpoint	S3 host address (only used when type s3)	“<ip address>”
sfexporter.s3Port	S3 port (only used when type s3)	443
sfexporter.s3Bucket	S3 bucket where to push traces (only used when type s3). Can be a comma separated list of buckets. Must have same number of entries as dirs attribute	“<s3 bucket>”
sfexporter.s3Location	S3 location (only used when type s3)	“<s3 region>”
sfexporter.s3AccessKey	S3 access key (only used when type s3)	“<s3 access key>”
sfexporter.s3SecretKey	S3 secret key (only used when type s3)	“<s3 secret key>”
sfexporter.s3Secure	S3 connection, `true` if TLS-enabled, `false` otherwise (only used when type s3)	false

SysFlow Processor

parameter	description	default
sfprocessor.imagepullpolicy	Pull policy for image (Always\|Never\|IfNotPresent)	Always
sfprocessor.repository	Image repository	sysflowtelemetry/sf-processor
sfprocessor.tag	Image tag	latest
sfprocessor.export	Export type (`terminal`\|`file`\|`syslog`)	syslog
sfprocessor.override	Override processor exporter in pipeline.json with values.yaml settings	true
sfprocessor.syslogHost	rsyslog host address	localhost
sfprocessor.syslogPort	rsyslog port	514
sfprocessor.syslogProto	rsyslog protocol (`udp`\|`tcp`\|`tcp+tls`)	tcp
sfprocessor.configMapEnabled	‘true’ if using config map for policy configs	‘true’
sfprocessor.findingsDir	Directory to which raw findings are written. Must be the same as the findings.path value in the pipeline.json	/mnt/findings