ServiTrace Replication Package

The replication package for our manuscript Let’s Trace It: Fine-Grained Serverless Benchmarking for Synchronous and Asynchronous Applications consists of the ServiTrace tool (previously called ServiBench), our analysis of the Azure workload traces, the ten benchmarking applications and benchmarking harness, and our analysis of the obtained dataset. ServiTrace analyzes distributed serverless traces using a novel algorithm and heuristics for extracting a detailed latency breakdown, leverages a suite of serverless applications representative of production usage, including synchronous and asynchronous serverless applications with external service integrations, and automates comprehensive, end-to-end experiments to capture application-level performance.

ServiTrace

ServiTrace is a meta-benchmarking tool to orchestrate reproducible serverless application benchmarking.

Reproducible deployments: sb abstracts away dependencies using Docker and automatically mounts application code and credentials (when needed) into the right container directories.
Automated load generation: sb provides different classes of invocation patterns derived from real-world traces. It integrates with the open source load testing tool k6.
Clear box application insights: for instrumented applications, sb implements detailed trace analysis using distributed tracing such as AWS X-Ray and Azure Application Insights.

Quick Setup

Install Docker 19.03+
Install Python 3.7+ (tested with 3.7, 3.8. 3.9, 3.10)

ARM-based Apple M1 systems are not yet fully supported. The trace analyzer works with Python 3.10 but the Docker integration can cause some problems with certain applications.

Install the sb tool:

a) pipx (recommended for CLI)

python3 -m pip install --upgrade pip
python3 -m pip install --user pipx
python3 -m pipx ensurepath  # might require terminal restart
cd servi-bench
pipx install --editable .

b) venv (required for SDK when using the programmatic API)

python3 -m venv sb-env
source sb-env/bin/activate  # depends on shell
cd servi-bench
python3 -m pip install --upgrade pip
pip install --editable .

Build the sb Dockerfile via sb init
Login for providers via sb login PROVIDER: Supported for aws, azure, google, ibm.

Credentials

The credentials are stored in a Docker volume called PROVIDER-secrets (e.g., aws-secrets) and selectively mounted when needed. They can be deleted via sb logout PROVIDER. Check credentials via sb check_credentials PROVIDER

Getting Started

# Run empty mock benchmark locally
sb test --file=tests/fixtures/mock_benchmark/mock_benchmark.py

# Get an AWS and Azure app
git clone git@github.com/ANONYMIZED/faas-migration.git
# AWS
cd faas-migration/ThumbnailGenerator/Lambda
# Azure
cd faas-migration/ThumbnailGenerator/Azure

# Benchmarking lifecycle using a `*_benchmark.py` file:
# 1) Deploy app
sb prepare
# 2a) Invoke app a single time
sb invoke
# 2b) Invoke app 10 times sequentially
sb invoke 10
# 2c) Benchmark with a pre-configured workload_type (steady|fluctuating|spikes|jump)
sb invoke fluctuating
# 3) Download traces
sb get_traces
# 4) Analyze latest traces
sb analyze_traces
# Hint for analyzing previous traces: sb analyze_traces logs/DATETIME/traces.json
# 5) Cleanup all cloud infrastructure
sb cleanup

Hints:

*_benchmark.py files in the current working directory are automatically detected (if only a single file exists).
sb test sequentially executes prepare, invoke (with workload_type=3) and, cleanup.
sb invoke custom_per_minute_rate_trace.csv supports custom CSV workload traces.
Checkout the AWS X-Ray Console for result traces (6h retention!) or CloudWatch logs.

Debugging

sb shell IMAGE starts an interactive shell with all auto-mounts in a given Docker IMAGE.
More examples of *_benchmark.py files are available under tests/fixtures (covered by integration tests)
Insert the code import code; code.interact(local=dict(globals(), **locals())) on any line to prompt an interactive Python shell.
Checkout the DEVELOPMENT docs for more details.

Adding a custom application

Create a *_benchmark.py file in the main directory of your application.
Implement hooks for prepare(spec), invoke(spec), and cleanup(spec) as shown under mock_benchmark.py. Key functionality:
- spec.run(CMD, image=DOCKERIMAGE) Runs a given CMD in a DOCKERIMAGE and returns its stdout.
- spec.build(IMAGE_TAG) Builds a Dockerfile and tags it with IMAGE_TAG.
- spec['KEY'] provides a persistent key-value store across different benchmark cycles (e.g., share state between prepare and invoke)
- The BENCHMARK_CONFIG constant initializes the key-value store and specifies configurable attributes (e.g., region) and meta-information (e.g., provider).
- The working directory is defined by the location of *_benchmark.py (i.e., same directory).
- sb mounts the working directory by default into any Docker container. If files at higher levels are required, the root benchmark config allows to mount higher level directories (e.g., parent using ..).
- sb integrates with k6 for load testing.
- sb invoke automatically generates a workload_options.json file with k6 options.
- sb invoke and sb get_traces automatically create logs in the working directory under logs with the start timestamp of the invocation.
Instrument your application (provider- and language-dependent):
- AWS: Enable X-Ray tracing and add language-specific instrumentation as described here.
- Azure: TODO(clarify how to use Azure Insights metrics for distributed tracing)

Further documentation

Reproducing our Experiments

We describe how to reproduce our experiments in the AWS cloud environment to collect a new dataset following the same experiment design.

Carefully review the experiment configuration and be aware that experiments with high-load levels can cost 1000s of USD in cloud bills!

Preparation

Create an AWS account for sb following these instructions
Create an X-Ray sampling rule called NoSampling with highest priority (1) and 100% fixed sampling rate.

Run Experiments

Set up a load generator in AWS EC2 following these instructions. Use the alias lg12 in the SSH config or adjust set the environment variable SB_DATA_SOURCE for the following steps.

Copy the experiment_plans into the load generator

scp experiment_plans/exp* ec2-user@lg12:/home/ec2-user

Run the experiment as described in each experiment plan using tmux because ordinary SSH sessions might disconnect during long running experiments. The experiment plans automate the benchmarking lifecycle including application deployment, trace collection, and application cleanup.
Download the collected traces using make retrieve_logs

Analysis of the Azure Workload Traces

The RStudio project in azuredataset-analysis analyzes invocation patterns in the Azure Function Traces [1].

Requirements

RStudio with R 4.1.2
Install R packages by running 00_install_packages.R
Download the original Azure Functions Traces dataset here as instructed in 01_download.R

Generate Plots and Export Traces

Run run.R to merge and filter the dataset, generate invocation plots, and export selected trace files into the results directory.
Run summarize_filtering.R to print a summary of the filtering process step by step.
Run summarize_patterns.R to merge the traces with the invocation patterns from manual classification.

Results

results/invocations/20min_min1rps/*.pdf contains 200 PDFs with a line plot showing invocation rates over 20 minutes for traces with request rates > 1 request per second (rps) generated by 03_plot_invocations.R
results/invocations/20min_min1rps/sampled.csv is a list of the function hashes for all samples functions.
results/invocations/20min_min1rps/trace_classification.csv contains the manual trace classification for each of the 200 PDFs.
results/traces/20min_picks/*.csv contains one selected 20 minute trace for each of the 4 invocation patterns generated by 04_export_traces.R.

R Session Info

Summary of R session with detailed version report:

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] forcats_0.5.1     stringr_1.4.0     readr_2.1.1       tidyr_1.1.4       tibble_3.1.6      ggplot2_3.3.5     tidyverse_1.3.1   purrr_0.3.4       dplyr_1.0.7      
[10] data.table_1.14.2

loaded via a namespace (and not attached):
 [1] Rcpp_1.0.7       cellranger_1.1.0 pillar_1.6.4     compiler_4.1.2   dbplyr_2.1.1     tools_4.1.2      digest_0.6.29    jsonlite_1.7.2   lubridate_1.8.0 
[10] lifecycle_1.0.1  gtable_0.3.0     pkgconfig_2.0.3  rlang_0.4.12     reprex_2.0.1     cli_3.1.0        rstudioapi_0.13  DBI_1.1.2        haven_2.4.3     
[19] xml2_1.3.3       withr_2.4.3      httr_1.4.2       fs_1.5.2         generics_0.1.1   vctrs_0.3.8      hms_1.1.1        grid_4.1.2       tidyselect_1.1.1
[28] glue_1.6.0       R6_2.5.1         fansi_0.5.0      readxl_1.3.1     farver_2.1.0     tzdb_0.2.0       modelr_0.1.8     magrittr_2.0.1   backports_1.4.1 
[37] scales_1.1.1     ellipsis_0.3.2   rvest_1.0.2      assertthat_0.2.1 colorspace_2.0-2 labeling_0.4.2   utf8_1.2.2       stringi_1.7.6    munsell_0.5.0   
[46] broom_0.7.11     crayon_1.4.2

References

[1] M. Shahrad, R. Fonseca, I. Goiri, G. Chaudhry, P. Batum, J. Cooke, E. Laureano, C. Tresness, M. Russinovich, and R. Bianchini Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider, 2020 USENIX Annual Technical Conference, USENIX ATC 2020, July 15-17, 2020. 2020.

Benchmarking Applications

ServiTrace supports the following ten applications out of the box. See Adding a custom application for information on how to integrate additional applications.

### Minimal Baseline This application implemetnts the API Gateway + Lambda pattern and serves as our minimal baseline for a serverless application.

This application is from Serverlessland (https://serverlessland.com/patterns/apigw-lambda-cdk) and was forked from the https://github.com/aws-samples/serverless-patterns/ repository. To use this application, run the following commands:

cd serverless-patterns/src/
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Thumbnail Generator

The Thumbnail Generator application generates a thumbnail of an image uploaded to a storage bucket. The first function implements an HTTP API to upload an image to a storage bucket. The storage event then triggers a second function to generate a thumbnail of the image and store it in another storage bucket.

This application is from the following study by Yussupov et al.:

Vladimir Yussupov, Uwe Breitenbücher, Frank Leymann, and Christian Müller. Facing the unplanned migration of serverless applications: A study on portability problems, solutions, and dead ends. In Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing, pages 273–283, 2019.

It was originally forked from the https://github.com/iaas-splab/faas-migration repository. To use this application, run the following commands:

cd applications/faas-migration/ThumbnailGenerator/Lambda
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Event Processing

The Event Processing application generates and inserts event into an input queue. The queue triggers a lambda which pre-processes the event and places it in the ingested queue. The placement of an event in the ingested queue triggers another lambda to process the event and store the results in the database.

This application is from the following study by Yussupov et al.:

Vladimir Yussupov, Uwe Breitenbücher, Frank Leymann, and Christian Müller. Facing the unplanned migration of serverless applications: A study on portability problems, solutions, and dead ends. In Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing, pages 273–283, 2019.

It was originally forked from the https://github.com/iaas-splab/faas-migration repository. To use this application, run the following commands:

cd applications/faas-migration/Event-Processing/Lambda
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Facial Recognition

Facial Recognition app takes a user uploaded image, extracts a face from it, and detects if the face already exists in the database. If the face does not already exist in the database, the app indexes the face and saves a thumbnail of the face to object storage.

This application is part of the AWS Wild Rydes workshop (https://www.image-processing.serverlessworkshops.io/) and was cloned from the https://github.com/aws-samples/aws-serverless-workshops/ repository.To use this application, run the following commands:

cd applications/aws-serverless-workshops/ImageProcessing/
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Model Training

Model Training application reads datasets from object storage, trains machine learning models on those datasets, and stores the trained models in object storage.

This application is from the following publication by Kim et al.:

Jeongchul Kim and Kyungyong Lee. FunctionBench: A suite of workloads for serverless cloud function service. In Proceedings of the 12th IEEE International Conference on Cloud Computing (CLOUD WIP), pages 502–504, 2019.

It was originally forked from the https://github.com/kmu-bigdata/serverless-faas-workbench repository. To use this application, run the following commands:

cd applications/serverless-faas-workbench/aws/cpu-memory/model_training/
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Realworld Backend

RealWorld Backend uses a FaaS to create, read, update, and delete user and article information stored in a database.

This application is an AWS implementation of the real world spec (https://github.com/gothinkster/realworld) and was forked from the https://github.com/anishkny/realworld-dynamodb-lambda repository. To use this application, run the following commands:

cd applications/faas-migration-go/aws
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Hello Retail

The Hello Retail! is a retail inventory catalog application backed by a database. Users can upload product information and categorize products into categories. Supports sending an SMS if a product does not have an image. Uploaded images are stored in object storage.

This application was forked from the https://github.com/etsangsplk/hello-retail repository. To use this application, run the following commands:

cd applications/hello-retail
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Todo API

The Todo API application is a simple to-do app which uses a FaaS to create, read, update, and delete todos stored in a database.

This application is from the following study by Yussupov et al.:

Vladimir Yussupov, Uwe Breitenbücher, Frank Leymann, and Christian Müller. Facing the unplanned migration of serverless applications: A study on portability problems, solutions, and dead ends. In Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing, pages 273–283, 2019.

It was originally forked from the https://github.com/iaas-splab/faas-migration repository. To use this application, run the following commands:

cd applications/faas-migration-go/aws
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Matrix Multiplication

The Matrix Multiplication application generates a random matrix, partitions the matrix, and distributes it for multiplication. Workers perform the multiplication and write the results to S3. The results are then combined to get the final result of the multiplication. The app is directed by an orchestration service.

This application is from the following study by Yussupov et al.:

Vladimir Yussupov, Uwe Breitenbücher, Frank Leymann, and Christian Müller. Facing the unplanned migration of serverless applications: A study on portability problems, solutions, and dead ends. In Proceedings of the 12th IEEE/ACM International Conference on Utility and Cloud Computing, pages 273–283, 2019.

It was originally forked from the https://github.com/iaas-splab/faas-migration repository. To use this application, run the following commands:

cd applications/faas-migration/MatrixMultiplication/Lambda
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Video Processing

The Video Processing application reads videos from object storage, applies filters, and transcodes them. The transcoded videos are stored in object storage.

This application is from the following publication by Kim et al.:

Jeongchul Kim and Kyungyong Lee. FunctionBench: A suite of workloads for serverless cloud function service. In Proceedings of the 12th IEEE International Conference on Cloud Computing (CLOUD WIP), pages 502–504, 2019.

It was originally forked from the https://github.com/kmu-bigdata/serverless-faas-workbench repository. To use this application, run the following commands:

cd applications/serverless-faas-workbench/aws/cpu-memory/video_processing/
sb prepare
sb invoke fluctuating
sb get_traces
sb analyze_traces
sb cleanup

Critical Path Extraction for Asynchronous Invocations

To extract the critical path, we use Algorithm 1, which is a modified version of the weighted longest-path algorithm proposed for microservices at OSDI'20 in the paper FIRM: An Intelligent Fine-grained Resource Management Framework for SLO-Oriented Microservices (Qiu et al.).

We re-implement their algorithm and provide a few test cases that demonstrate some limitations. We fix an ordering bug and show that it cannot handle asynchronous invocations in serverless traces.

We provide the following implementations:

critical_path_algorithm_async.py
- Our Algorithm 1 with extensive documentation.
- Description of high-level algorithm.
- Challenges and solutions we use to solve them (e.g., related to timing).
- Detailed line-by-line explanations, sometimes proving alternative Python implementations and direct mappings to the algorithm in the paper.
critical_path_algorithm_async_test.py
- Test cases for Algorithm 1
critical_path_algorithm_sync.py
- Fixed version of original algorithm by Qiu et al.
critical_path_algorithm_sync_test.py
- Failing test cases for original algorithm

Analysis of the obtained Dataset

The dataset-analysis contains scripts and instructions to reproduce:

a) the data analysis based on our published dataset
b) the cloud experiments in a serverless cloud environment using sb.
c) the invocation pattern analysis based on the Azure Function Traces dataset (see Analysis of the Azure Workload Traces)

Download the dataset

The full dataset (~6GB compressed, ~70GB uncompressed) collected during our experiments in available on Zenodo:

https://doi.org/10.5281/zenodo.5879445

Preparation

We first use the sb trace analyzer to pre-process the raw traces before generating plots.

Activate virtual environment with sb

source sb-env/bin/activate  # depends on shell

Pre-process traces through sb analyzer
```
make analyze_traces
```

Generate Plots

Install Python 3.7+
Create virtual environment
```
python3 -m venv sb-dataset-analysis
```

Install the Python dependencies

pip install --upgrade pip
pip install -r requirements.txt

Generate plots

a) Plain Python
```
make generate_plots
make generate_plots_all
```
b) VSCode Interactive: Run individual cells with the Python extension in interactive mode

The data source can be configured via SB_DATA_SOURCE (default lg12) or directly through SB_DATA_DIR (default data/lg12/raw).

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
applications		applications
azuredataset-analysis		azuredataset-analysis
dataset-analysis		dataset-analysis
figures		figures
servi-bench		servi-bench
LICENSE		LICENSE
README.md		README.md

License

ServiTrace/ReplicationPackage

Folders and files

Latest commit

History

Repository files navigation

ServiTrace Replication Package

ServiTrace

Quick Setup

Credentials

Getting Started

Debugging

Adding a custom application

Further documentation

Reproducing our Experiments

Preparation

Run Experiments

Analysis of the Azure Workload Traces

Requirements

Generate Plots and Export Traces

Results

R Session Info

References

Benchmarking Applications

Thumbnail Generator

Event Processing

Facial Recognition

Model Training

Realworld Backend

Hello Retail

Todo API

Matrix Multiplication

Video Processing

Critical Path Extraction for Asynchronous Invocations

Analysis of the obtained Dataset

Download the dataset

Preparation

Generate Plots

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages