My name is Leo and I am interested in Observability, Distributed Systems, and Star Wars.
Benchmarking Highly Distributed Tracing in OpenTelemetry
In recent years the two research fields around internet of things (IoT) devices as well as cloud native architectures and micro services have seen an increased amount of attention. In this paper we ask the question whether it is viable to combine these fields by using the tooling that was created for tracing in highly distributed systems to measure the performance and reliability of smart devices on the edge. Specifically, we want to identify the performance characteristics of existing tooling when deployed on commodity hardware. By measuring the maximum throughput of the OpenTelemetry Collector under different configurations, we want to evaluate its fitness for such highly distributed tracing. We found that even under high loads in the range of multiple thousands of requests per second, the collector can reliably sustain a high level of throughput with small latency when passing incoming data on. Especially when sampling only a fraction of all requests, a single collector deployed on commodity hardware is able to handle several thousand of clients simultaneously.
Large Scale Outage Visibility on the Control Plane
With the increasing cloud usage for access to fast and well-connected computational power, cloud outages have also become a growing risk for businesses and individuals alike. We derive a method to analyze publicly available BGP data to measure the visibility of cloud providers' outages on the Internet control plane. We then utilize this method to analyze an outage of Cloudflare, a large DNS and content provider. Cloudflare鈥檚 outage study shows that visible traces can be found in BGP, enabling data-driven outage studies.
Building a lexer for lambda
Building a lexer for lambda, an interpreter for the untyped lambda calculus.
Building a Spotify scraper
Building a concurrent scraper for Spotify Charts in Go