Building Kafka without Disks with Richie Artoul and Ryan Worl from WarpStream Labs

2024/1/16
Software Huddle

In this episode, we spoke with the founders of WarpStream Labs, Richard Artoul and Ryan Worl.

WarpStream is a fascinating rethink of Kafka -- how could you simplify and improve the Kafka design by slightly tweaking your constraints? The result is very compelling -- a Kafka-compatible API that bypasses local disk by writing everything directly to S3. For the tradeoff of a slightly higher end-to-end latency, you can get a Kafka cluster that's much cheaper and way easier to operate.

Richie and Ryan have been working on high-scale data systems for years and were the engineers behind Husky, Datadog's custom-built database for logs and metrics. In this episode, they walk us through their experience building WarpStream. They touch on all the hard parts of building your own system (including why it's gotten easier!), as well as some of the difficult problems they had to solve for full compatibility with existing Kafka client libraries. They also touch on using FoundationDB, their thoughts on S3 Express One Zone, and whether AWS's cross-AZ network costs are a scam.

Lots of interesting thoughts here from a really sharp team.