cover of episode How Apache Pinot Achieves 200,000 Queries per Second (with Tim Berglund)

How Apache Pinot Achieves 200,000 Queries per Second (with Tim Berglund)

2024/3/20
logo of podcast Developer Voices

Developer Voices

Frequently requested episodes will be transcribed first

Shownotes Transcript

The likes of LinkedIn and Uber use Pinot to power some astonishingly high-scale queries against realtime data. The numbers alone would make an impressive case-study. But behind the headline lies a fascinating set of architectural decisions and constraints to get there. So how does Pinot work? How does it process queries? How are the various roles split across a cluster? And equally important - what does it not try to achieve.

Joining me to go through the nuts and bolts of how Pinot handles SQL queries is Tim Berglund, veteran technology explainer of the realtime-data world. He takes us through Pinot step-by-step, covering the roles of brokers, servers, controllers and minions as we build up the picture of a query engine that's interesting in theory and massively performant in practice.

Apache Pinot: https://pinot.apache.org/)

Apache Pinot Docs: https://docs.pinot.apache.org/)

StarTree: https://startree.ai/))Event Driven Design episode with Bobby Calderwood: https://youtu.be/V7vhSHqMxus)

Tim on Twitter: https://twitter.com/tlberglund

Kris on Mastodon: http://mastodon.social/@krisajenkins

Kris on LinkedIn: https://www.linkedin.com/in/krisjenkins/)

Kris on Twitter: https://twitter.com/krisajenkins)

#podcast #softwaredevelopment #apachepinot #database #dataengineering #sql