Home

Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the dis

Episodes

Total: 448

Summary Controlling access to a database is a solved problem… right? It can be straightforward

Summary Building internal expertise around big data in a large organization is a major competitive a

Summary The past year has been an active one for the timeseries market. New products have been laun

Summary The Hadoop platform is purpose built for processing large, slow moving data in long-running

Summary As more companies and organizations are working to gain a real-time view of their business,

Summary Processing high velocity time-series data in real-time is a complex challenge. The team at

Summary Every business needs a pipeline for their critical data, even if it is just pasting into a

Summary Apache Spark is a popular and widely used tool for a variety of data oriented projects. Wit

Summary Distributed systems are complex to build and operate, and there are certain primitives that

Summary When your data lives in multiple locations, belonging to at least as many applications, it

Summary Modern applications and data platforms aspire to process events and data in real time at sc

Summary A data lake can be a highly valuable resource, as long as it is well built and well managed

Summary Business intelligence is a necessity for any organization that wants to be able to make inf

Summary Jupyter notebooks have gained popularity among data scientists as an easy way to do explora

Summary As data science becomes more widespread and has a bigger impact on the lives of people, it

Summary With the growth of the Hadoop ecosystem came a proliferation of implementations for the Hiv

SummaryOne of the most complex aspects of managing data for analytical workloads is moving it from a

Summary There are countless sources of data that are publicly available for use. Unfortunately, com

Summary As your data needs scale across an organization the need for a carefully considered approach

Summary Every business with a website needs some way to keep track of how much traffic they are get