cover of episode #036 Why Distributed Processing Is Super Important

#036 Why Distributed Processing Is Super Important

2018/9/10
logo of podcast Plumbers of Data Science

Plumbers of Data Science

Frequently requested episodes will be transcribed first

Shownotes Transcript

You need to become comfortable with distributed processing. Data Science or the Internet of Things, the amount of data that is getting produced and processed grows like crazy. In this podcast I talk about how a platform for distributed processing looks like. I talk about the different layers that need parallelization, as well as the tools you can use for on premise installations or clouds like AWS, Azure or Google Cloud. Big Data tools like Kafka, Spark or server less like Kinesis or Lambda functions.