Apache Hudi: Large Scale Data Systems with Vinoth Chandar

2021/5/13

Data Archives - Software Engineering Daily

Apache Hudi is an open-source data management framework that simplifies incremental data processing and data pipeline development. It helps manage business requirements such as data lifecycle more efficiently and improves data quality. Common use cases for Hudi include record-level inserts, updates, and deletes; simplified file management and near real-time data access; and simplified CDC data pipeline development (AWS.amazon.com).

In this episode we speak to Vinoth Chandar, VP of Apache Hudi. Vinoth created the Hudi project at Uber and continues to lead its evolution at the Apache Software Foundation. Previously he was a Principal Engineer at Confluent, and a Sr Staff Engineer/Manager at Uber before that. We discuss building large-scale distributed systems and data systems.

The post Apache Hudi: Large Scale Data Systems with Vinoth Chandar appeared first on Software Engineering Daily.