Loading…
Budapest Data 2015 has ended
Thursday, June 4 • 13:30 - 14:00
Designing Agile Data Pipelines

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Agile software development values responding to change over following a plan. Responding to changes means allowing data scientists to experiment with data and allowing developers to easily modify data processing and even make mistakes without taking huge risks. Well designed data pipelines gives organizations flexible data analysis. In this session we'll show how to design architectures that make it easy and safe to extend and modify data analysis software.

We will look at how to design an agile data processing architecture using Apache Hadoop, Apache Kafka and stream processing frameworks. The architectures we’ll discuss make it easy to add new data sources, experiment with new analysis algorithms and correct data processing errors. All this makes the data pipeline both flexible and safe.

Speakers
avatar for Ashish Singh

Ashish Singh

Software Engineer, Cloudera
Ashish Singh is a Software Engineer, working with Cloudera to empower Hadoop ecosystem to answer bigger questions. He contributes to Apache Kafka, Hive, Parquet and Sentry. Prior to joining Cloudera, he worked on optimizing MPI collective communications on High Performance Computing... Read More →


Thursday June 4, 2015 13:30 - 14:00 CEST
Mátyás II.

Attendees (0)