Loading…
Budapest Data 2015 has ended
Back To Schedule
Wednesday, June 3 • 16:15 - 16:45
Hive powered by Spark

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Apache Hive has become de facto standard SQL on big data in Hadoop ecosystem. It is used extensively in data warehousing and data analytics with big data. Not long ago, Hive queries could only run on MapReduce and Tez. As Apache Spark become mature as an open-source data analytics cluster computing framework, it's also introduced to Apache Hive as a new, powerful execution engine. The obvious benefit is making Hive available to Spark users and providing a better performance and response time for existing Hive users. This presentation will talk about the motivation, design principles, architecture, etc. followed by a demo.

Speakers
avatar for Xuefu Zhang

Xuefu Zhang

Software Engineer, Cloudera
Xuefu Zhang has over 10 year’s experience in software development. Working for Cloudera since May 2013, he spends a lot of his efforts on Apache Hive and Pig. He also worked in the Hadoop team at Yahoo when the majority of the development on Hadoop was still there. Xuefu Zhang is... Read More →


Wednesday June 3, 2015 16:15 - 16:45 CEST
Mátyás I

Attendees (0)