Shanghai, China
June 24–26, 2019
Click here for more information and registration

Simultaneous translation will be provided for all keynote and breakout sessions.

To view the Chinese version of this schedule please go here.

Venue + Sponsor Showcase Map
场馆 + 赞助商展示区地图
Back To Schedule
Tuesday, June 25 • 15:05 - 15:40
HDFS CSI Plugin: Speed Up Kubernetes in On-Premises Big Data Cluster - Yi Chen & Junping Du, Tencent

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Kubernetes not only becomes predominant in public cloud area these days, but also becomes a new trend in on-premises big data cluster environment, as an alternative of Hadoop YARN, a resource schedule component. In on-premise big data cluster, majority data are saved in HDFS. How to consume big data in HDFS with Kubernetes is a new challenge to users.
In the talk we will introduce our CSI compatible HDFS plugin design and architecture first. Then, we will share our best practices and knowledge about how big data workload Spark use HDFS CSI plugin to access HDFS data when running on K8s. In the end, the TPC-DS benchmark suite will be used to analysis performance comparison between Spark on K8s with HDFS and Spark on YARN with HDFS.


Junping Du

Architect, Tencent
Junping Du is chief architect for Tencent Cloud Big Data Department and responsible for cloud data warehouse engineering team. As Committer/PMC member, he serves as release manager of Hadoop 2.6.x and 2.8.x for Apache Hadoop community. Junping has more than 10 years industry experiences... Read More →

Yi Chen

Senior Software Engineer, Tencent
Yi Chen is a senior software engineer at Tencent Cloud, responsible for cloud data warehouse development. As a Hadoop committer/PMC member, she focuses on big data storage area, and also leads the Hadoop 2.9.1 release for Apache Hadoop community. Before joining Tencent, she was the... Read More →

Tuesday June 25, 2019 15:05 - 15:40 CST