Loading…
Shanghai, China
June 24–26, 2019
Click here for more information and registration

Simultaneous translation will be provided for all keynote and breakout sessions.
我们将为所有主题演讲和分组会议提供同声传译服务。

To view the Chinese version of this schedule please go here.
请点击此处查看中文版本。

Venue + Sponsor Showcase Map
场馆 + 赞助商展示区地图
Back To Schedule
Tuesday, June 25 • 11:45 - 12:20
High Available + Scalable Prometheus with Thanos in Alibaba - Guo'an Qin, Alibaba & Tao Li, Alibaba

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Alibaba Group is using Kubernetes to support the world's largest e-commerce business. With the respect of the availability and scalability, how to provide reliable fine-grained monitoring and alerting services is a indeed challenge.

In this talk, we'll share the experiences in developing a fine-grained monitoring system with high availability and scalability based on the open source project Prometheus and Thanos. This system mainly supports Alibaba's cluster management system, which has 4 million TPS and 10K requests per-second.

We will have a discussion in following topics. 1) How to support a large-scale scenarios using Prometheus? 2) How to solve data query problem caused by multiple Prometheus instance with low query latency using Thanos? 3) The lessons we learnt from Prometheus and Thanos's configuration, such as target discovery and management of recording rule and alerting rule.

Speakers
GQ

Guo'an Qin

Staff Engineer, Alibaba
Guo'an Qin is a staff engineer at Alibaba. He works in the sigma scheduler team. He worked in the Alibaba database team, where he developed a database scheduling system that supported the operation and maintenance of the Alibaba database.
avatar for Tao Li

Tao Li

Senior Engineer, Alibaba Cloud
Tao Li, senior engineer of Alibaba Cloud. He works in container service team, focusing on cost optimization and ensuring runtime quality through scheduling in warehouse-scale, with years of developing experience in K8s scheduling.


final pdf

Tuesday June 25, 2019 11:45 - 12:20 CST
515