February 8, 2020

329 words 2 mins read

ThirdEye: LinkedIns business-wide monitoring platform

ThirdEye: LinkedIns business-wide monitoring platform

Failures or issues in a product or service can negatively affect the business. Detecting issues in advance and recovering from them is crucial to keeping the business alive. Join Akshay Rai to learn more about LinkedIn's next-generation open source monitoring platform, an integrated solution for real-time alerting and collaborative analysis.

Talk Title ThirdEye: LinkedIns business-wide monitoring platform
Speakers Akshay Rai (Linkedin)
Conference Strata Data Conference
Conf Tag Make Data Work
Location New York, New York
Date September 24-26, 2019
URL Talk Page
Slides Talk Slides
Video

Mean time to detect (MTTD) and mean time to restore (MTTR) describe how long it takes to discover a problem and how long it takes you to restore the issue after it was detected. The shorter the MTTD and MTTR, the less time spent in outage and the more availability your product retains. Given that products and services inevitably break at some point, you need to be adept at detecting and restoring service as soon as possible. The issue triage and restoration lifecycle is made up of several steps: capturing metrics, detection (requiring monitoring and alerting), escalation, investigating, and remediation. Each segment of the triage needs to be measured for efficiency and effectiveness in order to keep these metrics as short as possible. Akshay Rai walks you through ThirdEye, a self-service experience enabling anyone to rapidly identify and investigate deviations in business and system metrics. At LinkedIn, Third Eye is used by several teams spanning business analysts and engineers, and over 10K metrics are actively monitored. ThirdEye provides anomaly detection and collaborative dashboards for data analysis and brings together critical data that impacts metrics in a single place: holidays, deployments, company-wide issues and more. You’ll leave with an understanding of the concepts behind the open source ThirdEye project, how it’s built, and a look into ThirdEye’s insights and long-term plans. Akshay also gives you a powerful analysis of how ThirdEye helped detect and investigate some of the major issues that occurred on LinkedIn.

comments powered by Disqus