Performance anomaly detection at scale (sponsored by Salesforce)
Automated anomaly detection in production using simple data science techniques enables you to identify issues more quickly and reduce the time it takes to get customers out of an outage. Tuli Nivas shows how applying simple statistics can change how performance data is viewed and make it easy to identify issues in production effectively.
|Talk Title|Performance anomaly detection at scale (sponsored by Salesforce)|
|---|---|
|Speakers|Tuli Nivas (Salesforce)|
|Conf Tag|Build resilient systems at scale|
|Location|New York, New York|
|Date|September 20-22, 2016|
As performance engineers, we understand the importance of software testing during and after development in order to identify performance bottlenecks. Due to various constraints—whether a scaled-down test environment, limited data volume, or code integration limitations—it’s not always possible to catch every bug in test. If performance bottlenecks are not identified and resolved in a timely manner, customers may be impacted. As a result, anomaly detection in production takes on an even bigger significance.

The scale at which this kind of anomaly detection needs to be done is noteworthy—a few servers in test versus thousands of servers in production, with time being of the essence. That’s why anomaly detection at scale is one of the biggest challenges for a performance engineer. One of the most widely used techniques for identifying performance bugs is to look at time series data for the various metrics and use it to find potential problems. However, this approach doesn’t scale well in production, even if the time series data can be consolidated into a few charts.

Tuli Nivas shares techniques that address how time consuming this kind of analysis can be and demonstrates how applying simple statistics and basic linear regression principles can improve the productivity of a performance engineer tenfold or more. This session is sponsored by Salesforce.