A high-performance system for deep learning inference and visual inspection

Moty Fania and Sergei Kom share their experience and lessons learned implementing an AI inference platform to enable internal visual inspection use cases. The platform is based on open source technologies and was designed for real-time, streaming, and online actuation.


Talk Title	A high-performance system for deep learning inference and visual inspection
Speakers	Moty Fania (Intel), Sergei Kom (Intel)
Conference	Strata Data Conference
Conf Tag	Make Data Work
Location	New York, New York
Date	September 11-13, 2018
URL	Talk Page
Slides	Talk Slides
Video

Recent years have seen significant evolvement of deep learning and AI capabilities. AI solutions can augment or replace mundane tasks, increase workforce productivity, and relieve human bottlenecks. Unlike traditional automation, these solutions include cognitive aspects that used to require human decision making. In some cases, deep learning has proven to be even more accurate than humans in identifying patterns and therefore can be effectively used to enable various kinds of automated, real-time decision making. The advanced analytics team at Intel IT recently implemented an internal visual inference platform—a high-performance system for deep learning inference—designed for production environments. This innovative system enables easy deployment of many DL models in production while enabling a closed feedback loop where data flows in and decisions are returned through a fast REST API. The system maximizes throughputs through batching and smart in-memory caching while maintaining its ability to support long short-term memory networks. It can be deployed either as a cluster or standalone node. To enable stream analytics at scale, the system was built in a modern microservices architecture using cutting-edge technologies such as TensorFlow, TensorFlow Serving, Redis, Flask, and more. It’s optimized to be easily deployed with Docker and Kubernetes and cuts down time to market for deploying a DL solution. By supporting different kinds of models and various inputs, including images and video streams, this system can enable the deployment of smart visual inspection solutions with real-time decision making. Moty Fania and Sergei Kom explain how Intel implemented the platform and share lessons learned along the way. Topics include:

A high-performance system for deep learning inference and visual inspection

A high-performance system for deep learning inference and visual inspection

Scaling AI Inference Workloads with GPUs and Kubernetes

Distributed training of deep learning models

Conda, Docker, and Kubernetes: The cloud-native future of data science (sponsored by Anaconda)

Conda, Docker, and Kubernetes: The cloud-native future of data science (sponsored by Anaconda)

Open Network API Development