February 3, 2020

273 words 2 mins read

Performance evaluation of GANs in a semisupervised OCR use case

Performance evaluation of GANs in a semisupervised OCR use case

Even in the age of big data, labeled data is a scarce resource in many machine learning use cases. Florian Wilhelm evaluates generative adversarial networks (GANs) when used to extract information from vehicle registrations under a varying amount of labeled data, compares the performance with supervised learning techniques, and demonstrates a significant improvement when using unlabeled data.


Talk Title	Performance evaluation of GANs in a semisupervised OCR use case
Speakers	Florian Wilhelm (inovex GmbH)
Conference	Artificial Intelligence Conference
Conf Tag	Put AI to Work
Location	London, United Kingdom
Date	October 9-11, 2018
URL	Talk Page
Slides	Talk Slides
Video

Online vehicle marketplaces are embracing artificial intelligence to ease the process of selling a vehicle on their platform. The tedious work of copying information from the vehicle registration document into some web form can be automated with the help of smart text-spotting systems, in which the seller takes a picture of the document, and the necessary information is extracted automatically. Florian Wilhelm details the components of a text-spotting system, including the subtasks of object detection and optical character recognition (OCR). Florian elaborates on the challenges of OCR in documents with various distortions and artifacts, which rule out off-the-shelf products for this task. After offering an overview of semisupervised learning based on generative adversarial networks (GANs), Florian evaluates the performance gains of this method compared to supervised learning. More specifically, for a varying amount of labeled data, he compares the accuracy of a convolution neural network (CNN) to a GAN that uses additional unlabeled data during the training phase, showing that GANs significantly outperform classical CNNs in use cases with a lack of labeled data.

intelligence automated intel adversarial network optical cnn ocr supervised object detection network use case performance gans neural network artificial intelligence

comments powered by Disqus

Scaling AI Inference Workloads with GPUs and Kubernetes

Scaling AI Inference Workloads with GPUs and Kubernetes

January 7, 2020

Deep Learning (DL) is a computational intense form of machine learning that has revolutionize many fields including computer vision, automated speech recognition, natural language processing and artif …

Operationalize deep learning models for fraud detection with Azure Machine Learning Workbench

Operationalize deep learning models for fraud detection with Azure Machine Learning Workbench

December 6, 2019

Advancements in computing technologies and ecommerce platforms have amplified the risk of online fraud, which results in billions of dollars of loss for the financial industry. This trend has urged companies to consider AI techniques, including deep learning, for fraud detection. Francesca Lazzeri and Jaya Mathew explain how to operationalize deep learning models with Azure ML to prevent fraud.

Lightning Talk: Artificial Intelligence the Next Digital Wave for Telcos

Lightning Talk: Artificial Intelligence the Next Digital Wave for Telcos

January 22, 2020

Since five years Artificial Intelligence has entered a new dimension, thanks to the progress of machine and deep learning algorithms, the availability of high performance GPU and labelled Data. Artif …

Neural Network Distiller: A PyTorch environment for neural network compression

Neural Network Distiller: A PyTorch environment for neural network compression

January 11, 2020

Neta Zmora offers an overview of Distiller, an open source Python package for neural network compression research. Neta discusses the motivation for compressing DNNs, outlines compression approaches, and explores Distiller's design and tools, supported algorithms, and code and documentation. Neta concludes with an example implementation of a compression research paper.

Learning how to design automatically updating AI with Apache Kafka and Deeplearning4j

Learning how to design automatically updating AI with Apache Kafka and Deeplearning4j

December 7, 2019

Jason Bell offers an overview of a self-learning knowledge system that uses Apache Kafka and Deeplearning4j to accept data, apply training to a neural network, and output predictions. Jason covers the system design and the rationale behind it and the implications of using a streaming data with deep learning and artificial intelligence.

Automatic 3D MRI knee damage classification with 3D CNN using BigDL on Spark

Automatic 3D MRI knee damage classification with 3D CNN using BigDL on Spark

November 29, 2019

Damage to the meniscus is a physically limiting injury that can lead to further medical complications. Automatically classifying this damage at the time of an MRI scan would allow quicker and more accurate diagnosis. Jennie Wang, Valentina Pedoia, Berk Norman, and Yulia Tell offer an overview of their classification system built with 3D convolutional neural networks using BigDL on Apache Spark.