February 3, 2020

268 words 2 mins read

How machines learn to code: Machine learning on source code

How machines learn to code: Machine learning on source code

Thomas Endres and Samuel Hopstock demonstrate how to apply machine learning techniques on a program's source code, covering problems you may encounter, how to get enough relevant training data, how to encode the source code as a feature vector so that it can be processed mathematically, what machine learning algorithms to use, and more.


Talk Title	How machines learn to code: Machine learning on source code
Speakers	Thomas Endres (TNG), Samuel Hopstock (TNG Technology Consulting)
Conference	Artificial Intelligence Conference
Conf Tag	Put AI to Work
Location	London, United Kingdom
Date	October 9-11, 2018
URL	Talk Page
Slides	Talk Slides
Video

Machine learning on source code is a new area of research in the field of artificial intelligence, which, unlike classical problems such as image segmentation, does not yet have established standard techniques. For instance there are standard methods for processing images that make machine learning algorithms pay attention to their two-dimensionality. However, there are currently no common techniques for encoding the semantic structure of source code. Therefore, you need new ways to mathematically represent the code of projects. This technology offers a variety of possible applications, for example, in the area of static code analysis or in the automatic selection of relevant test cases. Thomas Endres and Samuel Hopstock share methods for transferring classic machine learning approaches to this new field of expertise. Along the way, Thomas and Samuel detail approaches for both automatic and manual training data generation and offer an overview of suitable models and machine learning frameworks for this challenge. They conclude by exploring the possibilities of using such models for the analysis of code.

intelligence intel code framework math algorithm machine learning artificial intelligence

comments powered by Disqus

A day in the life of a data scientist: How do we train our teams to get started with AI?

A day in the life of a data scientist: How do we train our teams to get started with AI?

January 27, 2020

With the growing buzz around data science, many professionals want to learn how to become a data scientistthe role Harvard Business Review called the "sexiest job of the 21st century." Francesca Lazzeri and Jaya Mathew explain what it takes to become a data scientist and how artificial intelligence solutions have started to reinvent businesses.

Agile for data science teams

Agile for data science teams

January 26, 2020

Agile methodologies have been widely successful for software engineering teams but seem inappropriate for data science teams, because data science is part engineering, part research. Jennifer Prendki demonstrates how, with a minimum amount of tweaking, data science managers can adapt Agile techniques and establish best practices to make their teams more efficient.

Lightning Talk: Artificial Intelligence the Next Digital Wave for Telcos

Lightning Talk: Artificial Intelligence the Next Digital Wave for Telcos

January 22, 2020

Since five years Artificial Intelligence has entered a new dimension, thanks to the progress of machine and deep learning algorithms, the availability of high performance GPU and labelled Data. Artif …

Using big data to unlock the delivery of personalized, multilingual real-time chat services for global financial service organizations

Using big data to unlock the delivery of personalized, multilingual real-time chat services for global financial service organizations

January 17, 2020

Financial service clients demand increased data-driven personalization, faster insight-based decisions, and multichannel real-time access. Tim Walpole details how organizations can deliver real-time, vendor-agnostic, personalized chat services and explores issues around security, privacy, legal sign-off, data compliance, and how the internet of things can be used as a delivery platform.

Executive Briefing: When privacy scalesIntelligent product design under global data privacy regulation

Executive Briefing: When privacy scalesIntelligent product design under global data privacy regulation

January 12, 2020

Data-driven companies making intelligent products must design for security and privacy to be competitive globally. Amanda Casari details the high-level changes that EU General Data Protection Regulation (GDPR)-compliant businesses face and how this translates to teams designing products driven by machine learning and artificial intelligence.

Neural Network Distiller: A PyTorch environment for neural network compression

Neural Network Distiller: A PyTorch environment for neural network compression

January 11, 2020

Neta Zmora offers an overview of Distiller, an open source Python package for neural network compression research. Neta discusses the motivation for compressing DNNs, outlines compression approaches, and explores Distiller's design and tools, supported algorithms, and code and documentation. Neta concludes with an example implementation of a compression research paper.