ICON 2018 - Tutorial

Explaining Deep Learning models for Natural Language Processing


Deep learning techniques have demonstrated tremendous success in the natural language processing (NLP) community, leading to the inclusion of deep learning components in many industry products.

Most models are trained on large amounts of data, generally created by users of the internet. These data may contain human biases and prejudices. Models learned from such data will carry those prejudices forward, which can be harmful. For example, Google Photos' algorithm was criticized as a 'racist algorithm' for labeling a black engineer's photos as a gorilla. Such mishaps could be prevented if companies were able to understand and validate the underlying rationale behind their deep learning components. Models and techniques that help uncover these rationales fall under the purview of Explainable AI.

Explainable AI technologies are the need of the hour. With the adoption of the GDPR, which includes, among other provisions, a right to explanation, the need for such technologies has grown further. The aim of this tutorial is to give an extensive overview of existing explainable AI techniques and describe which of them can be applied to deep learning models for NLP. Attendees will learn to frame their explanation requirements and apply these techniques to their own problem statements.

This tutorial spans three parts. In the first part, we discuss the basics of deep learning. In the second part, we discuss model explainability. In the third part, we demonstrate two techniques, LIME [25] and LRP [2,3], for explaining models trained on document classification and sentiment analysis tasks.
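To give a flavor of what the demonstration covers, the sketch below illustrates the core idea behind LIME (perturb the input, query the black-box model on each perturbation, and fit a locally weighted linear model whose coefficients act as per-word importance scores). This is a minimal illustration, not the `lime` library itself; the toy sentiment classifier `black_box` and all names here are assumptions for the example.

```python
import numpy as np

def black_box(words):
    """Toy stand-in for a trained sentiment model: P(positive)."""
    score = 1.0 * ("good" in words) - 1.0 * ("bad" in words)
    return 1.0 / (1.0 + np.exp(-score))

def lime_explain(words, n_samples=500, seed=0):
    """Return a per-word importance score via a local weighted linear fit."""
    rng = np.random.default_rng(seed)
    d = len(words)
    # Binary masks: which words are kept in each perturbed sample.
    Z = rng.integers(0, 2, size=(n_samples, d))
    Z[0] = 1  # include the unperturbed instance
    y = np.array([black_box([w for w, keep in zip(words, z) if keep])
                  for z in Z])
    # Proximity weights: samples closer to the original count more.
    w = np.exp(-(d - Z.sum(axis=1)) / d)
    # Append an intercept column, then solve ridge-regularized
    # weighted least squares for the local surrogate model.
    X = np.hstack([Z, np.ones((n_samples, 1))])
    A = X.T @ (w[:, None] * X) + 1e-3 * np.eye(d + 1)
    b = X.T @ (w * y)
    coefs = np.linalg.solve(A, b)
    return dict(zip(words, coefs[:d]))  # drop the intercept

explanation = lime_explain(["the", "movie", "was", "good"])
print(max(explanation, key=explanation.get))  # the word driving the prediction
```

Since the toy model responds only to the word "good", the surrogate assigns it the largest coefficient, which is exactly the kind of word-level attribution the tutorial demonstration produces on real classifiers.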