CIS 607    

    Seminar on Deep Learning for Natural Language Processing    

Course Description

Deep learning has recently proved itself as a powerful branch of machine learning that has applications in a variety of domains (i.e., computer vision, robotics, etc.). In natural language processing (NLP), deep learning has been transformative and led to a new generation of methods with better performance, portability and robustness (e.g., machine translation, text generation/dialog systems). Especially, with the recent breakthrough in pretrained language models (e.g., BERT, GPT), building state-of-the-art NLP models can be done efficiently for different domains and languages. All of those advances are very recent and the demand for data scientists with deep learning expertise is growing very quickly. At the beginning of this seminar, we will cover the basic concepts of deep learning and NLP, and possibly provide some hand-on experience to implement the models. Afterward, we will review and discuss a collection of research papers on NLP with deep learning, including but not limited to the typical tasks of language modeling, question answering, information extraction, machine translation, natural language inference, dialog, summarization, domain adaptation, transfer learning, and multilingual learning. NLP is growing fast these days and we expect to read many exciting recent papers in this field.

Instructor

Thien Huu Nguyen, [email protected]

Lectures

One 90-minute lectures is delivered each week.

Prerequisites

Textbooks and Readings

Major Topics

Expected Learning Outcomes

In this seminar, we will review and discuss a collection of research papers on NLP with deep learning, including but not limited to the typical tasks of language modeling, question answering, information extraction, natural language inference, dialog and summarization.

Upon successful completion of the course, students will be able to:

Acquired Skills

Upon successful completion of the course, students will have acquired the following skills:

Class Organization

Each student in the class will need to choose some paper(s) on particular topics, present them in one of the classes and lead the discussion on the topic.

Each presentation will be given 35 minutes along with 5-10 minutes for discussions. More discussion is encouraged on Piazza.

After the presentation, the presenter should submit a summary about the presented paper/topic. The summary should follow the NAACL format (a.k.a. ACL style for 8.5x11 paper). The required length of the summary is between 2 and 3 pages.

For each presentation, we will have one student (other than the presenter) to serve as the reviewer. The role of the reviewer is to provide judgement/comments/suggestion or ask questions about the paper/topic in the discussion time after the presentation. Although all students need to read the papers being presented before each class to be able to actively contribute to the discussions, the reviewer would help to provide deeper judgement by reading and thinking critically about the papers/topics ahead of time.

IMPORTANT:

Please select a paper you want to present in this list. Write your name next to the papers you select (for presentations and reviews). You are welcome to choose another paper that is not in the list. Please talk with the instructor if you want to do this. All the paper assignments should be done before April 6 (no later than that) so we can schedule the presentations.

Tentative Schedule

Please sign up on Piazza for discussions.
Week Topics Presenter Slides Reviewer
04/02 Introduction to Modern Natural Language Processing, Low-source Scenarios and Techniques in NLP Thien
04/09 SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis (ACL 2020) Amir Link Qiuhao
Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing (EACL 2021, Demo) Minh Link Minh's work
04/16 Neural Extractive Text Summarization with Syntactic Compression (EMNLP 2019) Adib Link Rasti
Enabling Language Models to Fill in the Blanks (ACL 2020) Rasti Link Adib
04/23 DeFormer: Decomposing Pre-trained Transformersfor Faster Question Answering (ACL 2020) Minh Link Gong
Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning (ACL 2020) Luis Link Minh
04/30 Analyzing Individual Neurons in Pre-trained Language Models (EMNLP 2020) Kai - Zayd
Customizing Triggers with Concealed Data Poisoning (NAACL 2021) Zayd Link Kai
05/07 Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations (EMNLP 2020) Amir - Luis
Relation Extraction with Explanation (EMNLP 2020) Qiuhao Link Sam
05/14 GraphIE: A Graph-Based Framework for Information Extraction (NAACL 2019) Sam - Amir
What Does BERT with Vision Look At? (ACL 2020) Isaac -
05/21 XtremeDistil: Multi-stage Distillation for Massive Multilingual Models (ACL 2020) Gong - Isaac
Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus (ACL 2020) Luis -
05/28 Matching the Blanks: Distributional Similarity for Relation Learning (ACL 2019) Viet -
06/04 TBD TBD - TBD
TBD TBD - TBD

Course Requirements and Grading

This REMOTE course will be taught entirely using Zoom, Canvas, and Piazza.

Grading will be based on P/NP.