Everything Happens for a Reason - Discovering the Purpose of Actions in Procedural Text

2019 • EMNLP 2019 • Procedural Text • Relation Learning • Relational Learning • AI • Dataset • EMNLP • Graph • NLP • Reasoning

12 Dec 2019

Introduction

Procedural text comprehension tasks focus on modeling the effect of actions and predicting what happens next.
But they do not consider why some actions need to happen before other actions.
The paper proposes a new model called XPAD (eXPlainable Action Dependency) that considers the purpose of actions while predicting their effect.
The model favors effects that:
- explain more of actions in the text.
- are more plausible given the context.
An existing procedural text benchmark dataset (Propara) is expanded by adding the task of explaining actions by predicting their dependencies.
Link to the paper
Link to the dataset

Setup

Input
- Procedural (chronologically ordered text) sequence of T sentences.
- List of N participant entities, whose state changes at some step.
Output
- State change matrix $\pi(T \times N)$ with four possible states - move, create destroy, none.
- This matrix tracks how property changes after each step.
Dependency Explanation Graph
- Identify what steps are necessary to execute a given step (say s_i) and represent this dependency in the form of a dependency explanation graph G = <S, E>.
- In this graph, each node is a step and the direction of edge describes the order of dependency.

Dependency Graph Dataset

Propara dataset is expanded to extract the dependency graph using both heuristic and automated methods.
The automated method is based on the coherence assumption that if step s_j changes state of entity e_k then s_j is a precondition for the first subsequent step that changes the state of e_k.

XPAD Model

The model is based on the ProStruct system and uses an encoder-decoder based architecture.
Encoder
- Input: Sentence s_t and entity e_j.
- Sentence is encoded using the GloVe vectors and a BiLSTM model and the entity is encoded as an indicator variable.
- The combined representation is denoted as c_tj.
- This representation is passed through an MLP to generate k logits that encode the probability of each entity j undergoing a state change at step t.
Decoder
- Beam search is performed to decode the encoder representation into the state change matrix and dependency graph using a score function that ensures global consistency.
- Score function has two components:
  - State change score - depends on the likelihood that the selected state changes at step t given the text and state change history from steps s₁ to s_t-1.
  - Dependency graph score
    - This is based on the connectivity and likelihood of the resulting dependency explanation graph.
    - This score is used to bias the graph search towards:
      - predictions that have an identifiable purpose ie checking if a particular state change prediction leads to a connection in the dependency explanation graph.
      - graphs that are more likely according to the background knowledge to distinguish likely dependency links from the unlikely ones.
During training, XPAD has access to the correct path (in the search space) and learns to minimize the joint loss corresponding to predicting the state change and the dependency explanation graph.
During testing, XPAD performs beam search to predict the most likely state change and dependency explanation graph.

Experiments

Tasks:
- State change prediction
- Dependency explanation prediction
Baselines:
XPAD significantly outperforms all the baseline models on the dependency explanation task.
Improvements on the state change prediction task are less significant.
Removing dependency graph scores from XPAD leads to a drop in the F1 score.
The paper provides an elaborate discussion on the different types of errors that the XPAD system makes.

Papers I Read Notes and Summaries

Everything Happens for a Reason - Discovering the Purpose of Actions in Procedural Text

Introduction

Setup

Dependency Graph Dataset

XPAD Model

Experiments

Related Posts

Toolformer - Language Models Can Teach Themselves to Use Tools 10 Feb 2023

Synthesized Policies for Transfer and Adaptation across Tasks and Environments 29 Mar 2021

Deep Neural Networks for YouTube Recommendations 22 Mar 2021