COINS: dynamically generating COntextualized Inference rules for Narrative Story completion

Despite recent successes of large pre-trained language models in solving reasoning tasks, their inference capabilities remain opaque. We posit that such models can be made more interpretable by explicitly generating interim inference rules, and using them to guide the generation of task-specific tex...

Full description

Saved in:
Bibliographic Details
Main Authors: Paul, Debjit (Author) , Frank, Anette (Author)
Format: Chapter/Article Conference Paper
Language:English
Published: August 2021
In: The 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing - proceedings of the conference ; Vol. 1: Long papers
Year: 2021, Pages: 5086-5099
DOI:10.18653/v1/2021.acl-long.395
Online Access:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.18653/v1/2021.acl-long.395
Verlag, lizenzpflichtig, Volltext: https://aclanthology.org/2021.acl-long.395
Get full text
Author Notes:Debjit Paul, Anette Frank
Description
Summary:Despite recent successes of large pre-trained language models in solving reasoning tasks, their inference capabilities remain opaque. We posit that such models can be made more interpretable by explicitly generating interim inference rules, and using them to guide the generation of task-specific textual outputs. In this paper we present Coins, a recursive inference framework that i) iteratively reads context sentences, ii) dynamically generates contextualized inference rules, encodes them, and iii) uses them to guide task-specific output generation. We apply to a Narrative Story Completion task that asks a model to complete a story with missing sentences, to produce a coherent story with plausible logical connections, causal relationships, and temporal dependencies. By modularizing inference and sentence generation steps in a recurrent model, we aim to make reasoning steps and their effects on next sentence generation transparent. Our automatic and manual evaluations show that the model generates better story sentences than SOTA baselines, especially in terms of coherence. We further demonstrate improved performance over strong pre-trained LMs in generating commonsense inference rules. The recursive nature of holds the potential for controlled generation of longer sequences.
Item Description:Gesehen am 10.07.2023
Physical Description:Online Resource
ISBN:9781954085527
DOI:10.18653/v1/2021.acl-long.395